2025-05-07T19:42:35.6848994Z Current runner version: '2.323.0' 2025-05-07T19:42:35.6854685Z Runner name: 'i-066c3e49b24aebf4f' 2025-05-07T19:42:35.6855589Z Machine name: 'ip-10-0-69-70' 2025-05-07T19:42:35.6858141Z ##[group]GITHUB_TOKEN Permissions 2025-05-07T19:42:35.6860267Z Contents: read 2025-05-07T19:42:35.6860780Z Metadata: read 2025-05-07T19:42:35.6861587Z Packages: read 2025-05-07T19:42:35.6862113Z ##[endgroup] 2025-05-07T19:42:35.6864682Z Secret source: None 2025-05-07T19:42:35.6866121Z Prepare workflow directory 2025-05-07T19:42:35.7482786Z Prepare all required actions 2025-05-07T19:42:35.7522463Z Getting action download info 2025-05-07T19:42:36.3462352Z Download action repository 'actions/checkout@v4' (SHA:11bd71901bbe5b1630ceea73d27597364c9af683) 2025-05-07T19:42:36.6094758Z Download action repository 'actions/upload-artifact@v4' (SHA:ea165f8d65b6e75b540449e92b4886f43607fa02) 2025-05-07T19:42:37.1252635Z Complete job name: build_artifact (x86, linux.24xlarge, default, 3.13, 11.8.0, gcc) 2025-05-07T19:42:37.2138338Z A job started hook has been configured by the self-hosted runner administrator 2025-05-07T19:42:37.2274769Z ##[group]Run '/home/ec2-user/runner-scripts/before_job.sh' 2025-05-07T19:42:37.2285804Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:42:37.2287304Z ##[endgroup] 2025-05-07T19:42:38.3689230Z Runner Type: linux.24xlarge 2025-05-07T19:42:38.3689698Z Instance Type: c5.24xlarge 2025-05-07T19:42:38.3689994Z AMI Name: unknown 2025-05-07T19:42:38.3721797Z AMI ID: ami-071226ecf16aa7d96 2025-05-07T19:42:43.4489780Z ##[group]Checking docker version 2025-05-07T19:42:43.4503626Z ##[command]/usr/bin/docker version --format '{{.Server.APIVersion}}' 2025-05-07T19:42:43.4716411Z '1.44' 2025-05-07T19:42:43.4741513Z Docker daemon API version: '1.44' 2025-05-07T19:42:43.4742154Z ##[command]/usr/bin/docker version --format '{{.Client.APIVersion}}' 2025-05-07T19:42:43.4953887Z '1.44' 2025-05-07T19:42:43.4968329Z Docker client API version: '1.44' 2025-05-07T19:42:43.4973919Z ##[endgroup] 2025-05-07T19:42:43.4976493Z ##[group]Clean up resources from previous jobs 2025-05-07T19:42:43.4980852Z ##[command]/usr/bin/docker ps --all --quiet --no-trunc --filter "label=3417bb" 2025-05-07T19:42:43.5149242Z ##[command]/usr/bin/docker network prune --force --filter "label=3417bb" 2025-05-07T19:42:43.5302694Z ##[endgroup] 2025-05-07T19:42:43.5303086Z ##[group]Create local container network 2025-05-07T19:42:43.5314138Z ##[command]/usr/bin/docker network create --label 3417bb github_network_0023449067dc45b89da1976825adb551 2025-05-07T19:42:43.8621098Z cf88abac00020fb013119cdd7e0ad3906cf0193ff06ac59bae82e8b797fcba62 2025-05-07T19:42:43.8641418Z ##[endgroup] 2025-05-07T19:42:43.8673450Z ##[group]Starting job container 2025-05-07T19:42:43.8696137Z ##[command]/usr/bin/docker pull amazonlinux:2023 2025-05-07T19:42:43.9750498Z 2023: Pulling from library/amazonlinux 2025-05-07T19:42:43.9872509Z Digest: sha256:cb5b4c509d62ae388f674c139ae5e8281fc160c217d474445e912043e1941988 2025-05-07T19:42:43.9873152Z Status: Image is up to date for amazonlinux:2023 2025-05-07T19:42:43.9896802Z docker.io/library/amazonlinux:2023 2025-05-07T19:42:43.9987436Z ##[command]/usr/bin/docker create --name 1f736eeadaf44e318f4e00a477aace86_amazonlinux2023_ac52de --label 3417bb --workdir /__w/FBGEMM/FBGEMM --network github_network_0023449067dc45b89da1976825adb551 --user root -e "HOME=/github/home" -e GITHUB_ACTIONS=true -e CI=true -v "/var/run/docker.sock":"/var/run/docker.sock" -v "/home/ec2-user/actions-runner/_work":"/__w" -v "/home/ec2-user/actions-runner/externals":"/__e":ro -v "/home/ec2-user/actions-runner/_work/_temp":"/__w/_temp" -v "/home/ec2-user/actions-runner/_work/_actions":"/__w/_actions" -v "/home/ec2-user/actions-runner/_work/_tool":"/__w/_tool" -v "/home/ec2-user/actions-runner/_work/_temp/_github_home":"/github/home" -v "/home/ec2-user/actions-runner/_work/_temp/_github_workflow":"/github/workflow" --entrypoint "tail" amazonlinux:2023 "-f" "/dev/null" 2025-05-07T19:42:44.0435710Z 3a46c886120491b2b40bc3a625109b99b6e6604702540dfc4f2e17198255ca9b 2025-05-07T19:42:44.0459576Z ##[command]/usr/bin/docker start 3a46c886120491b2b40bc3a625109b99b6e6604702540dfc4f2e17198255ca9b 2025-05-07T19:42:44.5918497Z 3a46c886120491b2b40bc3a625109b99b6e6604702540dfc4f2e17198255ca9b 2025-05-07T19:42:44.5940656Z ##[command]/usr/bin/docker ps --all --filter id=3a46c886120491b2b40bc3a625109b99b6e6604702540dfc4f2e17198255ca9b --filter status=running --no-trunc --format "{{.ID}} {{.Status}}" 2025-05-07T19:42:44.6086991Z 3a46c886120491b2b40bc3a625109b99b6e6604702540dfc4f2e17198255ca9b Up Less than a second 2025-05-07T19:42:44.6106090Z ##[command]/usr/bin/docker inspect --format "{{range .Config.Env}}{{println .}}{{end}}" 3a46c886120491b2b40bc3a625109b99b6e6604702540dfc4f2e17198255ca9b 2025-05-07T19:42:44.6246831Z HOME=/github/home 2025-05-07T19:42:44.6247291Z GITHUB_ACTIONS=true 2025-05-07T19:42:44.6247786Z CI=true 2025-05-07T19:42:44.6248260Z PATH=/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin 2025-05-07T19:42:44.6270782Z ##[endgroup] 2025-05-07T19:42:44.6280532Z ##[group]Waiting for all services to be ready 2025-05-07T19:42:44.6282552Z ##[endgroup] 2025-05-07T19:42:44.6364446Z ##[group]Run yum update -y; yum install -y binutils findutils git pciutils sudo tar wget which 2025-05-07T19:42:44.6365309Z yum update -y; yum install -y binutils findutils git pciutils sudo tar wget which 2025-05-07T19:42:44.6366140Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:42:44.6366549Z env: 2025-05-07T19:42:44.6366837Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:42:44.6367260Z BUILD_ENV: build_binary 2025-05-07T19:42:44.6367598Z BUILD_TARGET: default 2025-05-07T19:42:44.6367881Z BUILD_VARIANT: cuda 2025-05-07T19:42:44.6368269Z BUILD_CUDA_VERSION: 11.8.0 2025-05-07T19:42:44.6368560Z ##[endgroup] 2025-05-07T19:42:45.5326351Z Amazon Linux 2023 repository 63 MB/s | 37 MB 00:00 2025-05-07T19:42:52.1110011Z Last metadata expiration check: 0:00:07 ago on Wed May 7 19:42:45 2025. 2025-05-07T19:42:52.6660704Z Dependencies resolved. 2025-05-07T19:42:52.6835395Z Nothing to do. 2025-05-07T19:42:52.9297853Z Complete! 2025-05-07T19:42:52.9298818Z Last metadata expiration check: 0:00:07 ago on Wed May 7 19:42:45 2025. 2025-05-07T19:42:52.9925002Z Dependencies resolved. 2025-05-07T19:42:53.0152721Z ======================================================================================== 2025-05-07T19:42:53.0154534Z Package Arch Version Repository Size 2025-05-07T19:42:53.0156055Z ======================================================================================== 2025-05-07T19:42:53.0157825Z Installing: 2025-05-07T19:42:53.0159154Z binutils x86_64 2.41-50.amzn2023.0.3 amazonlinux 5.3 M 2025-05-07T19:42:53.0160301Z findutils x86_64 1:4.8.0-2.amzn2023.0.2 amazonlinux 539 k 2025-05-07T19:42:53.0160935Z git x86_64 2.47.1-1.amzn2023.0.2 amazonlinux 54 k 2025-05-07T19:42:53.0161825Z pciutils x86_64 3.7.0-3.amzn2023.0.2 amazonlinux 93 k 2025-05-07T19:42:53.0162365Z sudo x86_64 1.9.15-1.p5.amzn2023.0.1 amazonlinux 1.3 M 2025-05-07T19:42:53.0162923Z tar x86_64 2:1.34-1.amzn2023.0.4 amazonlinux 879 k 2025-05-07T19:42:53.0163444Z wget x86_64 1.21.3-1.amzn2023.0.4 amazonlinux 779 k 2025-05-07T19:42:53.0164002Z which x86_64 2.21-26.amzn2023.0.2 amazonlinux 42 k 2025-05-07T19:42:53.0164490Z Installing dependencies: 2025-05-07T19:42:53.0164934Z cracklib x86_64 2.9.6-27.amzn2023.0.2 amazonlinux 82 k 2025-05-07T19:42:53.0165539Z cyrus-sasl-lib x86_64 2.1.27-18.amzn2023.0.3 amazonlinux 786 k 2025-05-07T19:42:53.0166148Z elfutils-debuginfod-client x86_64 0.188-3.amzn2023.0.2 amazonlinux 41 k 2025-05-07T19:42:53.0166956Z git-core x86_64 2.47.1-1.amzn2023.0.2 amazonlinux 4.7 M 2025-05-07T19:42:53.0167771Z git-core-doc noarch 2.47.1-1.amzn2023.0.2 amazonlinux 2.8 M 2025-05-07T19:42:53.0168337Z gnutls x86_64 3.8.3-6.amzn2023.0.1 amazonlinux 1.1 M 2025-05-07T19:42:53.0168967Z groff-base x86_64 1.22.4-7.amzn2023.0.2 amazonlinux 1.0 M 2025-05-07T19:42:53.0169485Z gzip x86_64 1.12-1.amzn2023.0.1 amazonlinux 160 k 2025-05-07T19:42:53.0170022Z hwdata noarch 0.384-1.amzn2023.0.3 amazonlinux 1.6 M 2025-05-07T19:42:53.0170621Z jansson x86_64 2.14-0.amzn2023 amazonlinux 46 k 2025-05-07T19:42:53.0171198Z kmod-libs x86_64 29-2.amzn2023.0.5 amazonlinux 62 k 2025-05-07T19:42:53.0171749Z less x86_64 608-2.amzn2023.0.2 amazonlinux 168 k 2025-05-07T19:42:53.0172455Z libcbor x86_64 0.7.0-3.amzn2023.0.2 amazonlinux 57 k 2025-05-07T19:42:53.0173025Z libdb x86_64 5.3.28-49.amzn2023.0.2 amazonlinux 756 k 2025-05-07T19:42:53.0173550Z libeconf x86_64 0.4.0-1.amzn2023.0.3 amazonlinux 28 k 2025-05-07T19:42:53.0174275Z libedit x86_64 3.1-38.20210714cvs.amzn2023.0.2 amazonlinux 108 k 2025-05-07T19:42:53.0174861Z libfdisk x86_64 2.37.4-1.amzn2023.0.4 amazonlinux 153 k 2025-05-07T19:42:53.0175502Z libfido2 x86_64 1.10.0-2.amzn2023.0.2 amazonlinux 95 k 2025-05-07T19:42:53.0176147Z libmetalink x86_64 0.1.3-14.amzn2023.0.2 amazonlinux 31 k 2025-05-07T19:42:53.0176756Z libpwquality x86_64 1.4.4-6.amzn2023.0.2 amazonlinux 106 k 2025-05-07T19:42:53.0177314Z libsemanage x86_64 3.4-5.amzn2023.0.2 amazonlinux 121 k 2025-05-07T19:42:53.0177955Z libutempter x86_64 1.2.1-4.amzn2023.0.2 amazonlinux 26 k 2025-05-07T19:42:53.0178484Z nano x86_64 8.3-1.amzn2023 amazonlinux 706 k 2025-05-07T19:42:53.0179031Z ncurses x86_64 6.2-4.20200222.amzn2023.0.6 amazonlinux 394 k 2025-05-07T19:42:53.0293403Z nettle x86_64 3.10.1-1.amzn2023.0.1 amazonlinux 573 k 2025-05-07T19:42:53.0293944Z openldap x86_64 2.4.57-6.amzn2023.0.7 amazonlinux 256 k 2025-05-07T19:42:53.0294470Z openssh x86_64 8.7p1-8.amzn2023.0.14 amazonlinux 454 k 2025-05-07T19:42:53.0295183Z openssh-clients x86_64 8.7p1-8.amzn2023.0.14 amazonlinux 708 k 2025-05-07T19:42:53.0295717Z pam x86_64 1.5.1-8.amzn2023.0.4 amazonlinux 542 k 2025-05-07T19:42:53.0296328Z pciutils-libs x86_64 3.7.0-3.amzn2023.0.2 amazonlinux 41 k 2025-05-07T19:42:53.0296937Z perl-AutoLoader noarch 5.74-477.amzn2023.0.6 amazonlinux 22 k 2025-05-07T19:42:53.0297501Z perl-B x86_64 1.80-477.amzn2023.0.6 amazonlinux 179 k 2025-05-07T19:42:53.0298044Z perl-Carp noarch 1.50-458.amzn2023.0.2 amazonlinux 29 k 2025-05-07T19:42:53.0298681Z perl-Class-Struct noarch 0.66-477.amzn2023.0.6 amazonlinux 22 k 2025-05-07T19:42:53.0299279Z perl-Data-Dumper x86_64 2.174-460.amzn2023.0.2 amazonlinux 55 k 2025-05-07T19:42:53.0299848Z perl-Digest noarch 1.20-1.amzn2023.0.2 amazonlinux 26 k 2025-05-07T19:42:53.0300429Z perl-Digest-MD5 x86_64 2.58-2.amzn2023.0.2 amazonlinux 36 k 2025-05-07T19:42:53.0301028Z perl-DynaLoader x86_64 1.47-477.amzn2023.0.6 amazonlinux 26 k 2025-05-07T19:42:53.0301826Z perl-Encode x86_64 4:3.15-462.amzn2023.0.2 amazonlinux 1.7 M 2025-05-07T19:42:53.0302363Z perl-Errno x86_64 1.30-477.amzn2023.0.6 amazonlinux 15 k 2025-05-07T19:42:53.0302895Z perl-Error noarch 1:0.17029-5.amzn2023.0.2 amazonlinux 41 k 2025-05-07T19:42:53.0303464Z perl-Exporter noarch 5.74-459.amzn2023.0.2 amazonlinux 31 k 2025-05-07T19:42:53.0304009Z perl-Fcntl x86_64 1.13-477.amzn2023.0.6 amazonlinux 21 k 2025-05-07T19:42:53.0304591Z perl-File-Basename noarch 2.85-477.amzn2023.0.6 amazonlinux 18 k 2025-05-07T19:42:53.0305185Z perl-File-Find noarch 1.37-477.amzn2023.0.6 amazonlinux 26 k 2025-05-07T19:42:53.0305756Z perl-File-Path noarch 2.18-2.amzn2023.0.2 amazonlinux 36 k 2025-05-07T19:42:53.0306445Z perl-File-Temp noarch 1:0.231.100-2.amzn2023.0.2 amazonlinux 60 k 2025-05-07T19:42:53.0307119Z perl-File-stat noarch 1.09-477.amzn2023.0.6 amazonlinux 17 k 2025-05-07T19:42:53.0307704Z perl-FileHandle noarch 2.03-477.amzn2023.0.6 amazonlinux 16 k 2025-05-07T19:42:53.0308296Z perl-Getopt-Long noarch 1:2.52-2.amzn2023.0.2 amazonlinux 60 k 2025-05-07T19:42:53.0308907Z perl-Getopt-Std noarch 1.12-477.amzn2023.0.6 amazonlinux 16 k 2025-05-07T19:42:53.0309507Z perl-Git noarch 2.47.1-1.amzn2023.0.2 amazonlinux 42 k 2025-05-07T19:42:53.0310049Z perl-HTTP-Tiny noarch 0.078-1.amzn2023.0.3 amazonlinux 56 k 2025-05-07T19:42:53.0310590Z perl-IO x86_64 1.43-477.amzn2023.0.6 amazonlinux 87 k 2025-05-07T19:42:53.0311129Z perl-IPC-Open3 noarch 1.21-477.amzn2023.0.6 amazonlinux 23 k 2025-05-07T19:42:53.0311990Z perl-MIME-Base64 x86_64 3.16-2.amzn2023.0.2 amazonlinux 31 k 2025-05-07T19:42:53.0312582Z perl-Net-SSLeay x86_64 1.94-1.amzn2023.0.1 amazonlinux 392 k 2025-05-07T19:42:53.0313139Z perl-POSIX x86_64 1.94-477.amzn2023.0.6 amazonlinux 97 k 2025-05-07T19:42:53.0313706Z perl-PathTools x86_64 3.78-459.amzn2023.0.2 amazonlinux 85 k 2025-05-07T19:42:53.0314297Z perl-Pod-Escapes noarch 1:1.07-458.amzn2023.0.2 amazonlinux 20 k 2025-05-07T19:42:53.0314887Z perl-Pod-Perldoc noarch 3.28.01-459.amzn2023.0.3 amazonlinux 84 k 2025-05-07T19:42:53.0315491Z perl-Pod-Simple noarch 1:3.42-2.amzn2023.0.2 amazonlinux 215 k 2025-05-07T19:42:53.0316062Z perl-Pod-Usage noarch 4:2.01-2.amzn2023.0.2 amazonlinux 41 k 2025-05-07T19:42:53.0316674Z perl-Scalar-List-Utils x86_64 4:1.56-459.amzn2023.0.2 amazonlinux 71 k 2025-05-07T19:42:53.0317289Z perl-SelectSaver noarch 1.02-477.amzn2023.0.6 amazonlinux 12 k 2025-05-07T19:42:53.0317875Z perl-Socket x86_64 4:2.032-1.amzn2023.0.2 amazonlinux 55 k 2025-05-07T19:42:53.0318433Z perl-Storable x86_64 1:3.21-458.amzn2023.0.2 amazonlinux 96 k 2025-05-07T19:42:53.0318980Z perl-Symbol noarch 1.08-477.amzn2023.0.6 amazonlinux 15 k 2025-05-07T19:42:53.0319577Z perl-Term-ANSIColor noarch 5.01-459.amzn2023.0.2 amazonlinux 48 k 2025-05-07T19:42:53.0320164Z perl-Term-Cap noarch 1.17-458.amzn2023.0.2 amazonlinux 22 k 2025-05-07T19:42:53.0320752Z perl-TermReadKey x86_64 2.38-9.amzn2023.0.2 amazonlinux 36 k 2025-05-07T19:42:53.0321370Z perl-Text-ParseWords noarch 3.30-458.amzn2023.0.2 amazonlinux 17 k 2025-05-07T19:42:53.0321996Z perl-Text-Tabs+Wrap noarch 2021.0726-1.amzn2023.0.1 amazonlinux 22 k 2025-05-07T19:42:53.0322708Z perl-Time-Local noarch 2:1.300-5.amzn2023.0.2 amazonlinux 34 k 2025-05-07T19:42:53.0323264Z perl-URI noarch 5.09-1.amzn2023.0.2 amazonlinux 108 k 2025-05-07T19:42:53.0323815Z perl-base noarch 2.27-477.amzn2023.0.6 amazonlinux 17 k 2025-05-07T19:42:53.0324444Z perl-constant noarch 1.33-459.amzn2023.0.2 amazonlinux 23 k 2025-05-07T19:42:53.0324967Z perl-if noarch 0.60.800-477.amzn2023.0.6 amazonlinux 14 k 2025-05-07T19:42:53.0325483Z perl-interpreter x86_64 4:5.32.1-477.amzn2023.0.6 amazonlinux 71 k 2025-05-07T19:42:53.0325980Z perl-lib x86_64 0.65-477.amzn2023.0.6 amazonlinux 15 k 2025-05-07T19:42:53.0326474Z perl-libnet noarch 3.13-2.amzn2023.0.2 amazonlinux 126 k 2025-05-07T19:42:53.0326995Z perl-libs x86_64 4:5.32.1-477.amzn2023.0.6 amazonlinux 2.0 M 2025-05-07T19:42:53.0327520Z perl-mro x86_64 1.23-477.amzn2023.0.6 amazonlinux 29 k 2025-05-07T19:42:53.0328028Z perl-overload noarch 1.31-477.amzn2023.0.6 amazonlinux 46 k 2025-05-07T19:42:53.0328557Z perl-overloading noarch 0.02-477.amzn2023.0.6 amazonlinux 13 k 2025-05-07T19:42:53.0329097Z perl-parent noarch 1:0.238-458.amzn2023.0.2 amazonlinux 14 k 2025-05-07T19:42:53.0329631Z perl-podlators noarch 1:4.14-458.amzn2023.0.2 amazonlinux 112 k 2025-05-07T19:42:53.0330142Z perl-subs noarch 1.03-477.amzn2023.0.6 amazonlinux 12 k 2025-05-07T19:42:53.0330649Z perl-vars noarch 1.05-477.amzn2023.0.6 amazonlinux 13 k 2025-05-07T19:42:53.0331141Z shadow-utils x86_64 2:4.9-12.amzn2023.0.4 amazonlinux 1.1 M 2025-05-07T19:42:53.0331653Z systemd-libs x86_64 252.23-3.amzn2023 amazonlinux 613 k 2025-05-07T19:42:53.0332154Z util-linux x86_64 2.37.4-1.amzn2023.0.4 amazonlinux 2.2 M 2025-05-07T19:42:53.0332648Z util-linux-core x86_64 2.37.4-1.amzn2023.0.4 amazonlinux 432 k 2025-05-07T19:42:53.0333065Z Installing weak dependencies: 2025-05-07T19:42:53.0333479Z nano-default-editor noarch 8.3-1.amzn2023 amazonlinux 10 k 2025-05-07T19:42:53.0334053Z perl-IO-Socket-IP noarch 0.41-3.amzn2023.0.2 amazonlinux 42 k 2025-05-07T19:42:53.0334597Z perl-IO-Socket-SSL noarch 2.075-1.amzn2023.0.2 amazonlinux 218 k 2025-05-07T19:42:53.0335151Z perl-Mozilla-CA noarch 20200520-4.amzn2023.0.2 amazonlinux 13 k 2025-05-07T19:42:53.0335681Z perl-NDBM_File x86_64 1.15-477.amzn2023.0.6 amazonlinux 23 k 2025-05-07T19:42:53.0336200Z sudo-python-plugin x86_64 1.9.15-1.p5.amzn2023.0.1 amazonlinux 56 k 2025-05-07T19:42:53.0336538Z 2025-05-07T19:42:53.0336628Z Transaction Summary 2025-05-07T19:42:53.0336892Z ======================================================================================== 2025-05-07T19:42:53.0337213Z Install 107 Packages 2025-05-07T19:42:53.0337350Z 2025-05-07T19:42:53.0337518Z Total download size: 38 M 2025-05-07T19:42:53.0337761Z Installed size: 151 M 2025-05-07T19:42:53.0338009Z Downloading Packages: 2025-05-07T19:42:53.3136783Z (1/107): cracklib-2.9.6-27.amzn2023.0.2.x86_64. 4.2 MB/s | 82 kB 00:00 2025-05-07T19:42:53.3241593Z (2/107): cyrus-sasl-lib-2.1.27-18.amzn2023.0.3. 26 MB/s | 786 kB 00:00 2025-05-07T19:42:53.3335539Z (3/107): elfutils-debuginfod-client-0.188-3.amz 2.1 MB/s | 41 kB 00:00 2025-05-07T19:42:53.3427858Z (4/107): findutils-4.8.0-2.amzn2023.0.2.x86_64. 50 MB/s | 539 kB 00:00 2025-05-07T19:42:53.3652097Z (5/107): binutils-2.41-50.amzn2023.0.3.x86_64.r 74 MB/s | 5.3 MB 00:00 2025-05-07T19:42:53.3661372Z (6/107): git-2.47.1-1.amzn2023.0.2.x86_64.rpm 1.7 MB/s | 54 kB 00:00 2025-05-07T19:42:53.3944324Z (7/107): git-core-2.47.1-1.amzn2023.0.2.x86_64. 93 MB/s | 4.7 MB 00:00 2025-05-07T19:42:53.4053561Z (8/107): gnutls-3.8.3-6.amzn2023.0.1.x86_64.rpm 33 MB/s | 1.1 MB 00:00 2025-05-07T19:42:53.4195670Z (9/107): git-core-doc-2.47.1-1.amzn2023.0.2.noa 59 MB/s | 2.8 MB 00:00 2025-05-07T19:42:53.4258108Z (10/107): groff-base-1.22.4-7.amzn2023.0.2.x86_ 39 MB/s | 1.0 MB 00:00 2025-05-07T19:42:53.4277486Z (11/107): gzip-1.12-1.amzn2023.0.1.x86_64.rpm 7.6 MB/s | 160 kB 00:00 2025-05-07T19:42:53.4449730Z (12/107): hwdata-0.384-1.amzn2023.0.3.noarch.rp 67 MB/s | 1.6 MB 00:00 2025-05-07T19:42:53.4464100Z (13/107): jansson-2.14-0.amzn2023.x86_64.rpm 2.9 MB/s | 46 kB 00:00 2025-05-07T19:42:53.4475273Z (14/107): kmod-libs-29-2.amzn2023.0.5.x86_64.rp 3.6 MB/s | 62 kB 00:00 2025-05-07T19:42:53.4562873Z (15/107): libcbor-0.7.0-3.amzn2023.0.2.x86_64.r 7.2 MB/s | 57 kB 00:00 2025-05-07T19:42:53.4602527Z (16/107): less-608-2.amzn2023.0.2.x86_64.rpm 14 MB/s | 168 kB 00:00 2025-05-07T19:42:53.4645132Z (17/107): libdb-5.3.28-49.amzn2023.0.2.x86_64.r 46 MB/s | 756 kB 00:00 2025-05-07T19:42:53.4667392Z (18/107): libeconf-0.4.0-1.amzn2023.0.3.x86_64. 3.1 MB/s | 28 kB 00:00 2025-05-07T19:42:53.4718179Z (19/107): libfdisk-2.37.4-1.amzn2023.0.4.x86_64 22 MB/s | 153 kB 00:00 2025-05-07T19:42:53.4746286Z (20/107): libedit-3.1-38.20210714cvs.amzn2023.0 11 MB/s | 108 kB 00:00 2025-05-07T19:42:53.4764249Z (21/107): libfido2-1.10.0-2.amzn2023.0.2.x86_64 9.7 MB/s | 95 kB 00:00 2025-05-07T19:42:53.4786082Z (22/107): libmetalink-0.1.3-14.amzn2023.0.2.x86 5.3 MB/s | 31 kB 00:00 2025-05-07T19:42:53.4850111Z (23/107): libpwquality-1.4.4-6.amzn2023.0.2.x86 13 MB/s | 106 kB 00:00 2025-05-07T19:42:53.4874850Z (24/107): libsemanage-3.4-5.amzn2023.0.2.x86_64 11 MB/s | 121 kB 00:00 2025-05-07T19:42:53.4891435Z (25/107): libutempter-1.2.1-4.amzn2023.0.2.x86_ 2.6 MB/s | 26 kB 00:00 2025-05-07T19:42:53.4978077Z (26/107): nano-8.3-1.amzn2023.x86_64.rpm 57 MB/s | 706 kB 00:00 2025-05-07T19:42:53.4998372Z (27/107): nano-default-editor-8.3-1.amzn2023.no 983 kB/s | 10 kB 00:00 2025-05-07T19:42:53.5034653Z (28/107): ncurses-6.2-4.20200222.amzn2023.0.6.x 28 MB/s | 394 kB 00:00 2025-05-07T19:42:53.5145678Z (29/107): nettle-3.10.1-1.amzn2023.0.1.x86_64.r 36 MB/s | 573 kB 00:00 2025-05-07T19:42:53.5187023Z (30/107): openldap-2.4.57-6.amzn2023.0.7.x86_64 18 MB/s | 256 kB 00:00 2025-05-07T19:42:53.5225959Z (31/107): openssh-8.7p1-8.amzn2023.0.14.x86_64. 25 MB/s | 454 kB 00:00 2025-05-07T19:42:53.5290757Z (32/107): openssh-clients-8.7p1-8.amzn2023.0.14 52 MB/s | 708 kB 00:00 2025-05-07T19:42:53.5336150Z (33/107): pciutils-3.7.0-3.amzn2023.0.2.x86_64. 8.8 MB/s | 93 kB 00:00 2025-05-07T19:42:53.5381455Z (34/107): pam-1.5.1-8.amzn2023.0.4.x86_64.rpm 35 MB/s | 542 kB 00:00 2025-05-07T19:42:53.5400778Z (35/107): pciutils-libs-3.7.0-3.amzn2023.0.2.x8 3.7 MB/s | 41 kB 00:00 2025-05-07T19:42:53.5420978Z (36/107): perl-AutoLoader-5.74-477.amzn2023.0.6 2.8 MB/s | 22 kB 00:00 2025-05-07T19:42:53.5449539Z (37/107): perl-B-1.80-477.amzn2023.0.6.x86_64.r 28 MB/s | 179 kB 00:00 2025-05-07T19:42:53.5465443Z (38/107): perl-Carp-1.50-458.amzn2023.0.2.noarc 4.9 MB/s | 29 kB 00:00 2025-05-07T19:42:53.5477472Z (39/107): perl-Class-Struct-0.66-477.amzn2023.0 4.1 MB/s | 22 kB 00:00 2025-05-07T19:42:53.5500321Z (40/107): perl-Data-Dumper-2.174-460.amzn2023.0 11 MB/s | 55 kB 00:00 2025-05-07T19:42:53.5518899Z (41/107): perl-Digest-1.20-1.amzn2023.0.2.noarc 4.8 MB/s | 26 kB 00:00 2025-05-07T19:42:53.5542043Z (42/107): perl-Digest-MD5-2.58-2.amzn2023.0.2.x 6.4 MB/s | 36 kB 00:00 2025-05-07T19:42:53.5556551Z (43/107): perl-DynaLoader-1.47-477.amzn2023.0.6 5.0 MB/s | 26 kB 00:00 2025-05-07T19:42:53.5681712Z (44/107): perl-Encode-3.15-462.amzn2023.0.2.x86 107 MB/s | 1.7 MB 00:00 2025-05-07T19:42:53.5698123Z (45/107): perl-Errno-1.30-477.amzn2023.0.6.x86_ 999 kB/s | 15 kB 00:00 2025-05-07T19:42:53.5717791Z (46/107): perl-Error-0.17029-5.amzn2023.0.2.noa 2.7 MB/s | 41 kB 00:00 2025-05-07T19:42:53.5742942Z (47/107): perl-Exporter-5.74-459.amzn2023.0.2.n 5.7 MB/s | 31 kB 00:00 2025-05-07T19:42:53.5770300Z (48/107): perl-Fcntl-1.13-477.amzn2023.0.6.x86_ 4.9 MB/s | 21 kB 00:00 2025-05-07T19:42:53.5788063Z (49/107): perl-File-Basename-2.85-477.amzn2023. 2.9 MB/s | 18 kB 00:00 2025-05-07T19:42:53.5801945Z (50/107): perl-File-Find-1.37-477.amzn2023.0.6. 4.2 MB/s | 26 kB 00:00 2025-05-07T19:42:53.5823066Z (51/107): perl-File-Path-2.18-2.amzn2023.0.2.no 7.3 MB/s | 36 kB 00:00 2025-05-07T19:42:53.5846158Z (52/107): perl-File-Temp-0.231.100-2.amzn2023.0 11 MB/s | 60 kB 00:00 2025-05-07T19:42:53.5878056Z (53/107): perl-File-stat-1.09-477.amzn2023.0.6. 2.5 MB/s | 17 kB 00:00 2025-05-07T19:42:53.5895480Z (54/107): perl-FileHandle-2.03-477.amzn2023.0.6 2.3 MB/s | 16 kB 00:00 2025-05-07T19:42:53.5918000Z (55/107): perl-Getopt-Long-2.52-2.amzn2023.0.2. 8.8 MB/s | 60 kB 00:00 2025-05-07T19:42:53.5941221Z (56/107): perl-Getopt-Std-1.12-477.amzn2023.0.6 2.9 MB/s | 16 kB 00:00 2025-05-07T19:42:53.5956918Z (57/107): perl-Git-2.47.1-1.amzn2023.0.2.noarch 7.5 MB/s | 42 kB 00:00 2025-05-07T19:42:53.5992929Z (58/107): perl-HTTP-Tiny-0.078-1.amzn2023.0.3.n 8.3 MB/s | 56 kB 00:00 2025-05-07T19:42:53.6016241Z (59/107): perl-IO-1.43-477.amzn2023.0.6.x86_64. 12 MB/s | 87 kB 00:00 2025-05-07T19:42:53.6025562Z (60/107): perl-IO-Socket-IP-0.41-3.amzn2023.0.2 6.4 MB/s | 42 kB 00:00 2025-05-07T19:42:53.6069458Z (61/107): perl-IO-Socket-SSL-2.075-1.amzn2023.0 31 MB/s | 218 kB 00:00 2025-05-07T19:42:53.6089254Z (62/107): perl-IPC-Open3-1.21-477.amzn2023.0.6. 4.3 MB/s | 23 kB 00:00 2025-05-07T19:42:53.6110973Z (63/107): perl-MIME-Base64-3.16-2.amzn2023.0.2. 4.2 MB/s | 31 kB 00:00 2025-05-07T19:42:53.6129385Z (64/107): perl-Mozilla-CA-20200520-4.amzn2023.0 2.3 MB/s | 13 kB 00:00 2025-05-07T19:42:53.6150524Z (65/107): perl-NDBM_File-1.15-477.amzn2023.0.6. 4.1 MB/s | 23 kB 00:00 2025-05-07T19:42:53.6194119Z (66/107): perl-Net-SSLeay-1.94-1.amzn2023.0.1.x 48 MB/s | 392 kB 00:00 2025-05-07T19:42:53.6220710Z (67/107): perl-POSIX-1.94-477.amzn2023.0.6.x86_ 11 MB/s | 97 kB 00:00 2025-05-07T19:42:53.6234816Z (68/107): perl-PathTools-3.78-459.amzn2023.0.2. 11 MB/s | 85 kB 00:00 2025-05-07T19:42:53.6255096Z (69/107): perl-Pod-Escapes-1.07-458.amzn2023.0. 3.8 MB/s | 20 kB 00:00 2025-05-07T19:42:53.6291509Z (70/107): perl-Pod-Perldoc-3.28.01-459.amzn2023 16 MB/s | 84 kB 00:00 2025-05-07T19:42:53.6323545Z (71/107): perl-Pod-Simple-3.42-2.amzn2023.0.2.n 26 MB/s | 215 kB 00:00 2025-05-07T19:42:53.6339729Z (72/107): perl-Pod-Usage-2.01-2.amzn2023.0.2.no 4.9 MB/s | 41 kB 00:00 2025-05-07T19:42:53.6363219Z (73/107): perl-Scalar-List-Utils-1.56-459.amzn2 11 MB/s | 71 kB 00:00 2025-05-07T19:42:53.6394320Z (74/107): perl-SelectSaver-1.02-477.amzn2023.0. 2.6 MB/s | 12 kB 00:00 2025-05-07T19:42:53.6420097Z (75/107): perl-Socket-2.032-1.amzn2023.0.2.x86_ 7.6 MB/s | 55 kB 00:00 2025-05-07T19:42:53.6441305Z (76/107): perl-Storable-3.21-458.amzn2023.0.2.x 13 MB/s | 96 kB 00:00 2025-05-07T19:42:53.6464883Z (77/107): perl-Symbol-1.08-477.amzn2023.0.6.noa 2.3 MB/s | 15 kB 00:00 2025-05-07T19:42:53.6481082Z (78/107): perl-Term-ANSIColor-5.01-459.amzn2023 8.5 MB/s | 48 kB 00:00 2025-05-07T19:42:53.6498269Z (79/107): perl-Term-Cap-1.17-458.amzn2023.0.2.n 4.1 MB/s | 22 kB 00:00 2025-05-07T19:42:53.6532113Z (80/107): perl-Text-ParseWords-3.30-458.amzn202 3.6 MB/s | 17 kB 00:00 2025-05-07T19:42:53.6548074Z (81/107): perl-TermReadKey-2.38-9.amzn2023.0.2. 4.5 MB/s | 36 kB 00:00 2025-05-07T19:42:53.6562607Z (82/107): perl-Text-Tabs+Wrap-2021.0726-1.amzn2 3.6 MB/s | 22 kB 00:00 2025-05-07T19:42:53.6584600Z (83/107): perl-Time-Local-1.300-5.amzn2023.0.2. 6.9 MB/s | 34 kB 00:00 2025-05-07T19:42:53.6611843Z (84/107): perl-URI-5.09-1.amzn2023.0.2.noarch.r 19 MB/s | 108 kB 00:00 2025-05-07T19:42:53.6631474Z (85/107): perl-base-2.27-477.amzn2023.0.6.noarc 2.7 MB/s | 17 kB 00:00 2025-05-07T19:42:53.6649740Z (86/107): perl-constant-1.33-459.amzn2023.0.2.n 3.7 MB/s | 23 kB 00:00 2025-05-07T19:42:53.6671004Z (87/107): perl-if-0.60.800-477.amzn2023.0.6.noa 2.6 MB/s | 14 kB 00:00 2025-05-07T19:42:53.6692368Z (88/107): perl-interpreter-5.32.1-477.amzn2023. 13 MB/s | 71 kB 00:00 2025-05-07T19:42:53.6714541Z (89/107): perl-lib-0.65-477.amzn2023.0.6.x86_64 2.5 MB/s | 15 kB 00:00 2025-05-07T19:42:53.6750224Z (90/107): perl-libnet-3.13-2.amzn2023.0.2.noarc 16 MB/s | 126 kB 00:00 2025-05-07T19:42:53.6883980Z (91/107): perl-libs-5.32.1-477.amzn2023.0.6.x86 107 MB/s | 2.0 MB 00:00 2025-05-07T19:42:53.6899967Z (92/107): perl-mro-1.23-477.amzn2023.0.6.x86_64 1.5 MB/s | 29 kB 00:00 2025-05-07T19:42:53.6915548Z (93/107): perl-overload-1.31-477.amzn2023.0.6.n 2.8 MB/s | 46 kB 00:00 2025-05-07T19:42:53.6944636Z (94/107): perl-overloading-0.02-477.amzn2023.0. 2.4 MB/s | 13 kB 00:00 2025-05-07T19:42:53.6978740Z (95/107): perl-podlators-4.14-458.amzn2023.0.2. 20 MB/s | 112 kB 00:00 2025-05-07T19:42:53.7006288Z (96/107): perl-subs-1.03-477.amzn2023.0.6.noarc 2.0 MB/s | 12 kB 00:00 2025-05-07T19:42:53.7023459Z (97/107): perl-parent-0.238-458.amzn2023.0.2.no 1.4 MB/s | 14 kB 00:00 2025-05-07T19:42:53.7037136Z (98/107): perl-vars-1.05-477.amzn2023.0.6.noarc 2.3 MB/s | 13 kB 00:00 2025-05-07T19:42:53.7129306Z (99/107): shadow-utils-4.9-12.amzn2023.0.4.x86_ 93 MB/s | 1.1 MB 00:00 2025-05-07T19:42:53.7217055Z (100/107): sudo-1.9.15-1.p5.amzn2023.0.1.x86_64 67 MB/s | 1.3 MB 00:00 2025-05-07T19:42:53.7234604Z (101/107): sudo-python-plugin-1.9.15-1.p5.amzn2 3.0 MB/s | 56 kB 00:00 2025-05-07T19:42:53.7284776Z (102/107): systemd-libs-252.23-3.amzn2023.x86_6 44 MB/s | 613 kB 00:00 2025-05-07T19:42:53.7371681Z (103/107): tar-1.34-1.amzn2023.0.4.x86_64.rpm 57 MB/s | 879 kB 00:00 2025-05-07T19:42:53.7496067Z (104/107): util-linux-2.37.4-1.amzn2023.0.4.x86 87 MB/s | 2.2 MB 00:00 2025-05-07T19:42:53.7551010Z (105/107): util-linux-core-2.37.4-1.amzn2023.0. 19 MB/s | 432 kB 00:00 2025-05-07T19:42:53.7595709Z (106/107): wget-1.21.3-1.amzn2023.0.4.x86_64.rp 35 MB/s | 779 kB 00:00 2025-05-07T19:42:53.7613965Z (107/107): which-2.21-26.amzn2023.0.2.x86_64.rp 4.3 MB/s | 42 kB 00:00 2025-05-07T19:42:53.7630770Z -------------------------------------------------------------------------------- 2025-05-07T19:42:53.7632471Z Total 51 MB/s | 38 MB 00:00 2025-05-07T19:42:54.8072422Z Running transaction check 2025-05-07T19:42:54.8526763Z Transaction check succeeded. 2025-05-07T19:42:54.8527161Z Running transaction test 2025-05-07T19:42:55.2186461Z Transaction test succeeded. 2025-05-07T19:42:55.2187285Z Running transaction 2025-05-07T19:42:56.1337208Z Preparing : 1/1 2025-05-07T19:42:56.1508464Z Installing : systemd-libs-252.23-3.amzn2023.x86_64 1/107 2025-05-07T19:42:56.1765146Z Installing : nettle-3.10.1-1.amzn2023.0.1.x86_64 2/107 2025-05-07T19:42:56.1995434Z Installing : gnutls-3.8.3-6.amzn2023.0.1.x86_64 3/107 2025-05-07T19:42:56.2071963Z Installing : util-linux-core-2.37.4-1.amzn2023.0.4.x86_64 4/107 2025-05-07T19:42:56.2141261Z Running scriptlet: util-linux-core-2.37.4-1.amzn2023.0.4.x86_64 4/107 2025-05-07T19:42:56.2244730Z Installing : pciutils-libs-3.7.0-3.amzn2023.0.2.x86_64 5/107 2025-05-07T19:42:56.2545901Z Installing : ncurses-6.2-4.20200222.amzn2023.0.6.x86_64 6/107 2025-05-07T19:42:56.2627805Z Installing : nano-8.3-1.amzn2023.x86_64 7/107 2025-05-07T19:42:56.2693823Z Installing : nano-default-editor-8.3-1.amzn2023.noarch 8/107 2025-05-07T19:42:56.3216506Z Installing : libsemanage-3.4-5.amzn2023.0.2.x86_64 9/107 2025-05-07T19:42:56.3309800Z Installing : shadow-utils-2:4.9-12.amzn2023.0.4.x86_64 10/107 2025-05-07T19:42:56.3769502Z Running scriptlet: libutempter-1.2.1-4.amzn2023.0.2.x86_64 11/107 2025-05-07T19:42:56.3837966Z Installing : libutempter-1.2.1-4.amzn2023.0.2.x86_64 11/107 2025-05-07T19:42:56.3915311Z Installing : libmetalink-0.1.3-14.amzn2023.0.2.x86_64 12/107 2025-05-07T19:42:56.3988744Z Installing : libfdisk-2.37.4-1.amzn2023.0.4.x86_64 13/107 2025-05-07T19:42:56.4050815Z Installing : libedit-3.1-38.20210714cvs.amzn2023.0.2.x86_64 14/107 2025-05-07T19:42:56.4201037Z Installing : libeconf-0.4.0-1.amzn2023.0.3.x86_64 15/107 2025-05-07T19:42:56.4270378Z Installing : libdb-5.3.28-49.amzn2023.0.2.x86_64 16/107 2025-05-07T19:42:56.4338855Z Installing : libcbor-0.7.0-3.amzn2023.0.2.x86_64 17/107 2025-05-07T19:42:56.4422356Z Installing : libfido2-1.10.0-2.amzn2023.0.2.x86_64 18/107 2025-05-07T19:42:56.4490416Z Installing : less-608-2.amzn2023.0.2.x86_64 19/107 2025-05-07T19:42:56.4546760Z Installing : kmod-libs-29-2.amzn2023.0.5.x86_64 20/107 2025-05-07T19:42:56.4986384Z Installing : jansson-2.14-0.amzn2023.x86_64 21/107 2025-05-07T19:42:56.5074462Z Installing : hwdata-0.384-1.amzn2023.0.3.noarch 22/107 2025-05-07T19:42:56.5231157Z Installing : gzip-1.12-1.amzn2023.0.1.x86_64 23/107 2025-05-07T19:42:56.5676260Z Installing : cracklib-2.9.6-27.amzn2023.0.2.x86_64 24/107 2025-05-07T19:42:56.5873129Z Installing : pam-1.5.1-8.amzn2023.0.4.x86_64 25/107 2025-05-07T19:42:56.6699051Z Installing : libpwquality-1.4.4-6.amzn2023.0.2.x86_64 26/107 2025-05-07T19:42:56.6700038Z Installing : util-linux-2.37.4-1.amzn2023.0.4.x86_64 27/107 2025-05-07T19:42:56.6700618Z warning: /etc/adjtime created as /etc/adjtime.rpmnew 2025-05-07T19:42:56.6700889Z 2025-05-07T19:42:56.6913563Z Running scriptlet: util-linux-2.37.4-1.amzn2023.0.4.x86_64 27/107 2025-05-07T19:42:56.7249582Z Running scriptlet: openssh-8.7p1-8.amzn2023.0.14.x86_64 28/107 2025-05-07T19:42:56.7442970Z Installing : openssh-8.7p1-8.amzn2023.0.14.x86_64 28/107 2025-05-07T19:42:56.7516000Z Installing : openssh-clients-8.7p1-8.amzn2023.0.14.x86_64 29/107 2025-05-07T19:42:56.8635805Z Running scriptlet: openssh-clients-8.7p1-8.amzn2023.0.14.x86_64 29/107 2025-05-07T19:42:57.0138613Z Installing : git-core-2.47.1-1.amzn2023.0.2.x86_64 30/107 2025-05-07T19:42:57.0275399Z Installing : git-core-doc-2.47.1-1.amzn2023.0.2.noarch 31/107 2025-05-07T19:42:57.0685760Z Running scriptlet: groff-base-1.22.4-7.amzn2023.0.2.x86_64 32/107 2025-05-07T19:42:57.0772662Z Installing : groff-base-1.22.4-7.amzn2023.0.2.x86_64 32/107 2025-05-07T19:42:57.0856855Z Running scriptlet: groff-base-1.22.4-7.amzn2023.0.2.x86_64 32/107 2025-05-07T19:42:57.0932959Z Installing : perl-Digest-1.20-1.amzn2023.0.2.noarch 33/107 2025-05-07T19:42:57.1018670Z Installing : perl-Digest-MD5-2.58-2.amzn2023.0.2.x86_64 34/107 2025-05-07T19:42:57.1078692Z Installing : perl-B-1.80-477.amzn2023.0.6.x86_64 35/107 2025-05-07T19:42:57.1131363Z Installing : perl-FileHandle-2.03-477.amzn2023.0.6.noarch 36/107 2025-05-07T19:42:57.1186223Z Installing : perl-AutoLoader-5.74-477.amzn2023.0.6.noarch 37/107 2025-05-07T19:42:57.1274465Z Installing : perl-Data-Dumper-2.174-460.amzn2023.0.2.x86_64 38/107 2025-05-07T19:42:57.1359413Z Installing : perl-libnet-3.13-2.amzn2023.0.2.noarch 39/107 2025-05-07T19:42:57.1465666Z Installing : perl-base-2.27-477.amzn2023.0.6.noarch 40/107 2025-05-07T19:42:57.1684733Z Installing : perl-URI-5.09-1.amzn2023.0.2.noarch 41/107 2025-05-07T19:42:57.1777574Z Installing : perl-Net-SSLeay-1.94-1.amzn2023.0.1.x86_64 42/107 2025-05-07T19:42:57.1826614Z Installing : perl-Text-Tabs+Wrap-2021.0726-1.amzn2023.0.1.noa 43/107 2025-05-07T19:42:57.1872897Z Installing : perl-Mozilla-CA-20200520-4.amzn2023.0.2.noarch 44/107 2025-05-07T19:42:57.1934398Z Installing : perl-if-0.60.800-477.amzn2023.0.6.noarch 45/107 2025-05-07T19:42:57.1990107Z Installing : perl-IO-Socket-IP-0.41-3.amzn2023.0.2.noarch 46/107 2025-05-07T19:42:57.2050428Z Installing : perl-Time-Local-2:1.300-5.amzn2023.0.2.noarch 47/107 2025-05-07T19:42:57.2143131Z Installing : perl-File-Path-2.18-2.amzn2023.0.2.noarch 48/107 2025-05-07T19:42:57.2214314Z Installing : perl-IO-Socket-SSL-2.075-1.amzn2023.0.2.noarch 49/107 2025-05-07T19:42:57.2262771Z Installing : perl-Pod-Escapes-1:1.07-458.amzn2023.0.2.noarch 50/107 2025-05-07T19:42:57.2321171Z Installing : perl-Class-Struct-0.66-477.amzn2023.0.6.noarch 51/107 2025-05-07T19:42:57.2382371Z Installing : perl-POSIX-1.94-477.amzn2023.0.6.x86_64 52/107 2025-05-07T19:42:57.2441869Z Installing : perl-Term-ANSIColor-5.01-459.amzn2023.0.2.noarch 53/107 2025-05-07T19:42:57.2490112Z Installing : perl-IPC-Open3-1.21-477.amzn2023.0.6.noarch 54/107 2025-05-07T19:42:57.2546114Z Installing : perl-subs-1.03-477.amzn2023.0.6.noarch 55/107 2025-05-07T19:42:57.2618718Z Installing : perl-File-Temp-1:0.231.100-2.amzn2023.0.2.noarch 56/107 2025-05-07T19:42:57.2675504Z Installing : perl-HTTP-Tiny-0.078-1.amzn2023.0.3.noarch 57/107 2025-05-07T19:42:57.2788893Z Installing : perl-Term-Cap-1.17-458.amzn2023.0.2.noarch 58/107 2025-05-07T19:42:57.2874224Z Installing : perl-Pod-Simple-1:3.42-2.amzn2023.0.2.noarch 59/107 2025-05-07T19:42:57.2935893Z Installing : perl-Socket-4:2.032-1.amzn2023.0.2.x86_64 60/107 2025-05-07T19:42:57.2990225Z Installing : perl-SelectSaver-1.02-477.amzn2023.0.6.noarch 61/107 2025-05-07T19:42:57.3038225Z Installing : perl-Symbol-1.08-477.amzn2023.0.6.noarch 62/107 2025-05-07T19:42:57.3119952Z Installing : perl-File-stat-1.09-477.amzn2023.0.6.noarch 63/107 2025-05-07T19:42:57.3220775Z Installing : perl-podlators-1:4.14-458.amzn2023.0.2.noarch 64/107 2025-05-07T19:42:57.3294621Z Installing : perl-Pod-Perldoc-3.28.01-459.amzn2023.0.3.noarch 65/107 2025-05-07T19:42:57.3352989Z Installing : perl-Fcntl-1.13-477.amzn2023.0.6.x86_64 66/107 2025-05-07T19:42:57.3410153Z Installing : perl-Text-ParseWords-3.30-458.amzn2023.0.2.noarc 67/107 2025-05-07T19:42:57.3480599Z Installing : perl-mro-1.23-477.amzn2023.0.6.x86_64 68/107 2025-05-07T19:42:57.3545005Z Installing : perl-IO-1.43-477.amzn2023.0.6.x86_64 69/107 2025-05-07T19:42:57.3598286Z Installing : perl-overloading-0.02-477.amzn2023.0.6.noarch 70/107 2025-05-07T19:42:57.3672586Z Installing : perl-Pod-Usage-4:2.01-2.amzn2023.0.2.noarch 71/107 2025-05-07T19:42:57.3719881Z Installing : perl-Errno-1.30-477.amzn2023.0.6.x86_64 72/107 2025-05-07T19:42:57.3774308Z Installing : perl-File-Basename-2.85-477.amzn2023.0.6.noarch 73/107 2025-05-07T19:42:57.3834427Z Installing : perl-Getopt-Std-1.12-477.amzn2023.0.6.noarch 74/107 2025-05-07T19:42:57.3912632Z Installing : perl-MIME-Base64-3.16-2.amzn2023.0.2.x86_64 75/107 2025-05-07T19:42:57.3993977Z Installing : perl-Scalar-List-Utils-4:1.56-459.amzn2023.0.2.x 76/107 2025-05-07T19:42:57.4059247Z Installing : perl-constant-1.33-459.amzn2023.0.2.noarch 77/107 2025-05-07T19:42:57.4125136Z Installing : perl-Storable-1:3.21-458.amzn2023.0.2.x86_64 78/107 2025-05-07T19:42:57.4180762Z Installing : perl-overload-1.31-477.amzn2023.0.6.noarch 79/107 2025-05-07T19:42:57.4231049Z Installing : perl-parent-1:0.238-458.amzn2023.0.2.noarch 80/107 2025-05-07T19:42:57.4293513Z Installing : perl-vars-1.05-477.amzn2023.0.6.noarch 81/107 2025-05-07T19:42:57.4343784Z Installing : perl-Getopt-Long-1:2.52-2.amzn2023.0.2.noarch 82/107 2025-05-07T19:42:57.4395332Z Installing : perl-DynaLoader-1.47-477.amzn2023.0.6.x86_64 83/107 2025-05-07T19:42:57.4448742Z Installing : perl-Carp-1.50-458.amzn2023.0.2.noarch 84/107 2025-05-07T19:42:57.4505140Z Installing : perl-Exporter-5.74-459.amzn2023.0.2.noarch 85/107 2025-05-07T19:42:57.4586201Z Installing : perl-NDBM_File-1.15-477.amzn2023.0.6.x86_64 86/107 2025-05-07T19:42:57.5121330Z Installing : perl-PathTools-3.78-459.amzn2023.0.2.x86_64 87/107 2025-05-07T19:42:57.6092770Z Installing : perl-Encode-4:3.15-462.amzn2023.0.2.x86_64 88/107 2025-05-07T19:42:57.6224363Z Installing : perl-libs-4:5.32.1-477.amzn2023.0.6.x86_64 89/107 2025-05-07T19:42:57.6309831Z Installing : perl-interpreter-4:5.32.1-477.amzn2023.0.6.x86_6 90/107 2025-05-07T19:42:57.6384016Z Installing : perl-Error-1:0.17029-5.amzn2023.0.2.noarch 91/107 2025-05-07T19:42:57.6453489Z Installing : perl-File-Find-1.37-477.amzn2023.0.6.noarch 92/107 2025-05-07T19:42:57.6522343Z Installing : perl-TermReadKey-2.38-9.amzn2023.0.2.x86_64 93/107 2025-05-07T19:42:57.6577369Z Installing : perl-lib-0.65-477.amzn2023.0.6.x86_64 94/107 2025-05-07T19:42:57.6642652Z Installing : perl-Git-2.47.1-1.amzn2023.0.2.noarch 95/107 2025-05-07T19:42:57.6724234Z Installing : git-2.47.1-1.amzn2023.0.2.x86_64 96/107 2025-05-07T19:42:57.6928966Z Installing : elfutils-debuginfod-client-0.188-3.amzn2023.0.2. 97/107 2025-05-07T19:42:57.7060752Z Installing : cyrus-sasl-lib-2.1.27-18.amzn2023.0.3.x86_64 98/107 2025-05-07T19:42:57.7148637Z Installing : openldap-2.4.57-6.amzn2023.0.7.x86_64 99/107 2025-05-07T19:42:57.7552639Z Installing : sudo-python-plugin-1.9.15-1.p5.amzn2023.0.1.x86_ 100/107 2025-05-07T19:42:57.8790431Z Installing : sudo-1.9.15-1.p5.amzn2023.0.1.x86_64 101/107 2025-05-07T19:42:57.8883544Z Installing : binutils-2.41-50.amzn2023.0.3.x86_64 102/107 2025-05-07T19:42:57.8994239Z Running scriptlet: binutils-2.41-50.amzn2023.0.3.x86_64 102/107 2025-05-07T19:42:57.9306591Z Installing : pciutils-3.7.0-3.amzn2023.0.2.x86_64 103/107 2025-05-07T19:42:57.9403135Z Installing : wget-1.21.3-1.amzn2023.0.4.x86_64 104/107 2025-05-07T19:42:57.9652249Z Installing : which-2.21-26.amzn2023.0.2.x86_64 105/107 2025-05-07T19:42:57.9867392Z Installing : tar-2:1.34-1.amzn2023.0.4.x86_64 106/107 2025-05-07T19:42:57.9953969Z Installing : findutils-1:4.8.0-2.amzn2023.0.2.x86_64 107/107 2025-05-07T19:42:58.0068888Z Running scriptlet: pam-1.5.1-8.amzn2023.0.4.x86_64 107/107 2025-05-07T19:42:58.7742004Z Running scriptlet: findutils-1:4.8.0-2.amzn2023.0.2.x86_64 107/107 2025-05-07T19:42:58.7744025Z Verifying : binutils-2.41-50.amzn2023.0.3.x86_64 1/107 2025-05-07T19:42:58.7746126Z Verifying : cracklib-2.9.6-27.amzn2023.0.2.x86_64 2/107 2025-05-07T19:42:58.7747798Z Verifying : cyrus-sasl-lib-2.1.27-18.amzn2023.0.3.x86_64 3/107 2025-05-07T19:42:58.7749662Z Verifying : elfutils-debuginfod-client-0.188-3.amzn2023.0.2. 4/107 2025-05-07T19:42:58.7750351Z Verifying : findutils-1:4.8.0-2.amzn2023.0.2.x86_64 5/107 2025-05-07T19:42:58.7750977Z Verifying : git-2.47.1-1.amzn2023.0.2.x86_64 6/107 2025-05-07T19:42:58.7751708Z Verifying : git-core-2.47.1-1.amzn2023.0.2.x86_64 7/107 2025-05-07T19:42:58.7752409Z Verifying : git-core-doc-2.47.1-1.amzn2023.0.2.noarch 8/107 2025-05-07T19:42:58.7753356Z Verifying : gnutls-3.8.3-6.amzn2023.0.1.x86_64 9/107 2025-05-07T19:42:58.7753964Z Verifying : groff-base-1.22.4-7.amzn2023.0.2.x86_64 10/107 2025-05-07T19:42:58.7754641Z Verifying : gzip-1.12-1.amzn2023.0.1.x86_64 11/107 2025-05-07T19:42:58.7755212Z Verifying : hwdata-0.384-1.amzn2023.0.3.noarch 12/107 2025-05-07T19:42:58.7755853Z Verifying : jansson-2.14-0.amzn2023.x86_64 13/107 2025-05-07T19:42:58.7756512Z Verifying : kmod-libs-29-2.amzn2023.0.5.x86_64 14/107 2025-05-07T19:42:58.7757116Z Verifying : less-608-2.amzn2023.0.2.x86_64 15/107 2025-05-07T19:42:58.7757729Z Verifying : libcbor-0.7.0-3.amzn2023.0.2.x86_64 16/107 2025-05-07T19:42:58.7758351Z Verifying : libdb-5.3.28-49.amzn2023.0.2.x86_64 17/107 2025-05-07T19:42:58.7758991Z Verifying : libeconf-0.4.0-1.amzn2023.0.3.x86_64 18/107 2025-05-07T19:42:58.7759722Z Verifying : libedit-3.1-38.20210714cvs.amzn2023.0.2.x86_64 19/107 2025-05-07T19:42:58.7760366Z Verifying : libfdisk-2.37.4-1.amzn2023.0.4.x86_64 20/107 2025-05-07T19:42:58.7761042Z Verifying : libfido2-1.10.0-2.amzn2023.0.2.x86_64 21/107 2025-05-07T19:42:58.7761673Z Verifying : libmetalink-0.1.3-14.amzn2023.0.2.x86_64 22/107 2025-05-07T19:42:58.7762334Z Verifying : libpwquality-1.4.4-6.amzn2023.0.2.x86_64 23/107 2025-05-07T19:42:58.7763003Z Verifying : libsemanage-3.4-5.amzn2023.0.2.x86_64 24/107 2025-05-07T19:42:58.7763618Z Verifying : libutempter-1.2.1-4.amzn2023.0.2.x86_64 25/107 2025-05-07T19:42:58.7764289Z Verifying : nano-8.3-1.amzn2023.x86_64 26/107 2025-05-07T19:42:58.7764883Z Verifying : nano-default-editor-8.3-1.amzn2023.noarch 27/107 2025-05-07T19:42:58.7765553Z Verifying : ncurses-6.2-4.20200222.amzn2023.0.6.x86_64 28/107 2025-05-07T19:42:58.7766214Z Verifying : nettle-3.10.1-1.amzn2023.0.1.x86_64 29/107 2025-05-07T19:42:58.7766787Z Verifying : openldap-2.4.57-6.amzn2023.0.7.x86_64 30/107 2025-05-07T19:42:58.7767520Z Verifying : openssh-8.7p1-8.amzn2023.0.14.x86_64 31/107 2025-05-07T19:42:58.7768148Z Verifying : openssh-clients-8.7p1-8.amzn2023.0.14.x86_64 32/107 2025-05-07T19:42:58.7768818Z Verifying : pam-1.5.1-8.amzn2023.0.4.x86_64 33/107 2025-05-07T19:42:58.7769448Z Verifying : pciutils-3.7.0-3.amzn2023.0.2.x86_64 34/107 2025-05-07T19:42:58.7770047Z Verifying : pciutils-libs-3.7.0-3.amzn2023.0.2.x86_64 35/107 2025-05-07T19:42:58.7770747Z Verifying : perl-AutoLoader-5.74-477.amzn2023.0.6.noarch 36/107 2025-05-07T19:42:58.7771498Z Verifying : perl-B-1.80-477.amzn2023.0.6.x86_64 37/107 2025-05-07T19:42:58.7772174Z Verifying : perl-Carp-1.50-458.amzn2023.0.2.noarch 38/107 2025-05-07T19:42:58.7772846Z Verifying : perl-Class-Struct-0.66-477.amzn2023.0.6.noarch 39/107 2025-05-07T19:42:58.7773525Z Verifying : perl-Data-Dumper-2.174-460.amzn2023.0.2.x86_64 40/107 2025-05-07T19:42:58.7774215Z Verifying : perl-Digest-1.20-1.amzn2023.0.2.noarch 41/107 2025-05-07T19:42:58.7774817Z Verifying : perl-Digest-MD5-2.58-2.amzn2023.0.2.x86_64 42/107 2025-05-07T19:42:58.7775558Z Verifying : perl-DynaLoader-1.47-477.amzn2023.0.6.x86_64 43/107 2025-05-07T19:42:58.7776234Z Verifying : perl-Encode-4:3.15-462.amzn2023.0.2.x86_64 44/107 2025-05-07T19:42:58.7776825Z Verifying : perl-Errno-1.30-477.amzn2023.0.6.x86_64 45/107 2025-05-07T19:42:58.7777554Z Verifying : perl-Error-1:0.17029-5.amzn2023.0.2.noarch 46/107 2025-05-07T19:42:58.7778097Z Verifying : perl-Exporter-5.74-459.amzn2023.0.2.noarch 47/107 2025-05-07T19:42:58.7778619Z Verifying : perl-Fcntl-1.13-477.amzn2023.0.6.x86_64 48/107 2025-05-07T19:42:58.7779172Z Verifying : perl-File-Basename-2.85-477.amzn2023.0.6.noarch 49/107 2025-05-07T19:42:58.7779736Z Verifying : perl-File-Find-1.37-477.amzn2023.0.6.noarch 50/107 2025-05-07T19:42:58.7780274Z Verifying : perl-File-Path-2.18-2.amzn2023.0.2.noarch 51/107 2025-05-07T19:42:58.7780823Z Verifying : perl-File-Temp-1:0.231.100-2.amzn2023.0.2.noarch 52/107 2025-05-07T19:42:58.7781346Z Verifying : perl-File-stat-1.09-477.amzn2023.0.6.noarch 53/107 2025-05-07T19:42:58.7781917Z Verifying : perl-FileHandle-2.03-477.amzn2023.0.6.noarch 54/107 2025-05-07T19:42:58.7782456Z Verifying : perl-Getopt-Long-1:2.52-2.amzn2023.0.2.noarch 55/107 2025-05-07T19:42:58.7783021Z Verifying : perl-Getopt-Std-1.12-477.amzn2023.0.6.noarch 56/107 2025-05-07T19:42:58.7783569Z Verifying : perl-Git-2.47.1-1.amzn2023.0.2.noarch 57/107 2025-05-07T19:42:58.7784214Z Verifying : perl-HTTP-Tiny-0.078-1.amzn2023.0.3.noarch 58/107 2025-05-07T19:42:58.7784743Z Verifying : perl-IO-1.43-477.amzn2023.0.6.x86_64 59/107 2025-05-07T19:42:58.7785248Z Verifying : perl-IO-Socket-IP-0.41-3.amzn2023.0.2.noarch 60/107 2025-05-07T19:42:58.7785796Z Verifying : perl-IO-Socket-SSL-2.075-1.amzn2023.0.2.noarch 61/107 2025-05-07T19:42:58.7786344Z Verifying : perl-IPC-Open3-1.21-477.amzn2023.0.6.noarch 62/107 2025-05-07T19:42:58.7786888Z Verifying : perl-MIME-Base64-3.16-2.amzn2023.0.2.x86_64 63/107 2025-05-07T19:42:58.7787428Z Verifying : perl-Mozilla-CA-20200520-4.amzn2023.0.2.noarch 64/107 2025-05-07T19:42:58.7787940Z Verifying : perl-NDBM_File-1.15-477.amzn2023.0.6.x86_64 65/107 2025-05-07T19:42:58.7788467Z Verifying : perl-Net-SSLeay-1.94-1.amzn2023.0.1.x86_64 66/107 2025-05-07T19:42:58.7788995Z Verifying : perl-POSIX-1.94-477.amzn2023.0.6.x86_64 67/107 2025-05-07T19:42:58.7789492Z Verifying : perl-PathTools-3.78-459.amzn2023.0.2.x86_64 68/107 2025-05-07T19:42:58.7790017Z Verifying : perl-Pod-Escapes-1:1.07-458.amzn2023.0.2.noarch 69/107 2025-05-07T19:42:58.7790973Z Verifying : perl-Pod-Perldoc-3.28.01-459.amzn2023.0.3.noarch 70/107 2025-05-07T19:42:58.7791646Z Verifying : perl-Pod-Simple-1:3.42-2.amzn2023.0.2.noarch 71/107 2025-05-07T19:42:58.7792161Z Verifying : perl-Pod-Usage-4:2.01-2.amzn2023.0.2.noarch 72/107 2025-05-07T19:42:58.7792701Z Verifying : perl-Scalar-List-Utils-4:1.56-459.amzn2023.0.2.x 73/107 2025-05-07T19:42:58.7793444Z Verifying : perl-SelectSaver-1.02-477.amzn2023.0.6.noarch 74/107 2025-05-07T19:42:58.7793968Z Verifying : perl-Socket-4:2.032-1.amzn2023.0.2.x86_64 75/107 2025-05-07T19:42:58.7794493Z Verifying : perl-Storable-1:3.21-458.amzn2023.0.2.x86_64 76/107 2025-05-07T19:42:58.7795008Z Verifying : perl-Symbol-1.08-477.amzn2023.0.6.noarch 77/107 2025-05-07T19:42:58.7795563Z Verifying : perl-Term-ANSIColor-5.01-459.amzn2023.0.2.noarch 78/107 2025-05-07T19:42:58.7796119Z Verifying : perl-Term-Cap-1.17-458.amzn2023.0.2.noarch 79/107 2025-05-07T19:42:58.7796647Z Verifying : perl-TermReadKey-2.38-9.amzn2023.0.2.x86_64 80/107 2025-05-07T19:42:58.7797214Z Verifying : perl-Text-ParseWords-3.30-458.amzn2023.0.2.noarc 81/107 2025-05-07T19:42:58.7797755Z Verifying : perl-Text-Tabs+Wrap-2021.0726-1.amzn2023.0.1.noa 82/107 2025-05-07T19:42:58.7798391Z Verifying : perl-Time-Local-2:1.300-5.amzn2023.0.2.noarch 83/107 2025-05-07T19:42:58.7798988Z Verifying : perl-URI-5.09-1.amzn2023.0.2.noarch 84/107 2025-05-07T19:42:58.7799485Z Verifying : perl-base-2.27-477.amzn2023.0.6.noarch 85/107 2025-05-07T19:42:58.7799995Z Verifying : perl-constant-1.33-459.amzn2023.0.2.noarch 86/107 2025-05-07T19:42:58.7800477Z Verifying : perl-if-0.60.800-477.amzn2023.0.6.noarch 87/107 2025-05-07T19:42:58.7800975Z Verifying : perl-interpreter-4:5.32.1-477.amzn2023.0.6.x86_6 88/107 2025-05-07T19:42:58.7801448Z Verifying : perl-lib-0.65-477.amzn2023.0.6.x86_64 89/107 2025-05-07T19:42:58.7801946Z Verifying : perl-libnet-3.13-2.amzn2023.0.2.noarch 90/107 2025-05-07T19:42:58.7802434Z Verifying : perl-libs-4:5.32.1-477.amzn2023.0.6.x86_64 91/107 2025-05-07T19:42:58.7802894Z Verifying : perl-mro-1.23-477.amzn2023.0.6.x86_64 92/107 2025-05-07T19:42:58.7803395Z Verifying : perl-overload-1.31-477.amzn2023.0.6.noarch 93/107 2025-05-07T19:42:58.7803899Z Verifying : perl-overloading-0.02-477.amzn2023.0.6.noarch 94/107 2025-05-07T19:42:58.7804405Z Verifying : perl-parent-1:0.238-458.amzn2023.0.2.noarch 95/107 2025-05-07T19:42:58.7804885Z Verifying : perl-podlators-1:4.14-458.amzn2023.0.2.noarch 96/107 2025-05-07T19:42:58.7805386Z Verifying : perl-subs-1.03-477.amzn2023.0.6.noarch 97/107 2025-05-07T19:42:58.7805883Z Verifying : perl-vars-1.05-477.amzn2023.0.6.noarch 98/107 2025-05-07T19:42:58.7806356Z Verifying : shadow-utils-2:4.9-12.amzn2023.0.4.x86_64 99/107 2025-05-07T19:42:58.7806837Z Verifying : sudo-1.9.15-1.p5.amzn2023.0.1.x86_64 100/107 2025-05-07T19:42:58.7807325Z Verifying : sudo-python-plugin-1.9.15-1.p5.amzn2023.0.1.x86_ 101/107 2025-05-07T19:42:58.7807846Z Verifying : systemd-libs-252.23-3.amzn2023.x86_64 102/107 2025-05-07T19:42:58.7808308Z Verifying : tar-2:1.34-1.amzn2023.0.4.x86_64 103/107 2025-05-07T19:42:58.7808783Z Verifying : util-linux-2.37.4-1.amzn2023.0.4.x86_64 104/107 2025-05-07T19:42:58.7809282Z Verifying : util-linux-core-2.37.4-1.amzn2023.0.4.x86_64 105/107 2025-05-07T19:42:58.7809749Z Verifying : wget-1.21.3-1.amzn2023.0.4.x86_64 106/107 2025-05-07T19:42:58.8862353Z Verifying : which-2.21-26.amzn2023.0.2.x86_64 107/107 2025-05-07T19:42:58.8863416Z 2025-05-07T19:42:58.8863674Z Installed: 2025-05-07T19:42:58.8864600Z binutils-2.41-50.amzn2023.0.3.x86_64 2025-05-07T19:42:58.8866126Z cracklib-2.9.6-27.amzn2023.0.2.x86_64 2025-05-07T19:42:58.8867648Z cyrus-sasl-lib-2.1.27-18.amzn2023.0.3.x86_64 2025-05-07T19:42:58.8868554Z elfutils-debuginfod-client-0.188-3.amzn2023.0.2.x86_64 2025-05-07T19:42:58.8869131Z findutils-1:4.8.0-2.amzn2023.0.2.x86_64 2025-05-07T19:42:58.8869606Z git-2.47.1-1.amzn2023.0.2.x86_64 2025-05-07T19:42:58.8870100Z git-core-2.47.1-1.amzn2023.0.2.x86_64 2025-05-07T19:42:58.8870607Z git-core-doc-2.47.1-1.amzn2023.0.2.noarch 2025-05-07T19:42:58.8871129Z gnutls-3.8.3-6.amzn2023.0.1.x86_64 2025-05-07T19:42:58.8871968Z groff-base-1.22.4-7.amzn2023.0.2.x86_64 2025-05-07T19:42:58.8872468Z gzip-1.12-1.amzn2023.0.1.x86_64 2025-05-07T19:42:58.8872985Z hwdata-0.384-1.amzn2023.0.3.noarch 2025-05-07T19:42:58.8873628Z jansson-2.14-0.amzn2023.x86_64 2025-05-07T19:42:58.8874150Z kmod-libs-29-2.amzn2023.0.5.x86_64 2025-05-07T19:42:58.8874639Z less-608-2.amzn2023.0.2.x86_64 2025-05-07T19:42:58.8875144Z libcbor-0.7.0-3.amzn2023.0.2.x86_64 2025-05-07T19:42:58.8875657Z libdb-5.3.28-49.amzn2023.0.2.x86_64 2025-05-07T19:42:58.8876153Z libeconf-0.4.0-1.amzn2023.0.3.x86_64 2025-05-07T19:42:58.8876703Z libedit-3.1-38.20210714cvs.amzn2023.0.2.x86_64 2025-05-07T19:42:58.8877233Z libfdisk-2.37.4-1.amzn2023.0.4.x86_64 2025-05-07T19:42:58.8877915Z libfido2-1.10.0-2.amzn2023.0.2.x86_64 2025-05-07T19:42:58.8878414Z libmetalink-0.1.3-14.amzn2023.0.2.x86_64 2025-05-07T19:42:58.8878936Z libpwquality-1.4.4-6.amzn2023.0.2.x86_64 2025-05-07T19:42:58.8879437Z libsemanage-3.4-5.amzn2023.0.2.x86_64 2025-05-07T19:42:58.8879948Z libutempter-1.2.1-4.amzn2023.0.2.x86_64 2025-05-07T19:42:58.8880414Z nano-8.3-1.amzn2023.x86_64 2025-05-07T19:42:58.8880916Z nano-default-editor-8.3-1.amzn2023.noarch 2025-05-07T19:42:58.8881446Z ncurses-6.2-4.20200222.amzn2023.0.6.x86_64 2025-05-07T19:42:58.8881920Z nettle-3.10.1-1.amzn2023.0.1.x86_64 2025-05-07T19:42:58.8882425Z openldap-2.4.57-6.amzn2023.0.7.x86_64 2025-05-07T19:42:58.8882917Z openssh-8.7p1-8.amzn2023.0.14.x86_64 2025-05-07T19:42:58.8883466Z openssh-clients-8.7p1-8.amzn2023.0.14.x86_64 2025-05-07T19:42:58.8883966Z pam-1.5.1-8.amzn2023.0.4.x86_64 2025-05-07T19:42:58.8884468Z pciutils-3.7.0-3.amzn2023.0.2.x86_64 2025-05-07T19:42:58.8884997Z pciutils-libs-3.7.0-3.amzn2023.0.2.x86_64 2025-05-07T19:42:58.8885531Z perl-AutoLoader-5.74-477.amzn2023.0.6.noarch 2025-05-07T19:42:58.8886071Z perl-B-1.80-477.amzn2023.0.6.x86_64 2025-05-07T19:42:58.8886569Z perl-Carp-1.50-458.amzn2023.0.2.noarch 2025-05-07T19:42:58.8887126Z perl-Class-Struct-0.66-477.amzn2023.0.6.noarch 2025-05-07T19:42:58.8887669Z perl-Data-Dumper-2.174-460.amzn2023.0.2.x86_64 2025-05-07T19:42:58.8888327Z perl-Digest-1.20-1.amzn2023.0.2.noarch 2025-05-07T19:42:58.8888876Z perl-Digest-MD5-2.58-2.amzn2023.0.2.x86_64 2025-05-07T19:42:58.8889409Z perl-DynaLoader-1.47-477.amzn2023.0.6.x86_64 2025-05-07T19:42:58.8889972Z perl-Encode-4:3.15-462.amzn2023.0.2.x86_64 2025-05-07T19:42:58.8890704Z perl-Errno-1.30-477.amzn2023.0.6.x86_64 2025-05-07T19:42:58.8891497Z perl-Error-1:0.17029-5.amzn2023.0.2.noarch 2025-05-07T19:42:58.8892057Z perl-Exporter-5.74-459.amzn2023.0.2.noarch 2025-05-07T19:42:58.8892617Z perl-Fcntl-1.13-477.amzn2023.0.6.x86_64 2025-05-07T19:42:58.8893228Z perl-File-Basename-2.85-477.amzn2023.0.6.noarch 2025-05-07T19:42:58.8893927Z perl-File-Find-1.37-477.amzn2023.0.6.noarch 2025-05-07T19:42:58.8894534Z perl-File-Path-2.18-2.amzn2023.0.2.noarch 2025-05-07T19:42:58.8895096Z perl-File-Temp-1:0.231.100-2.amzn2023.0.2.noarch 2025-05-07T19:42:58.8895692Z perl-File-stat-1.09-477.amzn2023.0.6.noarch 2025-05-07T19:42:58.8896300Z perl-FileHandle-2.03-477.amzn2023.0.6.noarch 2025-05-07T19:42:58.8896878Z perl-Getopt-Long-1:2.52-2.amzn2023.0.2.noarch 2025-05-07T19:42:58.8897596Z perl-Getopt-Std-1.12-477.amzn2023.0.6.noarch 2025-05-07T19:42:58.8898123Z perl-Git-2.47.1-1.amzn2023.0.2.noarch 2025-05-07T19:42:58.8898678Z perl-HTTP-Tiny-0.078-1.amzn2023.0.3.noarch 2025-05-07T19:42:58.8899194Z perl-IO-1.43-477.amzn2023.0.6.x86_64 2025-05-07T19:42:58.8899753Z perl-IO-Socket-IP-0.41-3.amzn2023.0.2.noarch 2025-05-07T19:42:58.8900319Z perl-IO-Socket-SSL-2.075-1.amzn2023.0.2.noarch 2025-05-07T19:42:58.8900847Z perl-IPC-Open3-1.21-477.amzn2023.0.6.noarch 2025-05-07T19:42:58.8901416Z perl-MIME-Base64-3.16-2.amzn2023.0.2.x86_64 2025-05-07T19:42:58.8902132Z perl-Mozilla-CA-20200520-4.amzn2023.0.2.noarch 2025-05-07T19:42:58.8902713Z perl-NDBM_File-1.15-477.amzn2023.0.6.x86_64 2025-05-07T19:42:58.8903467Z perl-Net-SSLeay-1.94-1.amzn2023.0.1.x86_64 2025-05-07T19:42:58.8904030Z perl-POSIX-1.94-477.amzn2023.0.6.x86_64 2025-05-07T19:42:58.8904621Z perl-PathTools-3.78-459.amzn2023.0.2.x86_64 2025-05-07T19:42:58.8905190Z perl-Pod-Escapes-1:1.07-458.amzn2023.0.2.noarch 2025-05-07T19:42:58.8905782Z perl-Pod-Perldoc-3.28.01-459.amzn2023.0.3.noarch 2025-05-07T19:42:58.8906396Z perl-Pod-Simple-1:3.42-2.amzn2023.0.2.noarch 2025-05-07T19:42:58.8906979Z perl-Pod-Usage-4:2.01-2.amzn2023.0.2.noarch 2025-05-07T19:42:58.8907543Z perl-Scalar-List-Utils-4:1.56-459.amzn2023.0.2.x86_64 2025-05-07T19:42:58.8908259Z perl-SelectSaver-1.02-477.amzn2023.0.6.noarch 2025-05-07T19:42:58.8908858Z perl-Socket-4:2.032-1.amzn2023.0.2.x86_64 2025-05-07T19:42:58.8909410Z perl-Storable-1:3.21-458.amzn2023.0.2.x86_64 2025-05-07T19:42:58.8909949Z perl-Symbol-1.08-477.amzn2023.0.6.noarch 2025-05-07T19:42:58.8911851Z perl-Term-ANSIColor-5.01-459.amzn2023.0.2.noarch 2025-05-07T19:42:58.8912435Z perl-Term-Cap-1.17-458.amzn2023.0.2.noarch 2025-05-07T19:42:58.8912991Z perl-TermReadKey-2.38-9.amzn2023.0.2.x86_64 2025-05-07T19:42:58.8913583Z perl-Text-ParseWords-3.30-458.amzn2023.0.2.noarch 2025-05-07T19:42:58.8914169Z perl-Text-Tabs+Wrap-2021.0726-1.amzn2023.0.1.noarch 2025-05-07T19:42:58.8914742Z perl-Time-Local-2:1.300-5.amzn2023.0.2.noarch 2025-05-07T19:42:58.8915267Z perl-URI-5.09-1.amzn2023.0.2.noarch 2025-05-07T19:42:58.8915809Z perl-base-2.27-477.amzn2023.0.6.noarch 2025-05-07T19:42:58.8916366Z perl-constant-1.33-459.amzn2023.0.2.noarch 2025-05-07T19:42:58.8916901Z perl-if-0.60.800-477.amzn2023.0.6.noarch 2025-05-07T19:42:58.8917584Z perl-interpreter-4:5.32.1-477.amzn2023.0.6.x86_64 2025-05-07T19:42:58.8918226Z perl-lib-0.65-477.amzn2023.0.6.x86_64 2025-05-07T19:42:58.8918765Z perl-libnet-3.13-2.amzn2023.0.2.noarch 2025-05-07T19:42:58.8919274Z perl-libs-4:5.32.1-477.amzn2023.0.6.x86_64 2025-05-07T19:42:58.8919794Z perl-mro-1.23-477.amzn2023.0.6.x86_64 2025-05-07T19:42:58.8920343Z perl-overload-1.31-477.amzn2023.0.6.noarch 2025-05-07T19:42:58.8920891Z perl-overloading-0.02-477.amzn2023.0.6.noarch 2025-05-07T19:42:58.8921446Z perl-parent-1:0.238-458.amzn2023.0.2.noarch 2025-05-07T19:42:58.8921970Z perl-podlators-1:4.14-458.amzn2023.0.2.noarch 2025-05-07T19:42:58.8922682Z perl-subs-1.03-477.amzn2023.0.6.noarch 2025-05-07T19:42:58.8923194Z perl-vars-1.05-477.amzn2023.0.6.noarch 2025-05-07T19:42:58.8923682Z shadow-utils-2:4.9-12.amzn2023.0.4.x86_64 2025-05-07T19:42:58.8924259Z sudo-1.9.15-1.p5.amzn2023.0.1.x86_64 2025-05-07T19:42:58.8924949Z sudo-python-plugin-1.9.15-1.p5.amzn2023.0.1.x86_64 2025-05-07T19:42:58.8925662Z systemd-libs-252.23-3.amzn2023.x86_64 2025-05-07T19:42:58.8926147Z tar-2:1.34-1.amzn2023.0.4.x86_64 2025-05-07T19:42:58.8926645Z util-linux-2.37.4-1.amzn2023.0.4.x86_64 2025-05-07T19:42:58.8927182Z util-linux-core-2.37.4-1.amzn2023.0.4.x86_64 2025-05-07T19:42:58.8927674Z wget-1.21.3-1.amzn2023.0.4.x86_64 2025-05-07T19:42:58.8928161Z which-2.21-26.amzn2023.0.2.x86_64 2025-05-07T19:42:58.8928459Z 2025-05-07T19:42:58.8928548Z Complete! 2025-05-07T19:42:58.9626124Z ##[group]Run actions/checkout@v4 2025-05-07T19:42:58.9626469Z with: 2025-05-07T19:42:58.9626753Z submodules: true 2025-05-07T19:42:58.9626993Z repository: pytorch/FBGEMM 2025-05-07T19:42:58.9627516Z token: *** 2025-05-07T19:42:58.9627731Z ssh-strict: true 2025-05-07T19:42:58.9627989Z ssh-user: git 2025-05-07T19:42:58.9628225Z persist-credentials: true 2025-05-07T19:42:58.9628515Z clean: true 2025-05-07T19:42:58.9628771Z sparse-checkout-cone-mode: true 2025-05-07T19:42:58.9629048Z fetch-depth: 1 2025-05-07T19:42:58.9629294Z fetch-tags: false 2025-05-07T19:42:58.9629525Z show-progress: true 2025-05-07T19:42:58.9629782Z lfs: false 2025-05-07T19:42:58.9630003Z set-safe-directory: true 2025-05-07T19:42:58.9630491Z env: 2025-05-07T19:42:58.9630713Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:42:58.9631052Z BUILD_ENV: build_binary 2025-05-07T19:42:58.9631415Z BUILD_TARGET: default 2025-05-07T19:42:58.9631862Z BUILD_VARIANT: cuda 2025-05-07T19:42:58.9632220Z BUILD_CUDA_VERSION: 11.8.0 2025-05-07T19:42:58.9632553Z ##[endgroup] 2025-05-07T19:42:58.9678352Z ##[command]/usr/bin/docker exec 3a46c886120491b2b40bc3a625109b99b6e6604702540dfc4f2e17198255ca9b sh -c "cat /etc/*release | grep ^ID" 2025-05-07T19:42:59.2475078Z Syncing repository: pytorch/FBGEMM 2025-05-07T19:42:59.2476608Z ##[group]Getting Git version info 2025-05-07T19:42:59.2476993Z Working directory is '/__w/FBGEMM/FBGEMM' 2025-05-07T19:42:59.2477565Z [command]/usr/bin/git version 2025-05-07T19:42:59.2477849Z git version 2.47.1 2025-05-07T19:42:59.2478860Z ##[endgroup] 2025-05-07T19:42:59.2482810Z Temporarily overriding HOME='/__w/_temp/03032cd9-f59b-4932-bca7-8709b3accb14' before making global git config changes 2025-05-07T19:42:59.2483672Z Adding repository directory to the temporary git global config as a safe directory 2025-05-07T19:42:59.2484436Z [command]/usr/bin/git config --global --add safe.directory /__w/FBGEMM/FBGEMM 2025-05-07T19:42:59.2506245Z [command]/usr/bin/git config --local --get remote.origin.url 2025-05-07T19:42:59.2522689Z https://github.com/pytorch/FBGEMM 2025-05-07T19:42:59.2538092Z ##[group]Removing previously created refs, to avoid conflicts 2025-05-07T19:42:59.2541228Z [command]/usr/bin/git rev-parse --symbolic-full-name --verify --quiet HEAD 2025-05-07T19:42:59.2557267Z HEAD 2025-05-07T19:42:59.2591644Z ##[endgroup] 2025-05-07T19:42:59.2592397Z [command]/usr/bin/git submodule status 2025-05-07T19:42:59.2972301Z e5d7c0bd5d9aec44d68830187138149e6a8c4e32 external/asmjit (e5d7c0b) 2025-05-07T19:42:59.3035138Z 4a61bdd4bd4ed730e078aebc7c0fcf046ff29406 external/composable_kernel (4a61bdd) 2025-05-07T19:42:59.3098219Z 6543fec09b2f04ac4a666882998b534afc9c1349 external/cpuinfo (6543fec) 2025-05-07T19:42:59.3157236Z 3ed8d2ec4ba35ef5d9d8353826209b6f868f63d3 external/cutlass (3ed8d2e) 2025-05-07T19:42:59.3220892Z f8d7d77c06936315286eb55f8de22cd23c188571 external/googletest (f8d7d77) 2025-05-07T19:42:59.3278421Z 420084499c7c1e1c2d801922f40df202eac5f3a0 external/hipify_torch (4200844) 2025-05-07T19:42:59.3337007Z 9cca280a4d0ccf0c08f47a99aa71d1b0e52f8d03 external/json (9cca280) 2025-05-07T19:42:59.3346075Z ##[group]Cleaning the repository 2025-05-07T19:42:59.3348949Z [command]/usr/bin/git clean -ffdx 2025-05-07T19:42:59.3396254Z [command]/usr/bin/git reset --hard HEAD 2025-05-07T19:42:59.4481580Z HEAD is now at a5ab0b0 Merge 3e0eb9844c62b4a9cef00aa8fd072a26f76b40ac into fd4df5f456e0cca514bacd98a39efb72990fd9f4 2025-05-07T19:42:59.4485004Z ##[endgroup] 2025-05-07T19:42:59.4486559Z ##[group]Disabling automatic garbage collection 2025-05-07T19:42:59.4491906Z [command]/usr/bin/git config --local gc.auto 0 2025-05-07T19:42:59.4518333Z ##[endgroup] 2025-05-07T19:42:59.4519554Z ##[group]Setting up auth 2025-05-07T19:42:59.4526025Z [command]/usr/bin/git config --local --name-only --get-regexp core\.sshCommand 2025-05-07T19:42:59.4549581Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'core\.sshCommand' && git config --local --unset-all 'core.sshCommand' || :" 2025-05-07T19:42:59.4829641Z Entering 'external/asmjit' 2025-05-07T19:42:59.4873981Z Entering 'external/composable_kernel' 2025-05-07T19:42:59.4935356Z Entering 'external/cpuinfo' 2025-05-07T19:42:59.4989110Z Entering 'external/cutlass' 2025-05-07T19:42:59.5051718Z Entering 'external/googletest' 2025-05-07T19:42:59.5095468Z Entering 'external/hipify_torch' 2025-05-07T19:42:59.5144795Z Entering 'external/json' 2025-05-07T19:42:59.5202771Z [command]/usr/bin/git config --local --name-only --get-regexp http\.https\:\/\/github\.com\/\.extraheader 2025-05-07T19:42:59.5239563Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'http\.https\:\/\/github\.com\/\.extraheader' && git config --local --unset-all 'http.https://github.com/.extraheader' || :" 2025-05-07T19:42:59.5506240Z Entering 'external/asmjit' 2025-05-07T19:42:59.5553385Z Entering 'external/composable_kernel' 2025-05-07T19:42:59.5612668Z Entering 'external/cpuinfo' 2025-05-07T19:42:59.5659792Z Entering 'external/cutlass' 2025-05-07T19:42:59.5714057Z Entering 'external/googletest' 2025-05-07T19:42:59.5771714Z Entering 'external/hipify_torch' 2025-05-07T19:42:59.5819858Z Entering 'external/json' 2025-05-07T19:42:59.5878604Z [command]/usr/bin/git config --local http.https://github.com/.extraheader AUTHORIZATION: basic *** 2025-05-07T19:42:59.5918832Z ##[endgroup] 2025-05-07T19:42:59.5926008Z ##[group]Fetching the repository 2025-05-07T19:42:59.5926845Z [command]/usr/bin/git -c protocol.version=2 fetch --no-tags --prune --no-recurse-submodules --depth=1 origin +a2f4c52051596e74bc8c16e3d2867a4ecdd271e0:refs/remotes/pull/4066/merge 2025-05-07T19:42:59.7661005Z From https://github.com/pytorch/FBGEMM 2025-05-07T19:42:59.7662048Z + a5ab0b0...a2f4c52 a2f4c52051596e74bc8c16e3d2867a4ecdd271e0 -> pull/4066/merge (forced update) 2025-05-07T19:42:59.7677397Z ##[endgroup] 2025-05-07T19:42:59.7678485Z ##[group]Determining the checkout info 2025-05-07T19:42:59.7679709Z ##[endgroup] 2025-05-07T19:42:59.7682130Z [command]/usr/bin/git sparse-checkout disable 2025-05-07T19:42:59.8183392Z [command]/usr/bin/git config --local --unset-all extensions.worktreeConfig 2025-05-07T19:42:59.8209712Z ##[group]Checking out the ref 2025-05-07T19:42:59.8210189Z [command]/usr/bin/git checkout --progress --force refs/remotes/pull/4066/merge 2025-05-07T19:42:59.9208123Z Warning: you are leaving 1 commit behind, not connected to 2025-05-07T19:42:59.9208533Z any of your branches: 2025-05-07T19:42:59.9208711Z 2025-05-07T19:42:59.9209074Z a5ab0b0 Merge 3e0eb9844c62b4a9cef00aa8fd072a26f76b40ac into fd4df5f456e0cca514bacd98a39efb72990fd9f4 2025-05-07T19:42:59.9209575Z 2025-05-07T19:42:59.9209802Z If you want to keep it by creating a new branch, this may be a good time 2025-05-07T19:42:59.9210212Z to do so with: 2025-05-07T19:42:59.9210342Z 2025-05-07T19:42:59.9210497Z git branch a5ab0b0 2025-05-07T19:42:59.9210701Z 2025-05-07T19:42:59.9211104Z HEAD is now at a2f4c52 Merge 6060cd4b5f971680caecdcc657faccb5720d1c3e into fd4df5f456e0cca514bacd98a39efb72990fd9f4 2025-05-07T19:42:59.9212358Z ##[endgroup] 2025-05-07T19:42:59.9212802Z ##[group]Setting up auth for fetching submodules 2025-05-07T19:42:59.9213385Z [command]/usr/bin/git config --global http.https://github.com/.extraheader AUTHORIZATION: basic *** 2025-05-07T19:42:59.9234477Z [command]/usr/bin/git config --global --unset-all url.https://github.com/.insteadOf 2025-05-07T19:42:59.9257212Z [command]/usr/bin/git config --global --add url.https://github.com/.insteadOf git@github.com: 2025-05-07T19:42:59.9284431Z [command]/usr/bin/git config --global --add url.https://github.com/.insteadOf org-21003710@github.com: 2025-05-07T19:42:59.9309894Z ##[endgroup] 2025-05-07T19:42:59.9311020Z ##[group]Fetching submodules 2025-05-07T19:42:59.9312113Z [command]/usr/bin/git submodule sync 2025-05-07T19:42:59.9633831Z Synchronizing submodule url for 'external/asmjit' 2025-05-07T19:42:59.9634336Z Synchronizing submodule url for 'external/composable_kernel' 2025-05-07T19:42:59.9635136Z Synchronizing submodule url for 'external/cpuinfo' 2025-05-07T19:42:59.9635570Z Synchronizing submodule url for 'external/cutlass' 2025-05-07T19:42:59.9635980Z Synchronizing submodule url for 'external/googletest' 2025-05-07T19:42:59.9636425Z Synchronizing submodule url for 'external/hipify_torch' 2025-05-07T19:42:59.9636823Z Synchronizing submodule url for 'external/json' 2025-05-07T19:42:59.9648851Z [command]/usr/bin/git -c protocol.version=2 submodule update --init --force --depth=1 2025-05-07T19:43:00.0441888Z Submodule path 'external/asmjit': checked out 'e5d7c0bd5d9aec44d68830187138149e6a8c4e32' 2025-05-07T19:43:00.3070771Z Submodule path 'external/composable_kernel': checked out '4a61bdd4bd4ed730e078aebc7c0fcf046ff29406' 2025-05-07T19:43:00.3994380Z Submodule path 'external/cpuinfo': checked out '6543fec09b2f04ac4a666882998b534afc9c1349' 2025-05-07T19:43:01.0835176Z Submodule path 'external/cutlass': checked out '3ed8d2ec4ba35ef5d9d8353826209b6f868f63d3' 2025-05-07T19:43:01.1208694Z Submodule path 'external/googletest': checked out 'f8d7d77c06936315286eb55f8de22cd23c188571' 2025-05-07T19:43:01.1296339Z Submodule path 'external/hipify_torch': checked out '420084499c7c1e1c2d801922f40df202eac5f3a0' 2025-05-07T19:43:01.2343544Z Submodule path 'external/json': checked out '9cca280a4d0ccf0c08f47a99aa71d1b0e52f8d03' 2025-05-07T19:43:01.2351533Z [command]/usr/bin/git submodule foreach git config --local gc.auto 0 2025-05-07T19:43:01.2630753Z Entering 'external/asmjit' 2025-05-07T19:43:01.2655697Z Entering 'external/composable_kernel' 2025-05-07T19:43:01.2691052Z Entering 'external/cpuinfo' 2025-05-07T19:43:01.2723235Z Entering 'external/cutlass' 2025-05-07T19:43:01.2753504Z Entering 'external/googletest' 2025-05-07T19:43:01.2788375Z Entering 'external/hipify_torch' 2025-05-07T19:43:01.2818901Z Entering 'external/json' 2025-05-07T19:43:01.2855959Z ##[endgroup] 2025-05-07T19:43:01.2856468Z ##[group]Persisting credentials for submodules 2025-05-07T19:43:01.2858451Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'url\.https\:\/\/github\.com\/\.insteadOf' && git config --local --unset-all 'url.https://github.com/.insteadOf' || :" 2025-05-07T19:43:01.3129823Z Entering 'external/asmjit' 2025-05-07T19:43:01.3158431Z url.https://github.com/.insteadof 2025-05-07T19:43:01.3158840Z url.https://github.com/.insteadof 2025-05-07T19:43:01.3189403Z Entering 'external/composable_kernel' 2025-05-07T19:43:01.3222836Z url.https://github.com/.insteadof 2025-05-07T19:43:01.3223277Z url.https://github.com/.insteadof 2025-05-07T19:43:01.3257925Z Entering 'external/cpuinfo' 2025-05-07T19:43:01.3304895Z url.https://github.com/.insteadof 2025-05-07T19:43:01.3305888Z url.https://github.com/.insteadof 2025-05-07T19:43:01.3338401Z Entering 'external/cutlass' 2025-05-07T19:43:01.3373661Z url.https://github.com/.insteadof 2025-05-07T19:43:01.3374107Z url.https://github.com/.insteadof 2025-05-07T19:43:01.3412761Z Entering 'external/googletest' 2025-05-07T19:43:01.3458140Z url.https://github.com/.insteadof 2025-05-07T19:43:01.3459095Z url.https://github.com/.insteadof 2025-05-07T19:43:01.3491671Z Entering 'external/hipify_torch' 2025-05-07T19:43:01.3525932Z url.https://github.com/.insteadof 2025-05-07T19:43:01.3526658Z url.https://github.com/.insteadof 2025-05-07T19:43:01.3563512Z Entering 'external/json' 2025-05-07T19:43:01.3601264Z url.https://github.com/.insteadof 2025-05-07T19:43:01.3602244Z url.https://github.com/.insteadof 2025-05-07T19:43:01.3649192Z [command]/usr/bin/git submodule foreach sh -c "git config --local 'http.https://github.com/.extraheader' 'AUTHORIZATION: basic ***' && git config --local --show-origin --name-only --get-regexp remote.origin.url" 2025-05-07T19:43:01.3952128Z Entering 'external/asmjit' 2025-05-07T19:43:01.3997057Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/asmjit/config remote.origin.url 2025-05-07T19:43:01.3998470Z Entering 'external/composable_kernel' 2025-05-07T19:43:01.4048560Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/composable_kernel/config remote.origin.url 2025-05-07T19:43:01.4049295Z Entering 'external/cpuinfo' 2025-05-07T19:43:01.4095459Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/cpuinfo/config remote.origin.url 2025-05-07T19:43:01.4095936Z Entering 'external/cutlass' 2025-05-07T19:43:01.4148033Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/cutlass/config remote.origin.url 2025-05-07T19:43:01.4149521Z Entering 'external/googletest' 2025-05-07T19:43:01.4195997Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/googletest/config remote.origin.url 2025-05-07T19:43:01.4203973Z Entering 'external/hipify_torch' 2025-05-07T19:43:01.4249805Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/hipify_torch/config remote.origin.url 2025-05-07T19:43:01.4252869Z Entering 'external/json' 2025-05-07T19:43:01.4296656Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/json/config remote.origin.url 2025-05-07T19:43:01.4370965Z [command]/usr/bin/git submodule foreach git config --local --add 'url.https://github.com/.insteadOf' 'git@github.com:' 2025-05-07T19:43:01.4643152Z Entering 'external/asmjit' 2025-05-07T19:43:01.4675655Z Entering 'external/composable_kernel' 2025-05-07T19:43:01.4704934Z Entering 'external/cpuinfo' 2025-05-07T19:43:01.4737668Z Entering 'external/cutlass' 2025-05-07T19:43:01.4768205Z Entering 'external/googletest' 2025-05-07T19:43:01.4793391Z Entering 'external/hipify_torch' 2025-05-07T19:43:01.4824680Z Entering 'external/json' 2025-05-07T19:43:01.4860979Z [command]/usr/bin/git submodule foreach git config --local --add 'url.https://github.com/.insteadOf' 'org-21003710@github.com:' 2025-05-07T19:43:01.5125238Z Entering 'external/asmjit' 2025-05-07T19:43:01.5160770Z Entering 'external/composable_kernel' 2025-05-07T19:43:01.5187560Z Entering 'external/cpuinfo' 2025-05-07T19:43:01.5221670Z Entering 'external/cutlass' 2025-05-07T19:43:01.5251996Z Entering 'external/googletest' 2025-05-07T19:43:01.5281941Z Entering 'external/hipify_torch' 2025-05-07T19:43:01.5311624Z Entering 'external/json' 2025-05-07T19:43:01.5356226Z ##[endgroup] 2025-05-07T19:43:01.5383408Z [command]/usr/bin/git log -1 --format=%H 2025-05-07T19:43:01.5399735Z a2f4c52051596e74bc8c16e3d2867a4ecdd271e0 2025-05-07T19:43:01.5550831Z ##[group]Run . $PRELUDE; print_system_info 2025-05-07T19:43:01.5551408Z . $PRELUDE; print_system_info 2025-05-07T19:43:01.5552145Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:43:01.5552607Z env: 2025-05-07T19:43:01.5552853Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:43:01.5553202Z BUILD_ENV: build_binary 2025-05-07T19:43:01.5553470Z BUILD_TARGET: default 2025-05-07T19:43:01.5553746Z BUILD_VARIANT: cuda 2025-05-07T19:43:01.5554005Z BUILD_CUDA_VERSION: 11.8.0 2025-05-07T19:43:01.5554296Z ##[endgroup] 2025-05-07T19:43:02.0126107Z ################################################################################ 2025-05-07T19:43:02.0126485Z # Print System Info 2025-05-07T19:43:02.0126718Z # 2025-05-07T19:43:02.0142581Z # [2025-05-07T19:43:02.013Z] + print_system_info 2025-05-07T19:43:02.0142986Z ################################################################################ 2025-05-07T19:43:02.0144328Z 2025-05-07T19:43:02.0144532Z ################################################################################ 2025-05-07T19:43:02.0144880Z [INFO] Printing environment variables ... 2025-05-07T19:43:02.0145379Z + printenv 2025-05-07T19:43:02.0145552Z 2025-05-07T19:43:02.0157394Z GITHUB_WORKSPACE=/__w/FBGEMM/FBGEMM 2025-05-07T19:43:02.0157766Z BUILD_VARIANT=cuda 2025-05-07T19:43:02.0158009Z HOSTNAME=3a46c8861204 2025-05-07T19:43:02.0158425Z GITHUB_PATH=/__w/_temp/_runner_file_commands/add_path_155f08f4-cb5f-4a32-b375-f939d8fe3a4f 2025-05-07T19:43:02.0158914Z GITHUB_ACTION=__run_2 2025-05-07T19:43:02.0159145Z GITHUB_RUN_NUMBER=10601 2025-05-07T19:43:02.0159423Z RUNNER_NAME=i-066c3e49b24aebf4f 2025-05-07T19:43:02.0159721Z GITHUB_REPOSITORY_OWNER_ID=21003710 2025-05-07T19:43:02.0160021Z PLATFORM_NAME_LC=linux-x86_64 2025-05-07T19:43:02.0160307Z MACHINE_NAME_LC=x86_64 2025-05-07T19:43:02.0160556Z GITHUB_TRIGGERING_ACTOR=q10 2025-05-07T19:43:02.0160858Z PRELUDE=.github/scripts/setup_env.bash 2025-05-07T19:43:02.0161159Z GITHUB_REF_TYPE=branch 2025-05-07T19:43:02.0161740Z *** 2025-05-07T19:43:02.0161941Z GITHUB_REPOSITORY_ID=150154628 2025-05-07T19:43:02.0162216Z GITHUB_ACTIONS=true 2025-05-07T19:43:02.0162497Z GITHUB_SHA=a2f4c52051596e74bc8c16e3d2867a4ecdd271e0 2025-05-07T19:43:02.0163061Z GITHUB_WORKFLOW_REF=pytorch/FBGEMM/.github/workflows/fbgemm_gpu_ci_cuda.yml@refs/pull/4066/merge 2025-05-07T19:43:02.0163607Z RUNNER_ENVIRONMENT=self-hosted 2025-05-07T19:43:02.0163882Z GITHUB_REF=refs/pull/4066/merge 2025-05-07T19:43:02.0164157Z RUNNER_OS=Linux 2025-05-07T19:43:02.0164380Z GITHUB_REF_PROTECTED=false 2025-05-07T19:43:02.0164650Z HOME=/github/home 2025-05-07T19:43:02.0165148Z GITHUB_API_URL=https://api.github.com 2025-05-07T19:43:02.0165466Z RUNNER_ARCH=X64 2025-05-07T19:43:02.0165686Z RUNNER_TEMP=/__w/_temp 2025-05-07T19:43:02.0165945Z BUILD_TARGET=default 2025-05-07T19:43:02.0166387Z GITHUB_STATE=/__w/_temp/_runner_file_commands/save_state_155f08f4-cb5f-4a32-b375-f939d8fe3a4f 2025-05-07T19:43:02.0167035Z GITHUB_ENV=/__w/_temp/_runner_file_commands/set_env_155f08f4-cb5f-4a32-b375-f939d8fe3a4f 2025-05-07T19:43:02.0167540Z GITHUB_EVENT_PATH=/github/workflow/event.json 2025-05-07T19:43:02.0167865Z GITHUB_EVENT_NAME=pull_request 2025-05-07T19:43:02.0168151Z GITHUB_RUN_ID=14891846252 2025-05-07T19:43:02.0168611Z GITHUB_STEP_SUMMARY=/__w/_temp/_runner_file_commands/step_summary_155f08f4-cb5f-4a32-b375-f939d8fe3a4f 2025-05-07T19:43:02.0169139Z BUILD_ENV=build_binary 2025-05-07T19:43:02.0169372Z GITHUB_ACTOR=q10 2025-05-07T19:43:02.0169606Z GITHUB_RUN_ATTEMPT=1 2025-05-07T19:43:02.0169849Z KERN_NAME_LC=linux 2025-05-07T19:43:02.0170076Z BUILD_CUDA_VERSION=11.8.0 2025-05-07T19:43:02.0170398Z GITHUB_GRAPHQL_URL=https://api.github.com/graphql 2025-05-07T19:43:02.0170744Z PLATFORM_NAME=Linux-x86_64 2025-05-07T19:43:02.0171148Z GITHUB_SERVER_URL=https://github.com 2025-05-07T19:43:02.0171423Z SHLVL=1 2025-05-07T19:43:02.0171750Z GITHUB_ACTOR_ID=255046 2025-05-07T19:43:02.0171980Z RUNNER_TOOL_CACHE=/__w/_tool 2025-05-07T19:43:02.0172481Z GITHUB_WORKFLOW_SHA=6060cd4b5f971680caecdcc657faccb5720d1c3e 2025-05-07T19:43:02.0172835Z GITHUB_REF_NAME=4066/merge 2025-05-07T19:43:02.0173078Z KERN_NAME=Linux 2025-05-07T19:43:02.0173299Z GITHUB_JOB=build_artifact 2025-05-07T19:43:02.0173540Z GITHUB_REPOSITORY=pytorch/FBGEMM 2025-05-07T19:43:02.0173815Z GITHUB_RETENTION_DAYS=90 2025-05-07T19:43:02.0174046Z RUNNER_WORKSPACE=/__w/FBGEMM 2025-05-07T19:43:02.0174308Z GITHUB_ACTION_REPOSITORY= 2025-05-07T19:43:02.0174628Z PATH=/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin 2025-05-07T19:43:02.0174992Z GITHUB_BASE_REF=main 2025-05-07T19:43:02.0175199Z CI=true 2025-05-07T19:43:02.0175410Z GITHUB_REPOSITORY_OWNER=pytorch 2025-05-07T19:43:02.0175854Z GITHUB_HEAD_REF=bm/genai-rocm-oss-6 2025-05-07T19:43:02.0176141Z GITHUB_ACTION_REF= 2025-05-07T19:43:02.0176394Z GITHUB_WORKFLOW=FBGEMM GPU/GenAI CUDA CI 2025-05-07T19:43:02.0176869Z GITHUB_OUTPUT=/__w/_temp/_runner_file_commands/set_output_155f08f4-cb5f-4a32-b375-f939d8fe3a4f 2025-05-07T19:43:02.0177386Z MACHINE_NAME=x86_64 2025-05-07T19:43:02.0177607Z _=/usr/bin/printenv 2025-05-07T19:43:02.0177807Z 2025-05-07T19:43:02.0177925Z ################################################################################ 2025-05-07T19:43:02.0178428Z [INFO] Print ldd version ... 2025-05-07T19:43:02.0178702Z + ldd --version 2025-05-07T19:43:02.0178832Z 2025-05-07T19:43:02.0178938Z ldd (GNU libc) 2.34 2025-05-07T19:43:02.0179210Z Copyright (C) 2021 Free Software Foundation, Inc. 2025-05-07T19:43:02.0179678Z This is free software; see the source for copying conditions. There is NO 2025-05-07T19:43:02.0180237Z warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. 2025-05-07T19:43:02.0180720Z Written by Roland McGrath and Ulrich Drepper. 2025-05-07T19:43:02.0180947Z 2025-05-07T19:43:02.0181062Z ################################################################################ 2025-05-07T19:43:02.0181392Z [INFO] Print CPU info ... 2025-05-07T19:43:02.0181648Z + nproc 2025-05-07T19:43:02.0181758Z 2025-05-07T19:43:02.0190255Z 96 2025-05-07T19:43:02.0190932Z 2025-05-07T19:43:02.0191291Z + lscpu 2025-05-07T19:43:02.0191692Z 2025-05-07T19:43:02.0456374Z Architecture: x86_64 2025-05-07T19:43:02.0457532Z CPU op-mode(s): 32-bit, 64-bit 2025-05-07T19:43:02.0458728Z Address sizes: 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.0459919Z Byte Order: Little Endian 2025-05-07T19:43:02.0460265Z CPU(s): 96 2025-05-07T19:43:02.0460567Z On-line CPU(s) list: 0-95 2025-05-07T19:43:02.0460912Z Vendor ID: GenuineIntel 2025-05-07T19:43:02.0461574Z Model name: Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.0461987Z CPU family: 6 2025-05-07T19:43:02.0462300Z Model: 85 2025-05-07T19:43:02.0462597Z Thread(s) per core: 2 2025-05-07T19:43:02.0462919Z Core(s) per socket: 24 2025-05-07T19:43:02.0463209Z Socket(s): 2 2025-05-07T19:43:02.0463506Z Stepping: 7 2025-05-07T19:43:02.0463924Z BogoMIPS: 5999.99 2025-05-07T19:43:02.0466318Z Flags: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.0468579Z Hypervisor vendor: KVM 2025-05-07T19:43:02.0469052Z Virtualization type: full 2025-05-07T19:43:02.0469380Z L1d cache: 1.5 MiB (48 instances) 2025-05-07T19:43:02.0469752Z L1i cache: 1.5 MiB (48 instances) 2025-05-07T19:43:02.0470118Z L2 cache: 48 MiB (48 instances) 2025-05-07T19:43:02.0470639Z L3 cache: 71.5 MiB (2 instances) 2025-05-07T19:43:02.0470983Z NUMA node(s): 2 2025-05-07T19:43:02.0471420Z NUMA node0 CPU(s): 0-23,48-71 2025-05-07T19:43:02.0471954Z NUMA node1 CPU(s): 24-47,72-95 2025-05-07T19:43:02.0472420Z Vulnerability Gather data sampling: Unknown: Dependent on hypervisor status 2025-05-07T19:43:02.0473095Z Vulnerability Itlb multihit: KVM: Mitigation: VMX unsupported 2025-05-07T19:43:02.0473593Z Vulnerability L1tf: Mitigation; PTE Inversion 2025-05-07T19:43:02.0474230Z Vulnerability Mds: Vulnerable: Clear CPU buffers attempted, no microcode; SMT Host state unknown 2025-05-07T19:43:02.0474832Z Vulnerability Meltdown: Mitigation; PTI 2025-05-07T19:43:02.0475442Z Vulnerability Mmio stale data: Vulnerable: Clear CPU buffers attempted, no microcode; SMT Host state unknown 2025-05-07T19:43:02.0476071Z Vulnerability Reg file data sampling: Not affected 2025-05-07T19:43:02.0476455Z Vulnerability Retbleed: Vulnerable 2025-05-07T19:43:02.0476851Z Vulnerability Spec rstack overflow: Not affected 2025-05-07T19:43:02.0477230Z Vulnerability Spec store bypass: Vulnerable 2025-05-07T19:43:02.0477816Z Vulnerability Spectre v1: Mitigation; usercopy/swapgs barriers and __user pointer sanitization 2025-05-07T19:43:02.0478756Z Vulnerability Spectre v2: Mitigation; Retpolines; STIBP disabled; RSB filling; PBRSB-eIBRS Not affected; BHI Retpoline 2025-05-07T19:43:02.0479415Z Vulnerability Srbds: Not affected 2025-05-07T19:43:02.0479820Z Vulnerability Tsx async abort: Not affected 2025-05-07T19:43:02.0480060Z 2025-05-07T19:43:02.0480162Z + cat /proc/cpuinfo 2025-05-07T19:43:02.0480318Z 2025-05-07T19:43:02.0480590Z processor : 0 2025-05-07T19:43:02.0480842Z vendor_id : GenuineIntel 2025-05-07T19:43:02.0481096Z cpu family : 6 2025-05-07T19:43:02.0481347Z model : 85 2025-05-07T19:43:02.0481627Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.0482012Z stepping : 7 2025-05-07T19:43:02.0482238Z microcode : 0x5003901 2025-05-07T19:43:02.0482464Z cpu MHz : 1281.436 2025-05-07T19:43:02.0482701Z cache size : 36608 KB 2025-05-07T19:43:02.0482928Z physical id : 0 2025-05-07T19:43:02.0483259Z siblings : 48 2025-05-07T19:43:02.0483461Z core id : 0 2025-05-07T19:43:02.0483709Z cpu cores : 24 2025-05-07T19:43:02.0483922Z apicid : 0 2025-05-07T19:43:02.0484163Z initial apicid : 0 2025-05-07T19:43:02.0484390Z fpu : yes 2025-05-07T19:43:02.0484679Z fpu_exception : yes 2025-05-07T19:43:02.0484910Z cpuid level : 13 2025-05-07T19:43:02.0485151Z wp : yes 2025-05-07T19:43:02.0487444Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.0490113Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.0490942Z bogomips : 5999.99 2025-05-07T19:43:02.0491169Z clflush size : 64 2025-05-07T19:43:02.0491490Z cache_alignment : 64 2025-05-07T19:43:02.0491789Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.0492252Z power management: 2025-05-07T19:43:02.0492400Z 2025-05-07T19:43:02.0492508Z processor : 1 2025-05-07T19:43:02.0492732Z vendor_id : GenuineIntel 2025-05-07T19:43:02.0492996Z cpu family : 6 2025-05-07T19:43:02.0493203Z model : 85 2025-05-07T19:43:02.0493506Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.0493866Z stepping : 7 2025-05-07T19:43:02.0494092Z microcode : 0x5003901 2025-05-07T19:43:02.0494322Z cpu MHz : 1196.822 2025-05-07T19:43:02.0494562Z cache size : 36608 KB 2025-05-07T19:43:02.0494814Z physical id : 0 2025-05-07T19:43:02.0495025Z siblings : 48 2025-05-07T19:43:02.0495255Z core id : 1 2025-05-07T19:43:02.0495464Z cpu cores : 24 2025-05-07T19:43:02.0495693Z apicid : 2 2025-05-07T19:43:02.0495897Z initial apicid : 2 2025-05-07T19:43:02.0496130Z fpu : yes 2025-05-07T19:43:02.0496331Z fpu_exception : yes 2025-05-07T19:43:02.0496568Z cpuid level : 13 2025-05-07T19:43:02.0496777Z wp : yes 2025-05-07T19:43:02.0499071Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.0501720Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.0502307Z bogomips : 5999.99 2025-05-07T19:43:02.0502545Z clflush size : 64 2025-05-07T19:43:02.0502783Z cache_alignment : 64 2025-05-07T19:43:02.0503054Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.0503400Z power management: 2025-05-07T19:43:02.0503536Z 2025-05-07T19:43:02.0503625Z processor : 2 2025-05-07T19:43:02.0503861Z vendor_id : GenuineIntel 2025-05-07T19:43:02.0504107Z cpu family : 6 2025-05-07T19:43:02.0504334Z model : 85 2025-05-07T19:43:02.0504608Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.0504977Z stepping : 7 2025-05-07T19:43:02.0505189Z microcode : 0x5003901 2025-05-07T19:43:02.0505436Z cpu MHz : 1200.928 2025-05-07T19:43:02.0505675Z cache size : 36608 KB 2025-05-07T19:43:02.0505905Z physical id : 0 2025-05-07T19:43:02.0506138Z siblings : 48 2025-05-07T19:43:02.0506344Z core id : 2 2025-05-07T19:43:02.0506684Z cpu cores : 24 2025-05-07T19:43:02.0506895Z apicid : 4 2025-05-07T19:43:02.0507119Z initial apicid : 4 2025-05-07T19:43:02.0507331Z fpu : yes 2025-05-07T19:43:02.0507553Z fpu_exception : yes 2025-05-07T19:43:02.0507773Z cpuid level : 13 2025-05-07T19:43:02.0508001Z wp : yes 2025-05-07T19:43:02.0510281Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.0512997Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.0513598Z bogomips : 5999.99 2025-05-07T19:43:02.0513816Z clflush size : 64 2025-05-07T19:43:02.0514058Z cache_alignment : 64 2025-05-07T19:43:02.0514346Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.0514672Z power management: 2025-05-07T19:43:02.0514806Z 2025-05-07T19:43:02.0514913Z processor : 3 2025-05-07T19:43:02.0515239Z vendor_id : GenuineIntel 2025-05-07T19:43:02.0515497Z cpu family : 6 2025-05-07T19:43:02.0515708Z model : 85 2025-05-07T19:43:02.0516005Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.0516357Z stepping : 7 2025-05-07T19:43:02.0516590Z microcode : 0x5003901 2025-05-07T19:43:02.0516818Z cpu MHz : 2999.998 2025-05-07T19:43:02.0517055Z cache size : 36608 KB 2025-05-07T19:43:02.0517301Z physical id : 0 2025-05-07T19:43:02.0517513Z siblings : 48 2025-05-07T19:43:02.0517730Z core id : 3 2025-05-07T19:43:02.0518051Z cpu cores : 24 2025-05-07T19:43:02.0518276Z apicid : 6 2025-05-07T19:43:02.0518481Z initial apicid : 6 2025-05-07T19:43:02.0518707Z fpu : yes 2025-05-07T19:43:02.0518909Z fpu_exception : yes 2025-05-07T19:43:02.0519149Z cpuid level : 13 2025-05-07T19:43:02.0519360Z wp : yes 2025-05-07T19:43:02.0521613Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.0524184Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.0524758Z bogomips : 5999.99 2025-05-07T19:43:02.0525009Z clflush size : 64 2025-05-07T19:43:02.0525251Z cache_alignment : 64 2025-05-07T19:43:02.0525526Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.0525876Z power management: 2025-05-07T19:43:02.0526013Z 2025-05-07T19:43:02.0526103Z processor : 4 2025-05-07T19:43:02.0526348Z vendor_id : GenuineIntel 2025-05-07T19:43:02.0526596Z cpu family : 6 2025-05-07T19:43:02.0526824Z model : 85 2025-05-07T19:43:02.0527098Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.0527471Z stepping : 7 2025-05-07T19:43:02.0527681Z microcode : 0x5003901 2025-05-07T19:43:02.0527925Z cpu MHz : 1200.913 2025-05-07T19:43:02.0528161Z cache size : 36608 KB 2025-05-07T19:43:02.0528388Z physical id : 0 2025-05-07T19:43:02.0528620Z siblings : 48 2025-05-07T19:43:02.0528824Z core id : 4 2025-05-07T19:43:02.0529049Z cpu cores : 24 2025-05-07T19:43:02.0529248Z apicid : 8 2025-05-07T19:43:02.0529459Z initial apicid : 8 2025-05-07T19:43:02.0529747Z fpu : yes 2025-05-07T19:43:02.0529958Z fpu_exception : yes 2025-05-07T19:43:02.0530171Z cpuid level : 13 2025-05-07T19:43:02.0530392Z wp : yes 2025-05-07T19:43:02.0532604Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.0535163Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.0535739Z bogomips : 5999.99 2025-05-07T19:43:02.0535955Z clflush size : 64 2025-05-07T19:43:02.0536188Z cache_alignment : 64 2025-05-07T19:43:02.0536475Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.0536794Z power management: 2025-05-07T19:43:02.0536926Z 2025-05-07T19:43:02.0537035Z processor : 5 2025-05-07T19:43:02.0537255Z vendor_id : GenuineIntel 2025-05-07T19:43:02.0537516Z cpu family : 6 2025-05-07T19:43:02.0537798Z model : 85 2025-05-07T19:43:02.0538140Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.0538481Z stepping : 7 2025-05-07T19:43:02.0538703Z microcode : 0x5003901 2025-05-07T19:43:02.0538950Z cpu MHz : 2999.998 2025-05-07T19:43:02.0539164Z cache size : 36608 KB 2025-05-07T19:43:02.0539405Z physical id : 0 2025-05-07T19:43:02.0539615Z siblings : 48 2025-05-07T19:43:02.0539838Z core id : 5 2025-05-07T19:43:02.0540038Z cpu cores : 24 2025-05-07T19:43:02.0540261Z apicid : 10 2025-05-07T19:43:02.0540462Z initial apicid : 10 2025-05-07T19:43:02.0540693Z fpu : yes 2025-05-07T19:43:02.0540896Z fpu_exception : yes 2025-05-07T19:43:02.0541129Z cpuid level : 13 2025-05-07T19:43:02.0541338Z wp : yes 2025-05-07T19:43:02.0543570Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.0546137Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.0546703Z bogomips : 5999.99 2025-05-07T19:43:02.0546936Z clflush size : 64 2025-05-07T19:43:02.0547171Z cache_alignment : 64 2025-05-07T19:43:02.0547439Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.0547774Z power management: 2025-05-07T19:43:02.0547908Z 2025-05-07T19:43:02.0547997Z processor : 6 2025-05-07T19:43:02.0548229Z vendor_id : GenuineIntel 2025-05-07T19:43:02.0548467Z cpu family : 6 2025-05-07T19:43:02.0548695Z model : 85 2025-05-07T19:43:02.0548969Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.0549326Z stepping : 7 2025-05-07T19:43:02.0549532Z microcode : 0x5003901 2025-05-07T19:43:02.0549774Z cpu MHz : 2999.998 2025-05-07T19:43:02.0550010Z cache size : 36608 KB 2025-05-07T19:43:02.0550231Z physical id : 0 2025-05-07T19:43:02.0550452Z siblings : 48 2025-05-07T19:43:02.0550650Z core id : 6 2025-05-07T19:43:02.0550860Z cpu cores : 24 2025-05-07T19:43:02.0551061Z apicid : 12 2025-05-07T19:43:02.0551375Z initial apicid : 12 2025-05-07T19:43:02.0551767Z fpu : yes 2025-05-07T19:43:02.0551991Z fpu_exception : yes 2025-05-07T19:43:02.0552380Z cpuid level : 13 2025-05-07T19:43:02.0552609Z wp : yes 2025-05-07T19:43:02.0554876Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.0557513Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.0558114Z bogomips : 5999.99 2025-05-07T19:43:02.0558351Z clflush size : 64 2025-05-07T19:43:02.0558575Z cache_alignment : 64 2025-05-07T19:43:02.0558872Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.0559201Z power management: 2025-05-07T19:43:02.0559334Z 2025-05-07T19:43:02.0559442Z processor : 7 2025-05-07T19:43:02.0559664Z vendor_id : GenuineIntel 2025-05-07T19:43:02.0559919Z cpu family : 6 2025-05-07T19:43:02.0560122Z model : 85 2025-05-07T19:43:02.0560412Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.0560826Z stepping : 7 2025-05-07T19:43:02.0561054Z microcode : 0x5003901 2025-05-07T19:43:02.0561296Z cpu MHz : 2999.998 2025-05-07T19:43:02.0561528Z cache size : 36608 KB 2025-05-07T19:43:02.0561779Z physical id : 0 2025-05-07T19:43:02.0561996Z siblings : 48 2025-05-07T19:43:02.0562223Z core id : 7 2025-05-07T19:43:02.0562431Z cpu cores : 24 2025-05-07T19:43:02.0562663Z apicid : 14 2025-05-07T19:43:02.0562875Z initial apicid : 14 2025-05-07T19:43:02.0563118Z fpu : yes 2025-05-07T19:43:02.0563328Z fpu_exception : yes 2025-05-07T19:43:02.0563573Z cpuid level : 13 2025-05-07T19:43:02.0563789Z wp : yes 2025-05-07T19:43:02.0566141Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.0568744Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.0569318Z bogomips : 5999.99 2025-05-07T19:43:02.0569560Z clflush size : 64 2025-05-07T19:43:02.0569799Z cache_alignment : 64 2025-05-07T19:43:02.0570072Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.0570420Z power management: 2025-05-07T19:43:02.0570556Z 2025-05-07T19:43:02.0570643Z processor : 8 2025-05-07T19:43:02.0570882Z vendor_id : GenuineIntel 2025-05-07T19:43:02.0571120Z cpu family : 6 2025-05-07T19:43:02.0571346Z model : 85 2025-05-07T19:43:02.0571614Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.0571973Z stepping : 7 2025-05-07T19:43:02.0572182Z microcode : 0x5003901 2025-05-07T19:43:02.0572423Z cpu MHz : 1708.756 2025-05-07T19:43:02.0572654Z cache size : 36608 KB 2025-05-07T19:43:02.0572874Z physical id : 0 2025-05-07T19:43:02.0573096Z siblings : 48 2025-05-07T19:43:02.0573293Z core id : 8 2025-05-07T19:43:02.0573504Z cpu cores : 24 2025-05-07T19:43:02.0573708Z apicid : 16 2025-05-07T19:43:02.0573925Z initial apicid : 16 2025-05-07T19:43:02.0574136Z fpu : yes 2025-05-07T19:43:02.0574348Z fpu_exception : yes 2025-05-07T19:43:02.0574566Z cpuid level : 13 2025-05-07T19:43:02.0574788Z wp : yes 2025-05-07T19:43:02.0577024Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.0579841Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.0580425Z bogomips : 5999.99 2025-05-07T19:43:02.0580658Z clflush size : 64 2025-05-07T19:43:02.0580874Z cache_alignment : 64 2025-05-07T19:43:02.0581153Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.0581467Z power management: 2025-05-07T19:43:02.0581603Z 2025-05-07T19:43:02.0581701Z processor : 9 2025-05-07T19:43:02.0581912Z vendor_id : GenuineIntel 2025-05-07T19:43:02.0582157Z cpu family : 6 2025-05-07T19:43:02.0582359Z model : 85 2025-05-07T19:43:02.0582641Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.0582982Z stepping : 7 2025-05-07T19:43:02.0583200Z microcode : 0x5003901 2025-05-07T19:43:02.0583512Z cpu MHz : 1422.307 2025-05-07T19:43:02.0583727Z cache size : 36608 KB 2025-05-07T19:43:02.0583962Z physical id : 0 2025-05-07T19:43:02.0584170Z siblings : 48 2025-05-07T19:43:02.0584383Z core id : 9 2025-05-07T19:43:02.0584586Z cpu cores : 24 2025-05-07T19:43:02.0584811Z apicid : 18 2025-05-07T19:43:02.0585017Z initial apicid : 18 2025-05-07T19:43:02.0585245Z fpu : yes 2025-05-07T19:43:02.0585441Z fpu_exception : yes 2025-05-07T19:43:02.0585671Z cpuid level : 13 2025-05-07T19:43:02.0585878Z wp : yes 2025-05-07T19:43:02.0588110Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.0591058Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.0591725Z bogomips : 5999.99 2025-05-07T19:43:02.0592079Z clflush size : 64 2025-05-07T19:43:02.0592316Z cache_alignment : 64 2025-05-07T19:43:02.0592598Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.0592942Z power management: 2025-05-07T19:43:02.0593077Z 2025-05-07T19:43:02.0593165Z processor : 10 2025-05-07T19:43:02.0593407Z vendor_id : GenuineIntel 2025-05-07T19:43:02.0593647Z cpu family : 6 2025-05-07T19:43:02.0593868Z model : 85 2025-05-07T19:43:02.0594138Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.0594500Z stepping : 7 2025-05-07T19:43:02.0594705Z microcode : 0x5003901 2025-05-07T19:43:02.0594946Z cpu MHz : 1470.253 2025-05-07T19:43:02.0595187Z cache size : 36608 KB 2025-05-07T19:43:02.0595410Z physical id : 0 2025-05-07T19:43:02.0595636Z siblings : 48 2025-05-07T19:43:02.0595838Z core id : 10 2025-05-07T19:43:02.0596056Z cpu cores : 24 2025-05-07T19:43:02.0596258Z apicid : 20 2025-05-07T19:43:02.0596477Z initial apicid : 20 2025-05-07T19:43:02.0596692Z fpu : yes 2025-05-07T19:43:02.0596908Z fpu_exception : yes 2025-05-07T19:43:02.0597128Z cpuid level : 13 2025-05-07T19:43:02.0597354Z wp : yes 2025-05-07T19:43:02.0599646Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.0602436Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.0603038Z bogomips : 5999.99 2025-05-07T19:43:02.0603273Z clflush size : 64 2025-05-07T19:43:02.0603490Z cache_alignment : 64 2025-05-07T19:43:02.0603797Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.0604249Z power management: 2025-05-07T19:43:02.0604392Z 2025-05-07T19:43:02.0604508Z processor : 11 2025-05-07T19:43:02.0604742Z vendor_id : GenuineIntel 2025-05-07T19:43:02.0605024Z cpu family : 6 2025-05-07T19:43:02.0605238Z model : 85 2025-05-07T19:43:02.0605545Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.0605903Z stepping : 7 2025-05-07T19:43:02.0606156Z microcode : 0x5003901 2025-05-07T19:43:02.0606428Z cpu MHz : 1536.995 2025-05-07T19:43:02.0606659Z cache size : 36608 KB 2025-05-07T19:43:02.0607012Z physical id : 0 2025-05-07T19:43:02.0607245Z siblings : 48 2025-05-07T19:43:02.0607495Z core id : 11 2025-05-07T19:43:02.0607718Z cpu cores : 24 2025-05-07T19:43:02.0607975Z apicid : 22 2025-05-07T19:43:02.0608202Z initial apicid : 22 2025-05-07T19:43:02.0608464Z fpu : yes 2025-05-07T19:43:02.0608686Z fpu_exception : yes 2025-05-07T19:43:02.0608963Z cpuid level : 13 2025-05-07T19:43:02.0609196Z wp : yes 2025-05-07T19:43:02.0611463Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.0614078Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.0614703Z bogomips : 5999.99 2025-05-07T19:43:02.0614943Z clflush size : 64 2025-05-07T19:43:02.0615203Z cache_alignment : 64 2025-05-07T19:43:02.0615497Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.0615867Z power management: 2025-05-07T19:43:02.0616010Z 2025-05-07T19:43:02.0616107Z processor : 12 2025-05-07T19:43:02.0616377Z vendor_id : GenuineIntel 2025-05-07T19:43:02.0616641Z cpu family : 6 2025-05-07T19:43:02.0616892Z model : 85 2025-05-07T19:43:02.0617169Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.0617544Z stepping : 7 2025-05-07T19:43:02.0617760Z microcode : 0x5003901 2025-05-07T19:43:02.0618019Z cpu MHz : 2999.998 2025-05-07T19:43:02.0618274Z cache size : 36608 KB 2025-05-07T19:43:02.0618506Z physical id : 0 2025-05-07T19:43:02.0618759Z siblings : 48 2025-05-07T19:43:02.0618973Z core id : 12 2025-05-07T19:43:02.0619213Z cpu cores : 24 2025-05-07T19:43:02.0619427Z apicid : 24 2025-05-07T19:43:02.0619672Z initial apicid : 24 2025-05-07T19:43:02.0619900Z fpu : yes 2025-05-07T19:43:02.0620132Z fpu_exception : yes 2025-05-07T19:43:02.0620544Z cpuid level : 13 2025-05-07T19:43:02.0620808Z wp : yes 2025-05-07T19:43:02.0623511Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.0626183Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.0626764Z bogomips : 5999.99 2025-05-07T19:43:02.0626989Z clflush size : 64 2025-05-07T19:43:02.0627202Z cache_alignment : 64 2025-05-07T19:43:02.0627483Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.0627796Z power management: 2025-05-07T19:43:02.0627927Z 2025-05-07T19:43:02.0628026Z processor : 13 2025-05-07T19:43:02.0628241Z vendor_id : GenuineIntel 2025-05-07T19:43:02.0628490Z cpu family : 6 2025-05-07T19:43:02.0628690Z model : 85 2025-05-07T19:43:02.0628979Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.0629319Z stepping : 7 2025-05-07T19:43:02.0629539Z microcode : 0x5003901 2025-05-07T19:43:02.0629776Z cpu MHz : 2999.998 2025-05-07T19:43:02.0629987Z cache size : 36608 KB 2025-05-07T19:43:02.0630226Z physical id : 0 2025-05-07T19:43:02.0630428Z siblings : 48 2025-05-07T19:43:02.0630711Z core id : 13 2025-05-07T19:43:02.0630911Z cpu cores : 24 2025-05-07T19:43:02.0631131Z apicid : 26 2025-05-07T19:43:02.0631448Z initial apicid : 26 2025-05-07T19:43:02.0631859Z fpu : yes 2025-05-07T19:43:02.0632062Z fpu_exception : yes 2025-05-07T19:43:02.0632353Z cpuid level : 13 2025-05-07T19:43:02.0632563Z wp : yes 2025-05-07T19:43:02.0634847Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.0637485Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.0638083Z bogomips : 5999.99 2025-05-07T19:43:02.0638301Z clflush size : 64 2025-05-07T19:43:02.0638538Z cache_alignment : 64 2025-05-07T19:43:02.0638806Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.0639146Z power management: 2025-05-07T19:43:02.0639279Z 2025-05-07T19:43:02.0639365Z processor : 14 2025-05-07T19:43:02.0639600Z vendor_id : GenuineIntel 2025-05-07T19:43:02.0639837Z cpu family : 6 2025-05-07T19:43:02.0640056Z model : 85 2025-05-07T19:43:02.0640326Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.0640691Z stepping : 7 2025-05-07T19:43:02.0640898Z microcode : 0x5003901 2025-05-07T19:43:02.0641141Z cpu MHz : 1192.532 2025-05-07T19:43:02.0641374Z cache size : 36608 KB 2025-05-07T19:43:02.0641599Z physical id : 0 2025-05-07T19:43:02.0641823Z siblings : 48 2025-05-07T19:43:02.0642025Z core id : 14 2025-05-07T19:43:02.0642242Z cpu cores : 24 2025-05-07T19:43:02.0642452Z apicid : 28 2025-05-07T19:43:02.0642677Z initial apicid : 28 2025-05-07T19:43:02.0642900Z fpu : yes 2025-05-07T19:43:02.0643127Z fpu_exception : yes 2025-05-07T19:43:02.0643352Z cpuid level : 13 2025-05-07T19:43:02.0643577Z wp : yes 2025-05-07T19:43:02.0645838Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.0648564Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.0649156Z bogomips : 5999.99 2025-05-07T19:43:02.0649392Z clflush size : 64 2025-05-07T19:43:02.0649609Z cache_alignment : 64 2025-05-07T19:43:02.0649896Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.0650229Z power management: 2025-05-07T19:43:02.0650362Z 2025-05-07T19:43:02.0650466Z processor : 15 2025-05-07T19:43:02.0650690Z vendor_id : GenuineIntel 2025-05-07T19:43:02.0651071Z cpu family : 6 2025-05-07T19:43:02.0651275Z model : 85 2025-05-07T19:43:02.0651560Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.0651913Z stepping : 7 2025-05-07T19:43:02.0652156Z microcode : 0x5003901 2025-05-07T19:43:02.0652410Z cpu MHz : 1200.765 2025-05-07T19:43:02.0652636Z cache size : 36608 KB 2025-05-07T19:43:02.0653001Z physical id : 0 2025-05-07T19:43:02.0653201Z siblings : 48 2025-05-07T19:43:02.0653426Z core id : 15 2025-05-07T19:43:02.0653806Z cpu cores : 24 2025-05-07T19:43:02.0654058Z apicid : 30 2025-05-07T19:43:02.0654334Z initial apicid : 30 2025-05-07T19:43:02.0654573Z fpu : yes 2025-05-07T19:43:02.0654773Z fpu_exception : yes 2025-05-07T19:43:02.0655012Z cpuid level : 13 2025-05-07T19:43:02.0655219Z wp : yes 2025-05-07T19:43:02.0657455Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.0660038Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.0660624Z bogomips : 5999.99 2025-05-07T19:43:02.0660844Z clflush size : 64 2025-05-07T19:43:02.0661074Z cache_alignment : 64 2025-05-07T19:43:02.0661336Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.0661664Z power management: 2025-05-07T19:43:02.0661796Z 2025-05-07T19:43:02.0661880Z processor : 16 2025-05-07T19:43:02.0662111Z vendor_id : GenuineIntel 2025-05-07T19:43:02.0662344Z cpu family : 6 2025-05-07T19:43:02.0662553Z model : 85 2025-05-07T19:43:02.0662819Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.0663173Z stepping : 7 2025-05-07T19:43:02.0663389Z microcode : 0x5003901 2025-05-07T19:43:02.0663615Z cpu MHz : 1417.560 2025-05-07T19:43:02.0663841Z cache size : 36608 KB 2025-05-07T19:43:02.0664063Z physical id : 0 2025-05-07T19:43:02.0664284Z siblings : 48 2025-05-07T19:43:02.0664491Z core id : 16 2025-05-07T19:43:02.0664706Z cpu cores : 24 2025-05-07T19:43:02.0664910Z apicid : 32 2025-05-07T19:43:02.0665126Z initial apicid : 32 2025-05-07T19:43:02.0665348Z fpu : yes 2025-05-07T19:43:02.0665584Z fpu_exception : yes 2025-05-07T19:43:02.0665815Z cpuid level : 13 2025-05-07T19:43:02.0666049Z wp : yes 2025-05-07T19:43:02.0668292Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.0675598Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.0676228Z bogomips : 5999.99 2025-05-07T19:43:02.0676505Z clflush size : 64 2025-05-07T19:43:02.0676743Z cache_alignment : 64 2025-05-07T19:43:02.0677037Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.0677365Z power management: 2025-05-07T19:43:02.0677500Z 2025-05-07T19:43:02.0677605Z processor : 17 2025-05-07T19:43:02.0677826Z vendor_id : GenuineIntel 2025-05-07T19:43:02.0678088Z cpu family : 6 2025-05-07T19:43:02.0678305Z model : 85 2025-05-07T19:43:02.0678610Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.0678959Z stepping : 7 2025-05-07T19:43:02.0679187Z microcode : 0x5003901 2025-05-07T19:43:02.0679430Z cpu MHz : 1636.363 2025-05-07T19:43:02.0679653Z cache size : 36608 KB 2025-05-07T19:43:02.0679896Z physical id : 0 2025-05-07T19:43:02.0680109Z siblings : 48 2025-05-07T19:43:02.0680328Z core id : 17 2025-05-07T19:43:02.0680529Z cpu cores : 24 2025-05-07T19:43:02.0680753Z apicid : 34 2025-05-07T19:43:02.0680957Z initial apicid : 34 2025-05-07T19:43:02.0681191Z fpu : yes 2025-05-07T19:43:02.0681490Z fpu_exception : yes 2025-05-07T19:43:02.0681736Z cpuid level : 13 2025-05-07T19:43:02.0681946Z wp : yes 2025-05-07T19:43:02.0684239Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.0687058Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.0687657Z bogomips : 5999.99 2025-05-07T19:43:02.0687872Z clflush size : 64 2025-05-07T19:43:02.0688104Z cache_alignment : 64 2025-05-07T19:43:02.0688371Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.0688706Z power management: 2025-05-07T19:43:02.0688838Z 2025-05-07T19:43:02.0688922Z processor : 18 2025-05-07T19:43:02.0689152Z vendor_id : GenuineIntel 2025-05-07T19:43:02.0689387Z cpu family : 6 2025-05-07T19:43:02.0689604Z model : 85 2025-05-07T19:43:02.0689877Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.0690237Z stepping : 7 2025-05-07T19:43:02.0690459Z microcode : 0x5003901 2025-05-07T19:43:02.0690864Z cpu MHz : 2999.998 2025-05-07T19:43:02.0691098Z cache size : 36608 KB 2025-05-07T19:43:02.0691382Z physical id : 0 2025-05-07T19:43:02.0691607Z siblings : 48 2025-05-07T19:43:02.0691846Z core id : 18 2025-05-07T19:43:02.0692066Z cpu cores : 24 2025-05-07T19:43:02.0692274Z apicid : 36 2025-05-07T19:43:02.0692489Z initial apicid : 36 2025-05-07T19:43:02.0692700Z fpu : yes 2025-05-07T19:43:02.0692913Z fpu_exception : yes 2025-05-07T19:43:02.0693137Z cpuid level : 13 2025-05-07T19:43:02.0693358Z wp : yes 2025-05-07T19:43:02.0695666Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.0698714Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.0699324Z bogomips : 5999.99 2025-05-07T19:43:02.0699566Z clflush size : 64 2025-05-07T19:43:02.0699789Z cache_alignment : 64 2025-05-07T19:43:02.0700085Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.0700415Z power management: 2025-05-07T19:43:02.0700571Z 2025-05-07T19:43:02.0700658Z processor : 19 2025-05-07T19:43:02.0700884Z vendor_id : GenuineIntel 2025-05-07T19:43:02.0701147Z cpu family : 6 2025-05-07T19:43:02.0701354Z model : 85 2025-05-07T19:43:02.0701653Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.0702008Z stepping : 7 2025-05-07T19:43:02.0702235Z microcode : 0x5003901 2025-05-07T19:43:02.0702484Z cpu MHz : 1201.402 2025-05-07T19:43:02.0724069Z cache size : 36608 KB 2025-05-07T19:43:02.0724382Z physical id : 0 2025-05-07T19:43:02.0724616Z siblings : 48 2025-05-07T19:43:02.0724834Z core id : 19 2025-05-07T19:43:02.0725033Z cpu cores : 24 2025-05-07T19:43:02.0725245Z apicid : 38 2025-05-07T19:43:02.0725443Z initial apicid : 38 2025-05-07T19:43:02.0725672Z fpu : yes 2025-05-07T19:43:02.0725867Z fpu_exception : yes 2025-05-07T19:43:02.0726097Z cpuid level : 13 2025-05-07T19:43:02.0726291Z wp : yes 2025-05-07T19:43:02.0728715Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.0731306Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.0731878Z bogomips : 5999.99 2025-05-07T19:43:02.0732116Z clflush size : 64 2025-05-07T19:43:02.0732333Z cache_alignment : 64 2025-05-07T19:43:02.0732625Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.0732966Z power management: 2025-05-07T19:43:02.0733098Z 2025-05-07T19:43:02.0733181Z processor : 20 2025-05-07T19:43:02.0733409Z vendor_id : GenuineIntel 2025-05-07T19:43:02.0733641Z cpu family : 6 2025-05-07T19:43:02.0733854Z model : 85 2025-05-07T19:43:02.0734118Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.0734471Z stepping : 7 2025-05-07T19:43:02.0735189Z lspci: Unable to load libkmod resources: error -2 2025-05-07T19:43:02.0735519Z microcode : 0x5003901 2025-05-07T19:43:02.0735753Z cpu MHz : 2999.998 2025-05-07T19:43:02.0735968Z cache size : 36608 KB 2025-05-07T19:43:02.0736208Z physical id : 0 2025-05-07T19:43:02.0736415Z siblings : 48 2025-05-07T19:43:02.0736629Z core id : 20 2025-05-07T19:43:02.0736830Z cpu cores : 24 2025-05-07T19:43:02.0737040Z apicid : 40 2025-05-07T19:43:02.0737229Z initial apicid : 40 2025-05-07T19:43:02.0737447Z fpu : yes 2025-05-07T19:43:02.0737624Z fpu_exception : yes 2025-05-07T19:43:02.0737841Z cpuid level : 13 2025-05-07T19:43:02.0738039Z wp : yes 2025-05-07T19:43:02.0740245Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.0742876Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.0743462Z bogomips : 5999.99 2025-05-07T19:43:02.0743675Z clflush size : 64 2025-05-07T19:43:02.0743902Z cache_alignment : 64 2025-05-07T19:43:02.0744164Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.0744485Z power management: 2025-05-07T19:43:02.0744615Z 2025-05-07T19:43:02.0744692Z processor : 21 2025-05-07T19:43:02.0744901Z vendor_id : GenuineIntel 2025-05-07T19:43:02.0745119Z cpu family : 6 2025-05-07T19:43:02.0745312Z model : 85 2025-05-07T19:43:02.0745566Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.0745903Z stepping : 7 2025-05-07T19:43:02.0746101Z microcode : 0x5003901 2025-05-07T19:43:02.0746308Z cpu MHz : 1391.135 2025-05-07T19:43:02.0746519Z cache size : 36608 KB 2025-05-07T19:43:02.0746723Z physical id : 0 2025-05-07T19:43:02.0746929Z siblings : 48 2025-05-07T19:43:02.0747113Z core id : 21 2025-05-07T19:43:02.0747306Z cpu cores : 24 2025-05-07T19:43:02.0747499Z apicid : 42 2025-05-07T19:43:02.0747704Z initial apicid : 42 2025-05-07T19:43:02.0747905Z fpu : yes 2025-05-07T19:43:02.0748095Z fpu_exception : yes 2025-05-07T19:43:02.0748296Z cpuid level : 13 2025-05-07T19:43:02.0748501Z wp : yes 2025-05-07T19:43:02.0750764Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.0753680Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.0754271Z bogomips : 5999.99 2025-05-07T19:43:02.0754478Z clflush size : 64 2025-05-07T19:43:02.0754679Z cache_alignment : 64 2025-05-07T19:43:02.0754955Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.0755275Z power management: 2025-05-07T19:43:02.0755419Z 2025-05-07T19:43:02.0755505Z processor : 22 2025-05-07T19:43:02.0755715Z vendor_id : GenuineIntel 2025-05-07T19:43:02.0755958Z cpu family : 6 2025-05-07T19:43:02.0756153Z model : 85 2025-05-07T19:43:02.0756432Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.0756770Z stepping : 7 2025-05-07T19:43:02.0756985Z microcode : 0x5003901 2025-05-07T19:43:02.0757218Z cpu MHz : 1374.990 2025-05-07T19:43:02.0757426Z cache size : 36608 KB 2025-05-07T19:43:02.0757651Z physical id : 0 2025-05-07T19:43:02.0757845Z siblings : 48 2025-05-07T19:43:02.0758053Z core id : 22 2025-05-07T19:43:02.0758242Z cpu cores : 24 2025-05-07T19:43:02.0758439Z apicid : 44 2025-05-07T19:43:02.0758636Z initial apicid : 44 2025-05-07T19:43:02.0758853Z fpu : yes 2025-05-07T19:43:02.0759044Z fpu_exception : yes 2025-05-07T19:43:02.0759265Z cpuid level : 13 2025-05-07T19:43:02.0759459Z wp : yes 2025-05-07T19:43:02.0761723Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.0764415Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.0765021Z bogomips : 5999.99 2025-05-07T19:43:02.0765211Z clflush size : 64 2025-05-07T19:43:02.0765411Z cache_alignment : 64 2025-05-07T19:43:02.0765648Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.0765955Z power management: 2025-05-07T19:43:02.0766072Z 2025-05-07T19:43:02.0766153Z processor : 23 2025-05-07T19:43:02.0766363Z vendor_id : GenuineIntel 2025-05-07T19:43:02.0766575Z cpu family : 6 2025-05-07T19:43:02.0766767Z model : 85 2025-05-07T19:43:02.0767011Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.0767334Z stepping : 7 2025-05-07T19:43:02.0767528Z microcode : 0x5003901 2025-05-07T19:43:02.0767723Z cpu MHz : 2999.998 2025-05-07T19:43:02.0767925Z cache size : 36608 KB 2025-05-07T19:43:02.0768126Z physical id : 0 2025-05-07T19:43:02.0768322Z siblings : 48 2025-05-07T19:43:02.0768498Z core id : 23 2025-05-07T19:43:02.0768689Z cpu cores : 24 2025-05-07T19:43:02.0768872Z apicid : 46 2025-05-07T19:43:02.0769066Z initial apicid : 46 2025-05-07T19:43:02.0769252Z fpu : yes 2025-05-07T19:43:02.0769436Z fpu_exception : yes 2025-05-07T19:43:02.0769627Z cpuid level : 13 2025-05-07T19:43:02.0769820Z wp : yes 2025-05-07T19:43:02.0771988Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.0774412Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.0774954Z bogomips : 5999.99 2025-05-07T19:43:02.0775160Z clflush size : 64 2025-05-07T19:43:02.0775358Z cache_alignment : 64 2025-05-07T19:43:02.0775618Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.0775911Z power management: 2025-05-07T19:43:02.0776043Z 2025-05-07T19:43:02.0776123Z processor : 24 2025-05-07T19:43:02.0776317Z vendor_id : GenuineIntel 2025-05-07T19:43:02.0776542Z cpu family : 6 2025-05-07T19:43:02.0776723Z model : 85 2025-05-07T19:43:02.0776977Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.0777291Z stepping : 7 2025-05-07T19:43:02.0777486Z microcode : 0x5003901 2025-05-07T19:43:02.0777701Z cpu MHz : 3085.582 2025-05-07T19:43:02.0777895Z cache size : 36608 KB 2025-05-07T19:43:02.0778105Z physical id : 1 2025-05-07T19:43:02.0778287Z siblings : 48 2025-05-07T19:43:02.0778477Z core id : 0 2025-05-07T19:43:02.0778651Z cpu cores : 24 2025-05-07T19:43:02.0778842Z apicid : 64 2025-05-07T19:43:02.0779023Z initial apicid : 64 2025-05-07T19:43:02.0779223Z fpu : yes 2025-05-07T19:43:02.0779393Z fpu_exception : yes 2025-05-07T19:43:02.0779598Z cpuid level : 13 2025-05-07T19:43:02.0779778Z wp : yes 2025-05-07T19:43:02.0781868Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.0784296Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.0784926Z bogomips : 5999.99 2025-05-07T19:43:02.0785275Z clflush size : 64 2025-05-07T19:43:02.0785484Z cache_alignment : 64 2025-05-07T19:43:02.0785724Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.0786032Z power management: 2025-05-07T19:43:02.0786149Z 2025-05-07T19:43:02.0786224Z processor : 25 2025-05-07T19:43:02.0786432Z vendor_id : GenuineIntel 2025-05-07T19:43:02.0786646Z cpu family : 6 2025-05-07T19:43:02.0786849Z model : 85 2025-05-07T19:43:02.0787086Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.0787408Z stepping : 7 2025-05-07T19:43:02.0787598Z microcode : 0x5003901 2025-05-07T19:43:02.0787796Z cpu MHz : 3101.014 2025-05-07T19:43:02.0787990Z cache size : 36608 KB 2025-05-07T19:43:02.0788186Z physical id : 1 2025-05-07T19:43:02.0788393Z siblings : 48 2025-05-07T19:43:02.0788565Z core id : 1 2025-05-07T19:43:02.0788754Z cpu cores : 24 2025-05-07T19:43:02.0788932Z apicid : 66 2025-05-07T19:43:02.0789123Z initial apicid : 66 2025-05-07T19:43:02.0789314Z fpu : yes 2025-05-07T19:43:02.0789503Z fpu_exception : yes 2025-05-07T19:43:02.0789701Z cpuid level : 13 2025-05-07T19:43:02.0789893Z wp : yes 2025-05-07T19:43:02.0792758Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.0795406Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.0795982Z bogomips : 5999.99 2025-05-07T19:43:02.0796198Z clflush size : 64 2025-05-07T19:43:02.0796403Z cache_alignment : 64 2025-05-07T19:43:02.0796675Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.0796981Z power management: 2025-05-07T19:43:02.0797109Z 2025-05-07T19:43:02.0797200Z processor : 26 2025-05-07T19:43:02.0797410Z vendor_id : GenuineIntel 2025-05-07T19:43:02.0797644Z cpu family : 6 2025-05-07T19:43:02.0797837Z model : 85 2025-05-07T19:43:02.0798114Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.0798441Z stepping : 7 2025-05-07T19:43:02.0798654Z microcode : 0x5003901 2025-05-07T19:43:02.0798874Z cpu MHz : 3777.951 2025-05-07T19:43:02.0799076Z cache size : 36608 KB 2025-05-07T19:43:02.0799299Z physical id : 1 2025-05-07T19:43:02.0799493Z siblings : 48 2025-05-07T19:43:02.0799689Z core id : 2 2025-05-07T19:43:02.0799874Z cpu cores : 24 2025-05-07T19:43:02.0800076Z apicid : 68 2025-05-07T19:43:02.0800269Z initial apicid : 68 2025-05-07T19:43:02.0800474Z fpu : yes 2025-05-07T19:43:02.0800662Z fpu_exception : yes 2025-05-07T19:43:02.0800878Z cpuid level : 13 2025-05-07T19:43:02.0801068Z wp : yes 2025-05-07T19:43:02.0803318Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.0805877Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.0806425Z bogomips : 5999.99 2025-05-07T19:43:02.0806623Z clflush size : 64 2025-05-07T19:43:02.0806997Z cache_alignment : 64 2025-05-07T19:43:02.0807235Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.0807533Z power management: 2025-05-07T19:43:02.0807651Z 2025-05-07T19:43:02.0807724Z processor : 27 2025-05-07T19:43:02.0807932Z vendor_id : GenuineIntel 2025-05-07T19:43:02.0808150Z cpu family : 6 2025-05-07T19:43:02.0808342Z model : 85 2025-05-07T19:43:02.0808591Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.0808909Z stepping : 7 2025-05-07T19:43:02.0809107Z microcode : 0x5003901 2025-05-07T19:43:02.0809313Z cpu MHz : 3170.273 2025-05-07T19:43:02.0809515Z cache size : 36608 KB 2025-05-07T19:43:02.0809713Z physical id : 1 2025-05-07T19:43:02.0809905Z siblings : 48 2025-05-07T19:43:02.0810074Z core id : 3 2025-05-07T19:43:02.0810257Z cpu cores : 24 2025-05-07T19:43:02.0810432Z apicid : 70 2025-05-07T19:43:02.0810616Z initial apicid : 70 2025-05-07T19:43:02.0810801Z fpu : yes 2025-05-07T19:43:02.0810985Z fpu_exception : yes 2025-05-07T19:43:02.0811178Z cpuid level : 13 2025-05-07T19:43:02.0811362Z wp : yes 2025-05-07T19:43:02.0813498Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.0815896Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.0816420Z bogomips : 5999.99 2025-05-07T19:43:02.0816616Z clflush size : 64 2025-05-07T19:43:02.0816802Z cache_alignment : 64 2025-05-07T19:43:02.0817056Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.0817337Z power management: 2025-05-07T19:43:02.0817461Z 2025-05-07T19:43:02.0817546Z processor : 28 2025-05-07T19:43:02.0817734Z vendor_id : GenuineIntel 2025-05-07T19:43:02.0817961Z cpu family : 6 2025-05-07T19:43:02.0818137Z model : 85 2025-05-07T19:43:02.0818393Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.0818698Z stepping : 7 2025-05-07T19:43:02.0818890Z microcode : 0x5003901 2025-05-07T19:43:02.0819094Z cpu MHz : 2999.998 2025-05-07T19:43:02.0819282Z cache size : 36608 KB 2025-05-07T19:43:02.0819487Z physical id : 1 2025-05-07T19:43:02.0819664Z siblings : 48 2025-05-07T19:43:02.0819840Z core id : 4 2025-05-07T19:43:02.0820004Z cpu cores : 24 2025-05-07T19:43:02.0820184Z apicid : 72 2025-05-07T19:43:02.0820354Z initial apicid : 72 2025-05-07T19:43:02.0820554Z fpu : yes 2025-05-07T19:43:02.0820727Z fpu_exception : yes 2025-05-07T19:43:02.0820924Z cpuid level : 13 2025-05-07T19:43:02.0821106Z wp : yes 2025-05-07T19:43:02.0823183Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.0825601Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.0826140Z bogomips : 5999.99 2025-05-07T19:43:02.0826326Z clflush size : 64 2025-05-07T19:43:02.0826522Z cache_alignment : 64 2025-05-07T19:43:02.0826785Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.0827172Z power management: 2025-05-07T19:43:02.0827290Z 2025-05-07T19:43:02.0827363Z processor : 29 2025-05-07T19:43:02.0827566Z vendor_id : GenuineIntel 2025-05-07T19:43:02.0827777Z cpu family : 6 2025-05-07T19:43:02.0827969Z model : 85 2025-05-07T19:43:02.0828209Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.0828531Z stepping : 7 2025-05-07T19:43:02.0828732Z microcode : 0x5003901 2025-05-07T19:43:02.0828933Z cpu MHz : 2999.998 2025-05-07T19:43:02.0829128Z cache size : 36608 KB 2025-05-07T19:43:02.0829326Z physical id : 1 2025-05-07T19:43:02.0829516Z siblings : 48 2025-05-07T19:43:02.0829687Z core id : 5 2025-05-07T19:43:02.0829871Z cpu cores : 24 2025-05-07T19:43:02.0830057Z apicid : 74 2025-05-07T19:43:02.0830268Z initial apicid : 74 2025-05-07T19:43:02.0830471Z fpu : yes 2025-05-07T19:43:02.0830682Z fpu_exception : yes 2025-05-07T19:43:02.0830892Z cpuid level : 13 2025-05-07T19:43:02.0831103Z wp : yes 2025-05-07T19:43:02.0833701Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.0836332Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.0836936Z bogomips : 5999.99 2025-05-07T19:43:02.0837174Z clflush size : 64 2025-05-07T19:43:02.0837385Z cache_alignment : 64 2025-05-07T19:43:02.0837670Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.0837990Z power management: 2025-05-07T19:43:02.0838125Z 2025-05-07T19:43:02.0838229Z processor : 30 2025-05-07T19:43:02.0838443Z vendor_id : GenuineIntel 2025-05-07T19:43:02.0838692Z cpu family : 6 2025-05-07T19:43:02.0838894Z model : 85 2025-05-07T19:43:02.0839179Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.0839522Z stepping : 7 2025-05-07T19:43:02.0839743Z microcode : 0x5003901 2025-05-07T19:43:02.0839982Z cpu MHz : 2999.998 2025-05-07T19:43:02.0840194Z cache size : 36608 KB 2025-05-07T19:43:02.0840431Z physical id : 1 2025-05-07T19:43:02.0840639Z siblings : 48 2025-05-07T19:43:02.0840855Z core id : 6 2025-05-07T19:43:02.0841057Z cpu cores : 24 2025-05-07T19:43:02.0841287Z apicid : 76 2025-05-07T19:43:02.0841488Z initial apicid : 76 2025-05-07T19:43:02.0841714Z fpu : yes 2025-05-07T19:43:02.0841909Z fpu_exception : yes 2025-05-07T19:43:02.0842146Z cpuid level : 13 2025-05-07T19:43:02.0842349Z wp : yes 2025-05-07T19:43:02.0844673Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.0847115Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.0847663Z bogomips : 5999.99 2025-05-07T19:43:02.0847861Z clflush size : 64 2025-05-07T19:43:02.0848081Z cache_alignment : 64 2025-05-07T19:43:02.0848333Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.0848649Z power management: 2025-05-07T19:43:02.0848774Z 2025-05-07T19:43:02.0848915Z processor : 31 2025-05-07T19:43:02.0849133Z vendor_id : GenuineIntel 2025-05-07T19:43:02.0849354Z cpu family : 6 2025-05-07T19:43:02.0849561Z model : 85 2025-05-07T19:43:02.0849815Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.0850158Z stepping : 7 2025-05-07T19:43:02.0850366Z microcode : 0x5003901 2025-05-07T19:43:02.0850573Z cpu MHz : 3151.055 2025-05-07T19:43:02.0850790Z cache size : 36608 KB 2025-05-07T19:43:02.0851000Z physical id : 1 2025-05-07T19:43:02.0851209Z siblings : 48 2025-05-07T19:43:02.0851395Z core id : 7 2025-05-07T19:43:02.0851594Z cpu cores : 24 2025-05-07T19:43:02.0851786Z apicid : 78 2025-05-07T19:43:02.0851997Z initial apicid : 78 2025-05-07T19:43:02.0852196Z fpu : yes 2025-05-07T19:43:02.0852395Z fpu_exception : yes 2025-05-07T19:43:02.0852598Z cpuid level : 13 2025-05-07T19:43:02.0852805Z wp : yes 2025-05-07T19:43:02.0854947Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.0857352Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.0857899Z bogomips : 5999.99 2025-05-07T19:43:02.0858113Z clflush size : 64 2025-05-07T19:43:02.0858312Z cache_alignment : 64 2025-05-07T19:43:02.0858578Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.0858874Z power management: 2025-05-07T19:43:02.0858998Z 2025-05-07T19:43:02.0859091Z processor : 32 2025-05-07T19:43:02.0859292Z vendor_id : GenuineIntel 2025-05-07T19:43:02.0859529Z cpu family : 6 2025-05-07T19:43:02.0859718Z model : 85 2025-05-07T19:43:02.0859982Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.0860302Z stepping : 7 2025-05-07T19:43:02.0860509Z microcode : 0x5003901 2025-05-07T19:43:02.0860732Z cpu MHz : 2999.998 2025-05-07T19:43:02.0860937Z cache size : 36608 KB 2025-05-07T19:43:02.0861179Z physical id : 1 2025-05-07T19:43:02.0861380Z siblings : 48 2025-05-07T19:43:02.0861591Z core id : 8 2025-05-07T19:43:02.0861779Z cpu cores : 24 2025-05-07T19:43:02.0861989Z apicid : 80 2025-05-07T19:43:02.0862177Z initial apicid : 80 2025-05-07T19:43:02.0862386Z fpu : yes 2025-05-07T19:43:02.0862571Z fpu_exception : yes 2025-05-07T19:43:02.0862785Z cpuid level : 13 2025-05-07T19:43:02.0862980Z wp : yes 2025-05-07T19:43:02.0865118Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.0867578Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.0868113Z bogomips : 5999.99 2025-05-07T19:43:02.0868305Z clflush size : 64 2025-05-07T19:43:02.0868512Z cache_alignment : 64 2025-05-07T19:43:02.0868757Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.0869057Z power management: 2025-05-07T19:43:02.0869176Z 2025-05-07T19:43:02.0869253Z processor : 33 2025-05-07T19:43:02.0869461Z vendor_id : GenuineIntel 2025-05-07T19:43:02.0869673Z cpu family : 6 2025-05-07T19:43:02.0869941Z model : 85 2025-05-07T19:43:02.0870187Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.0870510Z stepping : 7 2025-05-07T19:43:02.0870701Z microcode : 0x5003901 2025-05-07T19:43:02.0870906Z cpu MHz : 2999.998 2025-05-07T19:43:02.0871103Z cache size : 36608 KB 2025-05-07T19:43:02.0871393Z physical id : 1 2025-05-07T19:43:02.0871767Z siblings : 48 2025-05-07T19:43:02.0871963Z core id : 9 2025-05-07T19:43:02.0872159Z cpu cores : 24 2025-05-07T19:43:02.0872432Z apicid : 82 2025-05-07T19:43:02.0872636Z initial apicid : 82 2025-05-07T19:43:02.0872836Z fpu : yes 2025-05-07T19:43:02.0873038Z fpu_exception : yes 2025-05-07T19:43:02.0873245Z cpuid level : 13 2025-05-07T19:43:02.0873458Z wp : yes 2025-05-07T19:43:02.0875712Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.0878377Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.0878957Z bogomips : 5999.99 2025-05-07T19:43:02.0879173Z clflush size : 64 2025-05-07T19:43:02.0879379Z cache_alignment : 64 2025-05-07T19:43:02.0879650Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.0879961Z power management: 2025-05-07T19:43:02.0880101Z 2025-05-07T19:43:02.0880182Z processor : 34 2025-05-07T19:43:02.0880390Z vendor_id : GenuineIntel 2025-05-07T19:43:02.0880623Z cpu family : 6 2025-05-07T19:43:02.0880817Z model : 85 2025-05-07T19:43:02.0881090Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.0881427Z stepping : 7 2025-05-07T19:43:02.0881627Z microcode : 0x5003901 2025-05-07T19:43:02.0881849Z cpu MHz : 3153.915 2025-05-07T19:43:02.0882049Z cache size : 36608 KB 2025-05-07T19:43:02.0882261Z physical id : 1 2025-05-07T19:43:02.0882456Z siblings : 48 2025-05-07T19:43:02.0882649Z core id : 10 2025-05-07T19:43:02.0882832Z cpu cores : 24 2025-05-07T19:43:02.0883033Z apicid : 84 2025-05-07T19:43:02.0883569Z initial apicid : 84 2025-05-07T19:43:02.0883784Z fpu : yes 2025-05-07T19:43:02.0884071Z fpu_exception : yes 2025-05-07T19:43:02.0884154Z cpuid level : 13 2025-05-07T19:43:02.0884224Z wp : yes 2025-05-07T19:43:02.0886294Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.0886663Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.0886739Z bogomips : 5999.99 2025-05-07T19:43:02.0886813Z clflush size : 64 2025-05-07T19:43:02.0886895Z cache_alignment : 64 2025-05-07T19:43:02.0887019Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.0887099Z power management: 2025-05-07T19:43:02.0887103Z 2025-05-07T19:43:02.0887189Z processor : 35 2025-05-07T19:43:02.0887269Z vendor_id : GenuineIntel 2025-05-07T19:43:02.0887344Z cpu family : 6 2025-05-07T19:43:02.0887419Z model : 85 2025-05-07T19:43:02.0887581Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.0887710Z stepping : 7 2025-05-07T19:43:02.0887787Z microcode : 0x5003901 2025-05-07T19:43:02.0887870Z cpu MHz : 2999.998 2025-05-07T19:43:02.0887943Z cache size : 36608 KB 2025-05-07T19:43:02.0888019Z physical id : 1 2025-05-07T19:43:02.0888091Z siblings : 48 2025-05-07T19:43:02.0888170Z core id : 11 2025-05-07T19:43:02.0888240Z cpu cores : 24 2025-05-07T19:43:02.0888316Z apicid : 86 2025-05-07T19:43:02.0888394Z initial apicid : 86 2025-05-07T19:43:02.0888480Z fpu : yes 2025-05-07T19:43:02.0888557Z fpu_exception : yes 2025-05-07T19:43:02.0888632Z cpuid level : 13 2025-05-07T19:43:02.0888716Z wp : yes 2025-05-07T19:43:02.0891181Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.0891661Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.0891759Z bogomips : 5999.99 2025-05-07T19:43:02.0891838Z clflush size : 64 2025-05-07T19:43:02.0891920Z cache_alignment : 64 2025-05-07T19:43:02.0892059Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.0892141Z power management: 2025-05-07T19:43:02.0892145Z 2025-05-07T19:43:02.0892222Z processor : 36 2025-05-07T19:43:02.0892319Z vendor_id : GenuineIntel 2025-05-07T19:43:02.0892402Z cpu family : 6 2025-05-07T19:43:02.0892480Z model : 85 2025-05-07T19:43:02.0892636Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.0892727Z stepping : 7 2025-05-07T19:43:02.0892808Z microcode : 0x5003901 2025-05-07T19:43:02.0892884Z cpu MHz : 2999.998 2025-05-07T19:43:02.0892979Z cache size : 36608 KB 2025-05-07T19:43:02.0893056Z physical id : 1 2025-05-07T19:43:02.0893129Z siblings : 48 2025-05-07T19:43:02.0893206Z core id : 12 2025-05-07T19:43:02.0893296Z cpu cores : 24 2025-05-07T19:43:02.0893375Z apicid : 88 2025-05-07T19:43:02.0893459Z initial apicid : 88 2025-05-07T19:43:02.0893541Z fpu : yes 2025-05-07T19:43:02.0893634Z fpu_exception : yes 2025-05-07T19:43:02.0893730Z cpuid level : 13 2025-05-07T19:43:02.0893806Z wp : yes 2025-05-07T19:43:02.0895964Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.0896346Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.0896430Z bogomips : 5999.99 2025-05-07T19:43:02.0896520Z clflush size : 64 2025-05-07T19:43:02.0896604Z cache_alignment : 64 2025-05-07T19:43:02.0896725Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.0896821Z power management: 2025-05-07T19:43:02.0896825Z 2025-05-07T19:43:02.0896905Z processor : 37 2025-05-07T19:43:02.0896994Z vendor_id : GenuineIntel 2025-05-07T19:43:02.0897080Z cpu family : 6 2025-05-07T19:43:02.0897159Z model : 85 2025-05-07T19:43:02.0897314Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.0897393Z stepping : 7 2025-05-07T19:43:02.0897557Z microcode : 0x5003901 2025-05-07T19:43:02.0897637Z cpu MHz : 3098.746 2025-05-07T19:43:02.0897719Z cache size : 36608 KB 2025-05-07T19:43:02.0897799Z physical id : 1 2025-05-07T19:43:02.0897888Z siblings : 48 2025-05-07T19:43:02.0897963Z core id : 13 2025-05-07T19:43:02.0898041Z cpu cores : 24 2025-05-07T19:43:02.0898127Z apicid : 90 2025-05-07T19:43:02.0898214Z initial apicid : 90 2025-05-07T19:43:02.0898294Z fpu : yes 2025-05-07T19:43:02.0898380Z fpu_exception : yes 2025-05-07T19:43:02.0898468Z cpuid level : 13 2025-05-07T19:43:02.0898542Z wp : yes 2025-05-07T19:43:02.0900691Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.0901085Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.0901166Z bogomips : 5999.99 2025-05-07T19:43:02.0901294Z clflush size : 64 2025-05-07T19:43:02.0901387Z cache_alignment : 64 2025-05-07T19:43:02.0901512Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.0901592Z power management: 2025-05-07T19:43:02.0901597Z 2025-05-07T19:43:02.0901679Z processor : 38 2025-05-07T19:43:02.0901764Z vendor_id : GenuineIntel 2025-05-07T19:43:02.0901842Z cpu family : 6 2025-05-07T19:43:02.0901923Z model : 85 2025-05-07T19:43:02.0902094Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.0902174Z stepping : 7 2025-05-07T19:43:02.0902263Z microcode : 0x5003901 2025-05-07T19:43:02.0902360Z cpu MHz : 3206.873 2025-05-07T19:43:02.0902454Z cache size : 36608 KB 2025-05-07T19:43:02.0902538Z physical id : 1 2025-05-07T19:43:02.0902618Z siblings : 48 2025-05-07T19:43:02.0902707Z core id : 14 2025-05-07T19:43:02.0902791Z cpu cores : 24 2025-05-07T19:43:02.0902872Z apicid : 92 2025-05-07T19:43:02.0902977Z initial apicid : 92 2025-05-07T19:43:02.0903170Z fpu : yes 2025-05-07T19:43:02.0903261Z fpu_exception : yes 2025-05-07T19:43:02.0903346Z cpuid level : 13 2025-05-07T19:43:02.0903438Z wp : yes 2025-05-07T19:43:02.0905525Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.0905924Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.0906002Z bogomips : 5999.99 2025-05-07T19:43:02.0906082Z clflush size : 64 2025-05-07T19:43:02.0906172Z cache_alignment : 64 2025-05-07T19:43:02.0906310Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.0906383Z power management: 2025-05-07T19:43:02.0906387Z 2025-05-07T19:43:02.0906467Z processor : 39 2025-05-07T19:43:02.0906568Z vendor_id : GenuineIntel 2025-05-07T19:43:02.0906646Z cpu family : 6 2025-05-07T19:43:02.0906718Z model : 85 2025-05-07T19:43:02.0906866Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.0906970Z stepping : 7 2025-05-07T19:43:02.0907053Z microcode : 0x5003901 2025-05-07T19:43:02.0907130Z cpu MHz : 2999.998 2025-05-07T19:43:02.0907220Z cache size : 36608 KB 2025-05-07T19:43:02.0907356Z physical id : 1 2025-05-07T19:43:02.0907430Z siblings : 48 2025-05-07T19:43:02.0907502Z core id : 15 2025-05-07T19:43:02.0907591Z cpu cores : 24 2025-05-07T19:43:02.0907661Z apicid : 94 2025-05-07T19:43:02.0907748Z initial apicid : 94 2025-05-07T19:43:02.0907837Z fpu : yes 2025-05-07T19:43:02.0907921Z fpu_exception : yes 2025-05-07T19:43:02.0908002Z cpuid level : 13 2025-05-07T19:43:02.0908080Z wp : yes 2025-05-07T19:43:02.0910095Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.0910457Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.0910552Z bogomips : 5999.99 2025-05-07T19:43:02.0910634Z clflush size : 64 2025-05-07T19:43:02.0910718Z cache_alignment : 64 2025-05-07T19:43:02.0910884Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.0910979Z power management: 2025-05-07T19:43:02.0910983Z 2025-05-07T19:43:02.0911064Z processor : 40 2025-05-07T19:43:02.0911150Z vendor_id : GenuineIntel 2025-05-07T19:43:02.0911331Z cpu family : 6 2025-05-07T19:43:02.0911408Z model : 85 2025-05-07T19:43:02.0911561Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.0911816Z stepping : 7 2025-05-07T19:43:02.0911917Z microcode : 0x5003901 2025-05-07T19:43:02.0911999Z cpu MHz : 3100.321 2025-05-07T19:43:02.0912088Z cache size : 36608 KB 2025-05-07T19:43:02.0912186Z physical id : 1 2025-05-07T19:43:02.0912322Z siblings : 48 2025-05-07T19:43:02.0912400Z core id : 16 2025-05-07T19:43:02.0912482Z cpu cores : 24 2025-05-07T19:43:02.0912582Z apicid : 96 2025-05-07T19:43:02.0912667Z initial apicid : 96 2025-05-07T19:43:02.0912751Z fpu : yes 2025-05-07T19:43:02.0912852Z fpu_exception : yes 2025-05-07T19:43:02.0912936Z cpuid level : 13 2025-05-07T19:43:02.0913019Z wp : yes 2025-05-07T19:43:02.0915163Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.0915564Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.0915651Z bogomips : 5999.99 2025-05-07T19:43:02.0915754Z clflush size : 64 2025-05-07T19:43:02.0915836Z cache_alignment : 64 2025-05-07T19:43:02.0915962Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.0916055Z power management: 2025-05-07T19:43:02.0916059Z 2025-05-07T19:43:02.0916158Z processor : 41 2025-05-07T19:43:02.0916241Z vendor_id : GenuineIntel 2025-05-07T19:43:02.0916316Z cpu family : 6 2025-05-07T19:43:02.0916417Z model : 85 2025-05-07T19:43:02.0916579Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.0916661Z stepping : 7 2025-05-07T19:43:02.0916749Z microcode : 0x5003901 2025-05-07T19:43:02.0916848Z cpu MHz : 3160.821 2025-05-07T19:43:02.0916931Z cache size : 36608 KB 2025-05-07T19:43:02.0917012Z physical id : 1 2025-05-07T19:43:02.0917107Z siblings : 48 2025-05-07T19:43:02.0917263Z core id : 17 2025-05-07T19:43:02.0917347Z cpu cores : 24 2025-05-07T19:43:02.0917422Z apicid : 98 2025-05-07T19:43:02.0917526Z initial apicid : 98 2025-05-07T19:43:02.0917610Z fpu : yes 2025-05-07T19:43:02.0917696Z fpu_exception : yes 2025-05-07T19:43:02.0917781Z cpuid level : 13 2025-05-07T19:43:02.0917880Z wp : yes 2025-05-07T19:43:02.0920017Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.0920419Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.0920503Z bogomips : 5999.99 2025-05-07T19:43:02.0920579Z clflush size : 64 2025-05-07T19:43:02.0920666Z cache_alignment : 64 2025-05-07T19:43:02.0920791Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.0920869Z power management: 2025-05-07T19:43:02.0922448Z 2025-05-07T19:43:02.0922536Z processor : 42 2025-05-07T19:43:02.0922634Z vendor_id : GenuineIntel 2025-05-07T19:43:02.0922712Z cpu family : 6 2025-05-07T19:43:02.0922785Z model : 85 2025-05-07T19:43:02.0922953Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.0923034Z stepping : 7 2025-05-07T19:43:02.0923116Z microcode : 0x5003901 2025-05-07T19:43:02.0923195Z cpu MHz : 3141.515 2025-05-07T19:43:02.0923291Z cache size : 36608 KB 2025-05-07T19:43:02.0923371Z physical id : 1 2025-05-07T19:43:02.0923447Z siblings : 48 2025-05-07T19:43:02.0923531Z core id : 18 2025-05-07T19:43:02.0923614Z cpu cores : 24 2025-05-07T19:43:02.0923693Z apicid : 100 2025-05-07T19:43:02.0923775Z initial apicid : 100 2025-05-07T19:43:02.0923972Z fpu : yes 2025-05-07T19:43:02.0924050Z fpu_exception : yes 2025-05-07T19:43:02.0924123Z cpuid level : 13 2025-05-07T19:43:02.0924194Z wp : yes 2025-05-07T19:43:02.0926209Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.0926568Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.0926656Z bogomips : 5999.99 2025-05-07T19:43:02.0926729Z clflush size : 64 2025-05-07T19:43:02.0926806Z cache_alignment : 64 2025-05-07T19:43:02.0926923Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.0927010Z power management: 2025-05-07T19:43:02.0927014Z 2025-05-07T19:43:02.0927090Z processor : 43 2025-05-07T19:43:02.0927173Z vendor_id : GenuineIntel 2025-05-07T19:43:02.0927255Z cpu family : 6 2025-05-07T19:43:02.0927323Z model : 85 2025-05-07T19:43:02.0927472Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.0927557Z stepping : 7 2025-05-07T19:43:02.0927631Z microcode : 0x5003901 2025-05-07T19:43:02.0927703Z cpu MHz : 2999.998 2025-05-07T19:43:02.0927784Z cache size : 36608 KB 2025-05-07T19:43:02.0927864Z physical id : 1 2025-05-07T19:43:02.0927933Z siblings : 48 2025-05-07T19:43:02.0928004Z core id : 19 2025-05-07T19:43:02.0928075Z cpu cores : 24 2025-05-07T19:43:02.0928214Z apicid : 102 2025-05-07T19:43:02.0928292Z initial apicid : 102 2025-05-07T19:43:02.0928365Z fpu : yes 2025-05-07T19:43:02.0928451Z fpu_exception : yes 2025-05-07T19:43:02.0928524Z cpuid level : 13 2025-05-07T19:43:02.0928593Z wp : yes 2025-05-07T19:43:02.0930585Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.0930939Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.0931019Z bogomips : 5999.99 2025-05-07T19:43:02.0931108Z clflush size : 64 2025-05-07T19:43:02.0931185Z cache_alignment : 64 2025-05-07T19:43:02.0931304Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.0931381Z power management: 2025-05-07T19:43:02.0931398Z 2025-05-07T19:43:02.0931470Z processor : 44 2025-05-07T19:43:02.0931603Z vendor_id : GenuineIntel 2025-05-07T19:43:02.0931678Z cpu family : 6 2025-05-07T19:43:02.0931759Z model : 85 2025-05-07T19:43:02.0931903Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.0931978Z stepping : 7 2025-05-07T19:43:02.0932054Z microcode : 0x5003901 2025-05-07T19:43:02.0932137Z cpu MHz : 2999.998 2025-05-07T19:43:02.0932209Z cache size : 36608 KB 2025-05-07T19:43:02.0932283Z physical id : 1 2025-05-07T19:43:02.0932371Z siblings : 48 2025-05-07T19:43:02.0932439Z core id : 20 2025-05-07T19:43:02.0932510Z cpu cores : 24 2025-05-07T19:43:02.0932581Z apicid : 104 2025-05-07T19:43:02.0932672Z initial apicid : 104 2025-05-07T19:43:02.0932741Z fpu : yes 2025-05-07T19:43:02.0932819Z fpu_exception : yes 2025-05-07T19:43:02.0932908Z cpuid level : 13 2025-05-07T19:43:02.0932977Z wp : yes 2025-05-07T19:43:02.0934969Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.0935334Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.0935410Z bogomips : 5999.99 2025-05-07T19:43:02.0935486Z clflush size : 64 2025-05-07T19:43:02.0935576Z cache_alignment : 64 2025-05-07T19:43:02.0935696Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.0935774Z power management: 2025-05-07T19:43:02.0935779Z 2025-05-07T19:43:02.0935851Z processor : 45 2025-05-07T19:43:02.0935945Z vendor_id : GenuineIntel 2025-05-07T19:43:02.0936026Z cpu family : 6 2025-05-07T19:43:02.0936097Z model : 85 2025-05-07T19:43:02.0936249Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.0936324Z stepping : 7 2025-05-07T19:43:02.0936401Z microcode : 0x5003901 2025-05-07T19:43:02.0936474Z cpu MHz : 2999.998 2025-05-07T19:43:02.0936561Z cache size : 36608 KB 2025-05-07T19:43:02.0936635Z physical id : 1 2025-05-07T19:43:02.0936706Z siblings : 48 2025-05-07T19:43:02.0936786Z core id : 21 2025-05-07T19:43:02.0936857Z cpu cores : 24 2025-05-07T19:43:02.0936929Z apicid : 106 2025-05-07T19:43:02.0937006Z initial apicid : 106 2025-05-07T19:43:02.0937136Z fpu : yes 2025-05-07T19:43:02.0937214Z fpu_exception : yes 2025-05-07T19:43:02.0937289Z cpuid level : 13 2025-05-07T19:43:02.0937366Z wp : yes 2025-05-07T19:43:02.0939355Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.0939705Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.0939786Z bogomips : 5999.99 2025-05-07T19:43:02.0939861Z clflush size : 64 2025-05-07T19:43:02.0939941Z cache_alignment : 64 2025-05-07T19:43:02.0940064Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.0940139Z power management: 2025-05-07T19:43:02.0940144Z 2025-05-07T19:43:02.0940215Z processor : 46 2025-05-07T19:43:02.0940294Z vendor_id : GenuineIntel 2025-05-07T19:43:02.0940372Z cpu family : 6 2025-05-07T19:43:02.0940452Z model : 85 2025-05-07T19:43:02.0940648Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.0940729Z stepping : 7 2025-05-07T19:43:02.0940807Z microcode : 0x5003901 2025-05-07T19:43:02.0940880Z cpu MHz : 2999.998 2025-05-07T19:43:02.0940956Z cache size : 36608 KB 2025-05-07T19:43:02.0941036Z physical id : 1 2025-05-07T19:43:02.0941101Z siblings : 48 2025-05-07T19:43:02.0941173Z core id : 22 2025-05-07T19:43:02.0941251Z cpu cores : 24 2025-05-07T19:43:02.0941324Z apicid : 108 2025-05-07T19:43:02.0941405Z initial apicid : 108 2025-05-07T19:43:02.0941478Z fpu : yes 2025-05-07T19:43:02.0941563Z fpu_exception : yes 2025-05-07T19:43:02.0941638Z cpuid level : 13 2025-05-07T19:43:02.0941710Z wp : yes 2025-05-07T19:43:02.0943700Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.0944055Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.0944129Z bogomips : 5999.99 2025-05-07T19:43:02.0944213Z clflush size : 64 2025-05-07T19:43:02.0944287Z cache_alignment : 64 2025-05-07T19:43:02.0944402Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.0944479Z power management: 2025-05-07T19:43:02.0944483Z 2025-05-07T19:43:02.0944553Z processor : 47 2025-05-07T19:43:02.0944632Z vendor_id : GenuineIntel 2025-05-07T19:43:02.0944709Z cpu family : 6 2025-05-07T19:43:02.0944785Z model : 85 2025-05-07T19:43:02.0944929Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.0944998Z stepping : 7 2025-05-07T19:43:02.0945079Z microcode : 0x5003901 2025-05-07T19:43:02.0945150Z cpu MHz : 2999.998 2025-05-07T19:43:02.0945225Z cache size : 36608 KB 2025-05-07T19:43:02.0945329Z physical id : 1 2025-05-07T19:43:02.0945407Z siblings : 48 2025-05-07T19:43:02.0945479Z core id : 23 2025-05-07T19:43:02.0945551Z cpu cores : 24 2025-05-07T19:43:02.0945636Z apicid : 110 2025-05-07T19:43:02.0945712Z initial apicid : 110 2025-05-07T19:43:02.0945781Z fpu : yes 2025-05-07T19:43:02.0945861Z fpu_exception : yes 2025-05-07T19:43:02.0945943Z cpuid level : 13 2025-05-07T19:43:02.0946063Z wp : yes 2025-05-07T19:43:02.0948042Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.0948403Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.0948483Z bogomips : 5999.99 2025-05-07T19:43:02.0948554Z clflush size : 64 2025-05-07T19:43:02.0948646Z cache_alignment : 64 2025-05-07T19:43:02.0948759Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.0948835Z power management: 2025-05-07T19:43:02.0948839Z 2025-05-07T19:43:02.0948923Z processor : 48 2025-05-07T19:43:02.0949005Z vendor_id : GenuineIntel 2025-05-07T19:43:02.0949071Z cpu family : 6 2025-05-07T19:43:02.0949140Z model : 85 2025-05-07T19:43:02.0949295Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.0949430Z stepping : 7 2025-05-07T19:43:02.0949503Z microcode : 0x5003901 2025-05-07T19:43:02.0949584Z cpu MHz : 1196.179 2025-05-07T19:43:02.0949655Z cache size : 36608 KB 2025-05-07T19:43:02.0949728Z physical id : 0 2025-05-07T19:43:02.0949800Z siblings : 48 2025-05-07T19:43:02.0949880Z core id : 0 2025-05-07T19:43:02.0949956Z cpu cores : 24 2025-05-07T19:43:02.0950026Z apicid : 1 2025-05-07T19:43:02.0950118Z initial apicid : 1 2025-05-07T19:43:02.0950182Z fpu : yes 2025-05-07T19:43:02.0950261Z fpu_exception : yes 2025-05-07T19:43:02.0950332Z cpuid level : 13 2025-05-07T19:43:02.0950414Z wp : yes 2025-05-07T19:43:02.0952745Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.0953142Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.0953221Z bogomips : 5999.99 2025-05-07T19:43:02.0953299Z clflush size : 64 2025-05-07T19:43:02.0953382Z cache_alignment : 64 2025-05-07T19:43:02.0953514Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.0953682Z power management: 2025-05-07T19:43:02.0953686Z 2025-05-07T19:43:02.0953767Z processor : 49 2025-05-07T19:43:02.0953859Z vendor_id : GenuineIntel 2025-05-07T19:43:02.0953939Z cpu family : 6 2025-05-07T19:43:02.0954015Z model : 85 2025-05-07T19:43:02.0954174Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.0954265Z stepping : 7 2025-05-07T19:43:02.0954351Z microcode : 0x5003901 2025-05-07T19:43:02.0954432Z cpu MHz : 2999.998 2025-05-07T19:43:02.0954523Z cache size : 36608 KB 2025-05-07T19:43:02.0954603Z physical id : 0 2025-05-07T19:43:02.0954681Z siblings : 48 2025-05-07T19:43:02.0954758Z core id : 1 2025-05-07T19:43:02.0954845Z cpu cores : 24 2025-05-07T19:43:02.0954922Z apicid : 3 2025-05-07T19:43:02.0955005Z initial apicid : 3 2025-05-07T19:43:02.0955084Z fpu : yes 2025-05-07T19:43:02.0955170Z fpu_exception : yes 2025-05-07T19:43:02.0955249Z cpuid level : 13 2025-05-07T19:43:02.0955320Z wp : yes 2025-05-07T19:43:02.0957465Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.0957903Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.0957980Z bogomips : 5999.99 2025-05-07T19:43:02.0958061Z clflush size : 64 2025-05-07T19:43:02.0958139Z cache_alignment : 64 2025-05-07T19:43:02.0958260Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.0958349Z power management: 2025-05-07T19:43:02.0958357Z 2025-05-07T19:43:02.0958431Z processor : 50 2025-05-07T19:43:02.0958513Z vendor_id : GenuineIntel 2025-05-07T19:43:02.0958591Z cpu family : 6 2025-05-07T19:43:02.0958662Z model : 85 2025-05-07T19:43:02.0958814Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.0958888Z stepping : 7 2025-05-07T19:43:02.0958982Z microcode : 0x5003901 2025-05-07T19:43:02.0959111Z cpu MHz : 1186.066 2025-05-07T19:43:02.0959194Z cache size : 36608 KB 2025-05-07T19:43:02.0959287Z physical id : 0 2025-05-07T19:43:02.0959364Z siblings : 48 2025-05-07T19:43:02.0959440Z core id : 2 2025-05-07T19:43:02.0959520Z cpu cores : 24 2025-05-07T19:43:02.0959614Z apicid : 5 2025-05-07T19:43:02.0959694Z initial apicid : 5 2025-05-07T19:43:02.0959770Z fpu : yes 2025-05-07T19:43:02.0959852Z fpu_exception : yes 2025-05-07T19:43:02.0959946Z cpuid level : 13 2025-05-07T19:43:02.0960018Z wp : yes 2025-05-07T19:43:02.0962171Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.0962566Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.0962648Z bogomips : 5999.99 2025-05-07T19:43:02.0962728Z clflush size : 64 2025-05-07T19:43:02.0962822Z cache_alignment : 64 2025-05-07T19:43:02.0962949Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.0963030Z power management: 2025-05-07T19:43:02.0963034Z 2025-05-07T19:43:02.0963128Z processor : 51 2025-05-07T19:43:02.0963222Z vendor_id : GenuineIntel 2025-05-07T19:43:02.0963301Z cpu family : 6 2025-05-07T19:43:02.0963376Z model : 85 2025-05-07T19:43:02.0963542Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.0963620Z stepping : 7 2025-05-07T19:43:02.0963702Z microcode : 0x5003901 2025-05-07T19:43:02.0963794Z cpu MHz : 2999.998 2025-05-07T19:43:02.0963988Z cache size : 36608 KB 2025-05-07T19:43:02.0964063Z physical id : 0 2025-05-07T19:43:02.0964133Z siblings : 48 2025-05-07T19:43:02.0964215Z core id : 3 2025-05-07T19:43:02.0964290Z cpu cores : 24 2025-05-07T19:43:02.0964361Z apicid : 7 2025-05-07T19:43:02.0964449Z initial apicid : 7 2025-05-07T19:43:02.0964520Z fpu : yes 2025-05-07T19:43:02.0964602Z fpu_exception : yes 2025-05-07T19:43:02.0964678Z cpuid level : 13 2025-05-07T19:43:02.0964758Z wp : yes 2025-05-07T19:43:02.0966752Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.0967165Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.0967237Z bogomips : 5999.99 2025-05-07T19:43:02.0967313Z clflush size : 64 2025-05-07T19:43:02.0967390Z cache_alignment : 64 2025-05-07T19:43:02.0967519Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.0967593Z power management: 2025-05-07T19:43:02.0967597Z 2025-05-07T19:43:02.0967670Z processor : 52 2025-05-07T19:43:02.0967767Z vendor_id : GenuineIntel 2025-05-07T19:43:02.0967840Z cpu family : 6 2025-05-07T19:43:02.0967909Z model : 85 2025-05-07T19:43:02.0968051Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.0968135Z stepping : 7 2025-05-07T19:43:02.0968209Z microcode : 0x5003901 2025-05-07T19:43:02.0968283Z cpu MHz : 2999.998 2025-05-07T19:43:02.0968369Z cache size : 36608 KB 2025-05-07T19:43:02.0968492Z physical id : 0 2025-05-07T19:43:02.0968566Z siblings : 48 2025-05-07T19:43:02.0968635Z core id : 4 2025-05-07T19:43:02.0968714Z cpu cores : 24 2025-05-07T19:43:02.0968787Z apicid : 9 2025-05-07T19:43:02.0968861Z initial apicid : 9 2025-05-07T19:43:02.0968941Z fpu : yes 2025-05-07T19:43:02.0969021Z fpu_exception : yes 2025-05-07T19:43:02.0969095Z cpuid level : 13 2025-05-07T19:43:02.0969167Z wp : yes 2025-05-07T19:43:02.0971151Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.0971507Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.0971600Z bogomips : 5999.99 2025-05-07T19:43:02.0971674Z clflush size : 64 2025-05-07T19:43:02.0971748Z cache_alignment : 64 2025-05-07T19:43:02.0971863Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.0971952Z power management: 2025-05-07T19:43:02.0971957Z 2025-05-07T19:43:02.0972028Z processor : 53 2025-05-07T19:43:02.0972108Z vendor_id : GenuineIntel 2025-05-07T19:43:02.0972194Z cpu family : 6 2025-05-07T19:43:02.0972266Z model : 85 2025-05-07T19:43:02.0972409Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.0972482Z stepping : 7 2025-05-07T19:43:02.0972573Z microcode : 0x5003901 2025-05-07T19:43:02.0972646Z cpu MHz : 2999.998 2025-05-07T19:43:02.0972721Z cache size : 36608 KB 2025-05-07T19:43:02.0972808Z physical id : 0 2025-05-07T19:43:02.0972884Z siblings : 48 2025-05-07T19:43:02.0972955Z core id : 5 2025-05-07T19:43:02.0973027Z cpu cores : 24 2025-05-07T19:43:02.0973110Z apicid : 11 2025-05-07T19:43:02.0973187Z initial apicid : 11 2025-05-07T19:43:02.0973258Z fpu : yes 2025-05-07T19:43:02.0973343Z fpu_exception : yes 2025-05-07T19:43:02.0973419Z cpuid level : 13 2025-05-07T19:43:02.0973492Z wp : yes 2025-05-07T19:43:02.0975487Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.0975901Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.0975981Z bogomips : 5999.99 2025-05-07T19:43:02.0976069Z clflush size : 64 2025-05-07T19:43:02.0976147Z cache_alignment : 64 2025-05-07T19:43:02.0976270Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.0976351Z power management: 2025-05-07T19:43:02.0976355Z 2025-05-07T19:43:02.0976438Z processor : 54 2025-05-07T19:43:02.0976523Z vendor_id : GenuineIntel 2025-05-07T19:43:02.0976599Z cpu family : 6 2025-05-07T19:43:02.0976684Z model : 85 2025-05-07T19:43:02.0976836Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.0976914Z stepping : 7 2025-05-07T19:43:02.0976990Z microcode : 0x5003901 2025-05-07T19:43:02.0977083Z cpu MHz : 2999.998 2025-05-07T19:43:02.0977164Z cache size : 36608 KB 2025-05-07T19:43:02.0977239Z physical id : 0 2025-05-07T19:43:02.0977325Z siblings : 48 2025-05-07T19:43:02.0977399Z core id : 6 2025-05-07T19:43:02.0977525Z cpu cores : 24 2025-05-07T19:43:02.0977596Z apicid : 13 2025-05-07T19:43:02.0977683Z initial apicid : 13 2025-05-07T19:43:02.0977756Z fpu : yes 2025-05-07T19:43:02.0977837Z fpu_exception : yes 2025-05-07T19:43:02.0977910Z cpuid level : 13 2025-05-07T19:43:02.0978000Z wp : yes 2025-05-07T19:43:02.0979990Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.0980384Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.0980462Z bogomips : 5999.99 2025-05-07T19:43:02.0980538Z clflush size : 64 2025-05-07T19:43:02.0980632Z cache_alignment : 64 2025-05-07T19:43:02.0980751Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.0980828Z power management: 2025-05-07T19:43:02.0980833Z 2025-05-07T19:43:02.0980912Z processor : 55 2025-05-07T19:43:02.0981005Z vendor_id : GenuineIntel 2025-05-07T19:43:02.0981082Z cpu family : 6 2025-05-07T19:43:02.0981155Z model : 85 2025-05-07T19:43:02.0981314Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.0981390Z stepping : 7 2025-05-07T19:43:02.0981470Z microcode : 0x5003901 2025-05-07T19:43:02.0981546Z cpu MHz : 2999.998 2025-05-07T19:43:02.0981635Z cache size : 36608 KB 2025-05-07T19:43:02.0981711Z physical id : 0 2025-05-07T19:43:02.0981785Z siblings : 48 2025-05-07T19:43:02.0981872Z core id : 7 2025-05-07T19:43:02.0981943Z cpu cores : 24 2025-05-07T19:43:02.0982021Z apicid : 15 2025-05-07T19:43:02.0982099Z initial apicid : 15 2025-05-07T19:43:02.0982184Z fpu : yes 2025-05-07T19:43:02.0982259Z fpu_exception : yes 2025-05-07T19:43:02.0982337Z cpuid level : 13 2025-05-07T19:43:02.0982407Z wp : yes 2025-05-07T19:43:02.0984563Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.0984965Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.0985059Z bogomips : 5999.99 2025-05-07T19:43:02.0985134Z clflush size : 64 2025-05-07T19:43:02.0985210Z cache_alignment : 64 2025-05-07T19:43:02.0985328Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.0985420Z power management: 2025-05-07T19:43:02.0985424Z 2025-05-07T19:43:02.0985502Z processor : 56 2025-05-07T19:43:02.0985581Z vendor_id : GenuineIntel 2025-05-07T19:43:02.0985666Z cpu family : 6 2025-05-07T19:43:02.0985734Z model : 85 2025-05-07T19:43:02.0985875Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.0985961Z stepping : 7 2025-05-07T19:43:02.0986040Z microcode : 0x5003901 2025-05-07T19:43:02.0986111Z cpu MHz : 2999.998 2025-05-07T19:43:02.0986188Z cache size : 36608 KB 2025-05-07T19:43:02.0986276Z physical id : 0 2025-05-07T19:43:02.0986345Z siblings : 48 2025-05-07T19:43:02.0986416Z core id : 8 2025-05-07T19:43:02.0986492Z cpu cores : 24 2025-05-07T19:43:02.0986574Z apicid : 17 2025-05-07T19:43:02.0986716Z initial apicid : 17 2025-05-07T19:43:02.0986787Z fpu : yes 2025-05-07T19:43:02.0986878Z fpu_exception : yes 2025-05-07T19:43:02.0986952Z cpuid level : 13 2025-05-07T19:43:02.0987021Z wp : yes 2025-05-07T19:43:02.0989000Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.0989355Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.0989430Z bogomips : 5999.99 2025-05-07T19:43:02.0989516Z clflush size : 64 2025-05-07T19:43:02.0989592Z cache_alignment : 64 2025-05-07T19:43:02.0989710Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.0989781Z power management: 2025-05-07T19:43:02.0989792Z 2025-05-07T19:43:02.0989863Z processor : 57 2025-05-07T19:43:02.0989944Z vendor_id : GenuineIntel 2025-05-07T19:43:02.0990019Z cpu family : 6 2025-05-07T19:43:02.0990095Z model : 85 2025-05-07T19:43:02.0990238Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.0990314Z stepping : 7 2025-05-07T19:43:02.0990394Z microcode : 0x5003901 2025-05-07T19:43:02.0990476Z cpu MHz : 2999.998 2025-05-07T19:43:02.0990682Z cache size : 36608 KB 2025-05-07T19:43:02.0990929Z physical id : 0 2025-05-07T19:43:02.0991019Z siblings : 48 2025-05-07T19:43:02.0991096Z core id : 9 2025-05-07T19:43:02.0991316Z cpu cores : 24 2025-05-07T19:43:02.0991406Z apicid : 19 2025-05-07T19:43:02.0991504Z initial apicid : 19 2025-05-07T19:43:02.0991582Z fpu : yes 2025-05-07T19:43:02.0991671Z fpu_exception : yes 2025-05-07T19:43:02.0991878Z cpuid level : 13 2025-05-07T19:43:02.0991955Z wp : yes 2025-05-07T19:43:02.0994088Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.0994585Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.0994666Z bogomips : 5999.99 2025-05-07T19:43:02.0994746Z clflush size : 64 2025-05-07T19:43:02.0994847Z cache_alignment : 64 2025-05-07T19:43:02.0994974Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.0995056Z power management: 2025-05-07T19:43:02.0995061Z 2025-05-07T19:43:02.0995143Z processor : 58 2025-05-07T19:43:02.0995248Z vendor_id : GenuineIntel 2025-05-07T19:43:02.0995328Z cpu family : 6 2025-05-07T19:43:02.0995403Z model : 85 2025-05-07T19:43:02.0995575Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.0995655Z stepping : 7 2025-05-07T19:43:02.0995737Z microcode : 0x5003901 2025-05-07T19:43:02.0995816Z cpu MHz : 2999.998 2025-05-07T19:43:02.0995909Z cache size : 36608 KB 2025-05-07T19:43:02.0995986Z physical id : 0 2025-05-07T19:43:02.0996065Z siblings : 48 2025-05-07T19:43:02.0996155Z core id : 10 2025-05-07T19:43:02.0996233Z cpu cores : 24 2025-05-07T19:43:02.0996307Z apicid : 21 2025-05-07T19:43:02.0996389Z initial apicid : 21 2025-05-07T19:43:02.0996477Z fpu : yes 2025-05-07T19:43:02.0996557Z fpu_exception : yes 2025-05-07T19:43:02.0996787Z cpuid level : 13 2025-05-07T19:43:02.0996879Z wp : yes 2025-05-07T19:43:02.0999024Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.0999406Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.0999502Z bogomips : 5999.99 2025-05-07T19:43:02.0999585Z clflush size : 64 2025-05-07T19:43:02.0999668Z cache_alignment : 64 2025-05-07T19:43:02.0999812Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.0999898Z power management: 2025-05-07T19:43:02.0999903Z 2025-05-07T19:43:02.0999980Z processor : 59 2025-05-07T19:43:02.1000068Z vendor_id : GenuineIntel 2025-05-07T19:43:02.1000162Z cpu family : 6 2025-05-07T19:43:02.1000240Z model : 85 2025-05-07T19:43:02.1000397Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.1000488Z stepping : 7 2025-05-07T19:43:02.1000572Z microcode : 0x5003901 2025-05-07T19:43:02.1000653Z cpu MHz : 2999.998 2025-05-07T19:43:02.1000734Z cache size : 36608 KB 2025-05-07T19:43:02.1000830Z physical id : 0 2025-05-07T19:43:02.1000908Z siblings : 48 2025-05-07T19:43:02.1000984Z core id : 11 2025-05-07T19:43:02.1001074Z cpu cores : 24 2025-05-07T19:43:02.1001152Z apicid : 23 2025-05-07T19:43:02.1001235Z initial apicid : 23 2025-05-07T19:43:02.1001314Z fpu : yes 2025-05-07T19:43:02.1001410Z fpu_exception : yes 2025-05-07T19:43:02.1001491Z cpuid level : 13 2025-05-07T19:43:02.1001569Z wp : yes 2025-05-07T19:43:02.1003852Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.1004256Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.1004337Z bogomips : 5999.99 2025-05-07T19:43:02.1004421Z clflush size : 64 2025-05-07T19:43:02.1004498Z cache_alignment : 64 2025-05-07T19:43:02.1004618Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.1004707Z power management: 2025-05-07T19:43:02.1004711Z 2025-05-07T19:43:02.1004783Z processor : 60 2025-05-07T19:43:02.1004861Z vendor_id : GenuineIntel 2025-05-07T19:43:02.1004936Z cpu family : 6 2025-05-07T19:43:02.1005019Z model : 85 2025-05-07T19:43:02.1005162Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.1005234Z stepping : 7 2025-05-07T19:43:02.1005322Z microcode : 0x5003901 2025-05-07T19:43:02.1005400Z cpu MHz : 2999.998 2025-05-07T19:43:02.1005477Z cache size : 36608 KB 2025-05-07T19:43:02.1005556Z physical id : 0 2025-05-07T19:43:02.1005645Z siblings : 48 2025-05-07T19:43:02.1005713Z core id : 12 2025-05-07T19:43:02.1005786Z cpu cores : 24 2025-05-07T19:43:02.1005869Z apicid : 25 2025-05-07T19:43:02.1005947Z initial apicid : 25 2025-05-07T19:43:02.1006021Z fpu : yes 2025-05-07T19:43:02.1006101Z fpu_exception : yes 2025-05-07T19:43:02.1006184Z cpuid level : 13 2025-05-07T19:43:02.1006251Z wp : yes 2025-05-07T19:43:02.1008283Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.1008649Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.1008725Z bogomips : 5999.99 2025-05-07T19:43:02.1008802Z clflush size : 64 2025-05-07T19:43:02.1008893Z cache_alignment : 64 2025-05-07T19:43:02.1009012Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.1009091Z power management: 2025-05-07T19:43:02.1009095Z 2025-05-07T19:43:02.1009182Z processor : 61 2025-05-07T19:43:02.1009264Z vendor_id : GenuineIntel 2025-05-07T19:43:02.1009336Z cpu family : 6 2025-05-07T19:43:02.1009404Z model : 85 2025-05-07T19:43:02.1009561Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.1009634Z stepping : 7 2025-05-07T19:43:02.1009708Z microcode : 0x5003901 2025-05-07T19:43:02.1009811Z cpu MHz : 2999.998 2025-05-07T19:43:02.1009891Z cache size : 36608 KB 2025-05-07T19:43:02.1009970Z physical id : 0 2025-05-07T19:43:02.1010048Z siblings : 48 2025-05-07T19:43:02.1010144Z core id : 13 2025-05-07T19:43:02.1010223Z cpu cores : 24 2025-05-07T19:43:02.1010304Z apicid : 27 2025-05-07T19:43:02.1010402Z initial apicid : 27 2025-05-07T19:43:02.1010478Z fpu : yes 2025-05-07T19:43:02.1010560Z fpu_exception : yes 2025-05-07T19:43:02.1010641Z cpuid level : 13 2025-05-07T19:43:02.1010733Z wp : yes 2025-05-07T19:43:02.1012731Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.1013091Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.1013242Z bogomips : 5999.99 2025-05-07T19:43:02.1013321Z clflush size : 64 2025-05-07T19:43:02.1013407Z cache_alignment : 64 2025-05-07T19:43:02.1013549Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.1013635Z power management: 2025-05-07T19:43:02.1013639Z 2025-05-07T19:43:02.1013723Z processor : 62 2025-05-07T19:43:02.1013830Z vendor_id : GenuineIntel 2025-05-07T19:43:02.1013910Z cpu family : 6 2025-05-07T19:43:02.1013989Z model : 85 2025-05-07T19:43:02.1014140Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.1014237Z stepping : 7 2025-05-07T19:43:02.1014323Z microcode : 0x5003901 2025-05-07T19:43:02.1014405Z cpu MHz : 2999.998 2025-05-07T19:43:02.1014506Z cache size : 36608 KB 2025-05-07T19:43:02.1014593Z physical id : 0 2025-05-07T19:43:02.1014672Z siblings : 48 2025-05-07T19:43:02.1014752Z core id : 14 2025-05-07T19:43:02.1014849Z cpu cores : 24 2025-05-07T19:43:02.1014927Z apicid : 29 2025-05-07T19:43:02.1015012Z initial apicid : 29 2025-05-07T19:43:02.1015088Z fpu : yes 2025-05-07T19:43:02.1015190Z fpu_exception : yes 2025-05-07T19:43:02.1015269Z cpuid level : 13 2025-05-07T19:43:02.1015344Z wp : yes 2025-05-07T19:43:02.1017426Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.1017788Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.1017873Z bogomips : 5999.99 2025-05-07T19:43:02.1017971Z clflush size : 64 2025-05-07T19:43:02.1018056Z cache_alignment : 64 2025-05-07T19:43:02.1018181Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.1018283Z power management: 2025-05-07T19:43:02.1018287Z 2025-05-07T19:43:02.1018368Z processor : 63 2025-05-07T19:43:02.1018460Z vendor_id : GenuineIntel 2025-05-07T19:43:02.1018549Z cpu family : 6 2025-05-07T19:43:02.1018629Z model : 85 2025-05-07T19:43:02.1018776Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.1018854Z stepping : 7 2025-05-07T19:43:02.1018951Z microcode : 0x5003901 2025-05-07T19:43:02.1019028Z cpu MHz : 2999.998 2025-05-07T19:43:02.1019107Z cache size : 36608 KB 2025-05-07T19:43:02.1019185Z physical id : 0 2025-05-07T19:43:02.1019276Z siblings : 48 2025-05-07T19:43:02.1019350Z core id : 15 2025-05-07T19:43:02.1019426Z cpu cores : 24 2025-05-07T19:43:02.1019515Z apicid : 31 2025-05-07T19:43:02.1019597Z initial apicid : 31 2025-05-07T19:43:02.1019673Z fpu : yes 2025-05-07T19:43:02.1019753Z fpu_exception : yes 2025-05-07T19:43:02.1019846Z cpuid level : 13 2025-05-07T19:43:02.1019919Z wp : yes 2025-05-07T19:43:02.1021886Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.1022257Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.1022401Z bogomips : 5999.99 2025-05-07T19:43:02.1022482Z clflush size : 64 2025-05-07T19:43:02.1022578Z cache_alignment : 64 2025-05-07T19:43:02.1022698Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.1022780Z power management: 2025-05-07T19:43:02.1022784Z 2025-05-07T19:43:02.1022876Z processor : 64 2025-05-07T19:43:02.1022963Z vendor_id : GenuineIntel 2025-05-07T19:43:02.1023041Z cpu family : 6 2025-05-07T19:43:02.1023119Z model : 85 2025-05-07T19:43:02.1023286Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.1023368Z stepping : 7 2025-05-07T19:43:02.1023449Z microcode : 0x5003901 2025-05-07T19:43:02.1023545Z cpu MHz : 2999.998 2025-05-07T19:43:02.1023630Z cache size : 36608 KB 2025-05-07T19:43:02.1023710Z physical id : 0 2025-05-07T19:43:02.1023790Z siblings : 48 2025-05-07T19:43:02.1023878Z core id : 16 2025-05-07T19:43:02.1023953Z cpu cores : 24 2025-05-07T19:43:02.1024029Z apicid : 33 2025-05-07T19:43:02.1024125Z initial apicid : 33 2025-05-07T19:43:02.1024202Z fpu : yes 2025-05-07T19:43:02.1024284Z fpu_exception : yes 2025-05-07T19:43:02.1024362Z cpuid level : 13 2025-05-07T19:43:02.1024449Z wp : yes 2025-05-07T19:43:02.1026494Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.1026870Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.1026950Z bogomips : 5999.99 2025-05-07T19:43:02.1027030Z clflush size : 64 2025-05-07T19:43:02.1027111Z cache_alignment : 64 2025-05-07T19:43:02.1027249Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.1027330Z power management: 2025-05-07T19:43:02.1027334Z 2025-05-07T19:43:02.1027412Z processor : 65 2025-05-07T19:43:02.1027512Z vendor_id : GenuineIntel 2025-05-07T19:43:02.1027589Z cpu family : 6 2025-05-07T19:43:02.1027667Z model : 85 2025-05-07T19:43:02.1027816Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.1027909Z stepping : 7 2025-05-07T19:43:02.1027992Z microcode : 0x5003901 2025-05-07T19:43:02.1028071Z cpu MHz : 2999.998 2025-05-07T19:43:02.1028169Z cache size : 36608 KB 2025-05-07T19:43:02.1028248Z physical id : 0 2025-05-07T19:43:02.1028325Z siblings : 48 2025-05-07T19:43:02.1028401Z core id : 17 2025-05-07T19:43:02.1028493Z cpu cores : 24 2025-05-07T19:43:02.1028569Z apicid : 35 2025-05-07T19:43:02.1028649Z initial apicid : 35 2025-05-07T19:43:02.1028737Z fpu : yes 2025-05-07T19:43:02.1028819Z fpu_exception : yes 2025-05-07T19:43:02.1028897Z cpuid level : 13 2025-05-07T19:43:02.1028971Z wp : yes 2025-05-07T19:43:02.1030947Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.1031382Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.1031483Z bogomips : 5999.99 2025-05-07T19:43:02.1031735Z clflush size : 64 2025-05-07T19:43:02.1031887Z cache_alignment : 64 2025-05-07T19:43:02.1032017Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.1032123Z power management: 2025-05-07T19:43:02.1032127Z 2025-05-07T19:43:02.1032210Z processor : 66 2025-05-07T19:43:02.1032325Z vendor_id : GenuineIntel 2025-05-07T19:43:02.1032423Z cpu family : 6 2025-05-07T19:43:02.1032577Z model : 85 2025-05-07T19:43:02.1032741Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.1032828Z stepping : 7 2025-05-07T19:43:02.1032934Z microcode : 0x5003901 2025-05-07T19:43:02.1033016Z cpu MHz : 1196.114 2025-05-07T19:43:02.1033103Z cache size : 36608 KB 2025-05-07T19:43:02.1033207Z physical id : 0 2025-05-07T19:43:02.1033287Z siblings : 48 2025-05-07T19:43:02.1033367Z core id : 18 2025-05-07T19:43:02.1033450Z cpu cores : 24 2025-05-07T19:43:02.1033549Z apicid : 37 2025-05-07T19:43:02.1033636Z initial apicid : 37 2025-05-07T19:43:02.1033715Z fpu : yes 2025-05-07T19:43:02.1033818Z fpu_exception : yes 2025-05-07T19:43:02.1033905Z cpuid level : 13 2025-05-07T19:43:02.1033986Z wp : yes 2025-05-07T19:43:02.1036213Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.1036597Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.1036682Z bogomips : 5999.99 2025-05-07T19:43:02.1036782Z clflush size : 64 2025-05-07T19:43:02.1036868Z cache_alignment : 64 2025-05-07T19:43:02.1037002Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.1037089Z power management: 2025-05-07T19:43:02.1037094Z 2025-05-07T19:43:02.1037189Z processor : 67 2025-05-07T19:43:02.1037279Z vendor_id : GenuineIntel 2025-05-07T19:43:02.1037361Z cpu family : 6 2025-05-07T19:43:02.1037455Z model : 85 2025-05-07T19:43:02.1037618Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.1037700Z stepping : 7 2025-05-07T19:43:02.1037789Z microcode : 0x5003901 2025-05-07T19:43:02.1037886Z cpu MHz : 2999.998 2025-05-07T19:43:02.1037972Z cache size : 36608 KB 2025-05-07T19:43:02.1038055Z physical id : 0 2025-05-07T19:43:02.1038152Z siblings : 48 2025-05-07T19:43:02.1038232Z core id : 19 2025-05-07T19:43:02.1038315Z cpu cores : 24 2025-05-07T19:43:02.1038396Z apicid : 39 2025-05-07T19:43:02.1038495Z initial apicid : 39 2025-05-07T19:43:02.1038574Z fpu : yes 2025-05-07T19:43:02.1038660Z fpu_exception : yes 2025-05-07T19:43:02.1038742Z cpuid level : 13 2025-05-07T19:43:02.1038837Z wp : yes 2025-05-07T19:43:02.1041004Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.1041402Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.1041489Z bogomips : 5999.99 2025-05-07T19:43:02.1041573Z clflush size : 64 2025-05-07T19:43:02.1041678Z cache_alignment : 64 2025-05-07T19:43:02.1041811Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.1041954Z power management: 2025-05-07T19:43:02.1041958Z 2025-05-07T19:43:02.1042045Z processor : 68 2025-05-07T19:43:02.1042157Z vendor_id : GenuineIntel 2025-05-07T19:43:02.1042240Z cpu family : 6 2025-05-07T19:43:02.1042321Z model : 85 2025-05-07T19:43:02.1042498Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.1042581Z stepping : 7 2025-05-07T19:43:02.1042670Z microcode : 0x5003901 2025-05-07T19:43:02.1042752Z cpu MHz : 1195.546 2025-05-07T19:43:02.1042853Z cache size : 36608 KB 2025-05-07T19:43:02.1042937Z physical id : 0 2025-05-07T19:43:02.1043018Z siblings : 48 2025-05-07T19:43:02.1043112Z core id : 20 2025-05-07T19:43:02.1043195Z cpu cores : 24 2025-05-07T19:43:02.1043279Z apicid : 41 2025-05-07T19:43:02.1043365Z initial apicid : 41 2025-05-07T19:43:02.1043460Z fpu : yes 2025-05-07T19:43:02.1043546Z fpu_exception : yes 2025-05-07T19:43:02.1043630Z cpuid level : 13 2025-05-07T19:43:02.1043708Z wp : yes 2025-05-07T19:43:02.1045988Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.1046368Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.1046469Z bogomips : 5999.99 2025-05-07T19:43:02.1046552Z clflush size : 64 2025-05-07T19:43:02.1046637Z cache_alignment : 64 2025-05-07T19:43:02.1046767Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.1046868Z power management: 2025-05-07T19:43:02.1046876Z 2025-05-07T19:43:02.1046959Z processor : 69 2025-05-07T19:43:02.1047049Z vendor_id : GenuineIntel 2025-05-07T19:43:02.1047147Z cpu family : 6 2025-05-07T19:43:02.1047227Z model : 85 2025-05-07T19:43:02.1047387Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.1047486Z stepping : 7 2025-05-07T19:43:02.1047573Z microcode : 0x5003901 2025-05-07T19:43:02.1047658Z cpu MHz : 2999.998 2025-05-07T19:43:02.1047747Z cache size : 36608 KB 2025-05-07T19:43:02.1047846Z physical id : 0 2025-05-07T19:43:02.1047929Z siblings : 48 2025-05-07T19:43:02.1048012Z core id : 21 2025-05-07T19:43:02.1048095Z cpu cores : 24 2025-05-07T19:43:02.1048198Z apicid : 43 2025-05-07T19:43:02.1048286Z initial apicid : 43 2025-05-07T19:43:02.1048367Z fpu : yes 2025-05-07T19:43:02.1048471Z fpu_exception : yes 2025-05-07T19:43:02.1048556Z cpuid level : 13 2025-05-07T19:43:02.1048638Z wp : yes 2025-05-07T19:43:02.1050773Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.1051156Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.1051244Z bogomips : 5999.99 2025-05-07T19:43:02.1051347Z clflush size : 64 2025-05-07T19:43:02.1051439Z cache_alignment : 64 2025-05-07T19:43:02.1051567Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.1051656Z power management: 2025-05-07T19:43:02.1051676Z 2025-05-07T19:43:02.1051811Z processor : 70 2025-05-07T19:43:02.1051900Z vendor_id : GenuineIntel 2025-05-07T19:43:02.1051982Z cpu family : 6 2025-05-07T19:43:02.1052081Z model : 85 2025-05-07T19:43:02.1052242Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.1052326Z stepping : 7 2025-05-07T19:43:02.1052414Z microcode : 0x5003901 2025-05-07T19:43:02.1052511Z cpu MHz : 1202.896 2025-05-07T19:43:02.1052599Z cache size : 36608 KB 2025-05-07T19:43:02.1052685Z physical id : 0 2025-05-07T19:43:02.1052783Z siblings : 48 2025-05-07T19:43:02.1052862Z core id : 22 2025-05-07T19:43:02.1052945Z cpu cores : 24 2025-05-07T19:43:02.1053024Z apicid : 45 2025-05-07T19:43:02.1053129Z initial apicid : 45 2025-05-07T19:43:02.1053244Z fpu : yes 2025-05-07T19:43:02.1053334Z fpu_exception : yes 2025-05-07T19:43:02.1053429Z cpuid level : 13 2025-05-07T19:43:02.1053506Z wp : yes 2025-05-07T19:43:02.1055682Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.1056058Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.1056143Z bogomips : 5999.99 2025-05-07T19:43:02.1056222Z clflush size : 64 2025-05-07T19:43:02.1056313Z cache_alignment : 64 2025-05-07T19:43:02.1056434Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.1056513Z power management: 2025-05-07T19:43:02.1056517Z 2025-05-07T19:43:02.1056595Z processor : 71 2025-05-07T19:43:02.1056694Z vendor_id : GenuineIntel 2025-05-07T19:43:02.1056771Z cpu family : 6 2025-05-07T19:43:02.1056847Z model : 85 2025-05-07T19:43:02.1057006Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.1057082Z stepping : 7 2025-05-07T19:43:02.1057162Z microcode : 0x5003901 2025-05-07T19:43:02.1057239Z cpu MHz : 2999.998 2025-05-07T19:43:02.1057328Z cache size : 36608 KB 2025-05-07T19:43:02.1057410Z physical id : 0 2025-05-07T19:43:02.1057484Z siblings : 48 2025-05-07T19:43:02.1057575Z core id : 23 2025-05-07T19:43:02.1057652Z cpu cores : 24 2025-05-07T19:43:02.1057728Z apicid : 47 2025-05-07T19:43:02.1057808Z initial apicid : 47 2025-05-07T19:43:02.1057897Z fpu : yes 2025-05-07T19:43:02.1057977Z fpu_exception : yes 2025-05-07T19:43:02.1070619Z cpuid level : 13 2025-05-07T19:43:02.1070782Z wp : yes 2025-05-07T19:43:02.1073212Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.1073610Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.1073712Z bogomips : 5999.99 2025-05-07T19:43:02.1073795Z clflush size : 64 2025-05-07T19:43:02.1073883Z cache_alignment : 64 2025-05-07T19:43:02.1074029Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.1074112Z power management: 2025-05-07T19:43:02.1074118Z 2025-05-07T19:43:02.1074201Z processor : 72 2025-05-07T19:43:02.1074287Z vendor_id : GenuineIntel 2025-05-07T19:43:02.1074518Z cpu family : 6 2025-05-07T19:43:02.1074595Z model : 85 2025-05-07T19:43:02.1074757Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.1074849Z stepping : 7 2025-05-07T19:43:02.1074933Z microcode : 0x5003901 2025-05-07T19:43:02.1075011Z cpu MHz : 2999.998 2025-05-07T19:43:02.1075097Z cache size : 36608 KB 2025-05-07T19:43:02.1075190Z physical id : 1 2025-05-07T19:43:02.1075269Z siblings : 48 2025-05-07T19:43:02.1075345Z core id : 0 2025-05-07T19:43:02.1075430Z cpu cores : 24 2025-05-07T19:43:02.1075506Z apicid : 65 2025-05-07T19:43:02.1075586Z initial apicid : 65 2025-05-07T19:43:02.1075659Z fpu : yes 2025-05-07T19:43:02.1075752Z fpu_exception : yes 2025-05-07T19:43:02.1075833Z cpuid level : 13 2025-05-07T19:43:02.1075908Z wp : yes 2025-05-07T19:43:02.1078056Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.1078498Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.1078583Z bogomips : 5999.99 2025-05-07T19:43:02.1078671Z clflush size : 64 2025-05-07T19:43:02.1078755Z cache_alignment : 64 2025-05-07T19:43:02.1078882Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.1078978Z power management: 2025-05-07T19:43:02.1078983Z 2025-05-07T19:43:02.1079064Z processor : 73 2025-05-07T19:43:02.1079154Z vendor_id : GenuineIntel 2025-05-07T19:43:02.1079234Z cpu family : 6 2025-05-07T19:43:02.1079319Z model : 85 2025-05-07T19:43:02.1079483Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.1079566Z stepping : 7 2025-05-07T19:43:02.1079661Z microcode : 0x5003901 2025-05-07T19:43:02.1079740Z cpu MHz : 2999.998 2025-05-07T19:43:02.1079821Z cache size : 36608 KB 2025-05-07T19:43:02.1079905Z physical id : 1 2025-05-07T19:43:02.1079987Z siblings : 48 2025-05-07T19:43:02.1080065Z core id : 1 2025-05-07T19:43:02.1080136Z cpu cores : 24 2025-05-07T19:43:02.1080207Z apicid : 67 2025-05-07T19:43:02.1080288Z initial apicid : 67 2025-05-07T19:43:02.1080362Z fpu : yes 2025-05-07T19:43:02.1080444Z fpu_exception : yes 2025-05-07T19:43:02.1080529Z cpuid level : 13 2025-05-07T19:43:02.1080604Z wp : yes 2025-05-07T19:43:02.1082750Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.1083328Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.1083410Z bogomips : 5999.99 2025-05-07T19:43:02.1083490Z clflush size : 64 2025-05-07T19:43:02.1083580Z cache_alignment : 64 2025-05-07T19:43:02.1083707Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.1083791Z power management: 2025-05-07T19:43:02.1083796Z 2025-05-07T19:43:02.1083883Z processor : 74 2025-05-07T19:43:02.1084080Z vendor_id : GenuineIntel 2025-05-07T19:43:02.1084153Z cpu family : 6 2025-05-07T19:43:02.1084224Z model : 85 2025-05-07T19:43:02.1084383Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.1084505Z stepping : 7 2025-05-07T19:43:02.1084583Z microcode : 0x5003901 2025-05-07T19:43:02.1084655Z cpu MHz : 2999.998 2025-05-07T19:43:02.1084737Z cache size : 36608 KB 2025-05-07T19:43:02.1084810Z physical id : 1 2025-05-07T19:43:02.1084878Z siblings : 48 2025-05-07T19:43:02.1084956Z core id : 2 2025-05-07T19:43:02.1085033Z cpu cores : 24 2025-05-07T19:43:02.1085106Z apicid : 69 2025-05-07T19:43:02.1085181Z initial apicid : 69 2025-05-07T19:43:02.1085262Z fpu : yes 2025-05-07T19:43:02.1085340Z fpu_exception : yes 2025-05-07T19:43:02.1085410Z cpuid level : 13 2025-05-07T19:43:02.1085487Z wp : yes 2025-05-07T19:43:02.1087480Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.1087881Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.1087964Z bogomips : 5999.99 2025-05-07T19:43:02.1088037Z clflush size : 64 2025-05-07T19:43:02.1088114Z cache_alignment : 64 2025-05-07T19:43:02.1088240Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.1088316Z power management: 2025-05-07T19:43:02.1088320Z 2025-05-07T19:43:02.1088390Z processor : 75 2025-05-07T19:43:02.1088468Z vendor_id : GenuineIntel 2025-05-07T19:43:02.1088548Z cpu family : 6 2025-05-07T19:43:02.1088616Z model : 85 2025-05-07T19:43:02.1088758Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.1088839Z stepping : 7 2025-05-07T19:43:02.1088913Z microcode : 0x5003901 2025-05-07T19:43:02.1088983Z cpu MHz : 3172.086 2025-05-07T19:43:02.1089054Z cache size : 36608 KB 2025-05-07T19:43:02.1089131Z physical id : 1 2025-05-07T19:43:02.1089201Z siblings : 48 2025-05-07T19:43:02.1089268Z core id : 3 2025-05-07T19:43:02.1089345Z cpu cores : 24 2025-05-07T19:43:02.1089412Z apicid : 71 2025-05-07T19:43:02.1089489Z initial apicid : 71 2025-05-07T19:43:02.1089556Z fpu : yes 2025-05-07T19:43:02.1089637Z fpu_exception : yes 2025-05-07T19:43:02.1089709Z cpuid level : 13 2025-05-07T19:43:02.1089775Z wp : yes 2025-05-07T19:43:02.1092247Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.1092639Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.1092723Z bogomips : 5999.99 2025-05-07T19:43:02.1092810Z clflush size : 64 2025-05-07T19:43:02.1092893Z cache_alignment : 64 2025-05-07T19:43:02.1093016Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.1093106Z power management: 2025-05-07T19:43:02.1093110Z 2025-05-07T19:43:02.1093185Z processor : 76 2025-05-07T19:43:02.1093270Z vendor_id : GenuineIntel 2025-05-07T19:43:02.1093346Z cpu family : 6 2025-05-07T19:43:02.1093429Z model : 85 2025-05-07T19:43:02.1093587Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.1093667Z stepping : 7 2025-05-07T19:43:02.1093860Z microcode : 0x5003901 2025-05-07T19:43:02.1093937Z cpu MHz : 3188.834 2025-05-07T19:43:02.1094019Z cache size : 36608 KB 2025-05-07T19:43:02.1094096Z physical id : 1 2025-05-07T19:43:02.1094177Z siblings : 48 2025-05-07T19:43:02.1094249Z core id : 4 2025-05-07T19:43:02.1094324Z cpu cores : 24 2025-05-07T19:43:02.1094406Z apicid : 73 2025-05-07T19:43:02.1094488Z initial apicid : 73 2025-05-07T19:43:02.1094566Z fpu : yes 2025-05-07T19:43:02.1094646Z fpu_exception : yes 2025-05-07T19:43:02.1094730Z cpuid level : 13 2025-05-07T19:43:02.1094806Z wp : yes 2025-05-07T19:43:02.1096933Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.1097321Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.1097405Z bogomips : 5999.99 2025-05-07T19:43:02.1097547Z clflush size : 64 2025-05-07T19:43:02.1097634Z cache_alignment : 64 2025-05-07T19:43:02.1097760Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.1097841Z power management: 2025-05-07T19:43:02.1097845Z 2025-05-07T19:43:02.1097933Z processor : 77 2025-05-07T19:43:02.1098017Z vendor_id : GenuineIntel 2025-05-07T19:43:02.1098094Z cpu family : 6 2025-05-07T19:43:02.1098171Z model : 85 2025-05-07T19:43:02.1098333Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.1098412Z stepping : 7 2025-05-07T19:43:02.1098495Z microcode : 0x5003901 2025-05-07T19:43:02.1098587Z cpu MHz : 3113.937 2025-05-07T19:43:02.1098665Z cache size : 36608 KB 2025-05-07T19:43:02.1098743Z physical id : 1 2025-05-07T19:43:02.1098817Z siblings : 48 2025-05-07T19:43:02.1098902Z core id : 5 2025-05-07T19:43:02.1098976Z cpu cores : 24 2025-05-07T19:43:02.1099051Z apicid : 75 2025-05-07T19:43:02.1099137Z initial apicid : 75 2025-05-07T19:43:02.1099210Z fpu : yes 2025-05-07T19:43:02.1099294Z fpu_exception : yes 2025-05-07T19:43:02.1099369Z cpuid level : 13 2025-05-07T19:43:02.1099450Z wp : yes 2025-05-07T19:43:02.1101602Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.1101995Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.1102073Z bogomips : 5999.99 2025-05-07T19:43:02.1102149Z clflush size : 64 2025-05-07T19:43:02.1102233Z cache_alignment : 64 2025-05-07T19:43:02.1102366Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.1102446Z power management: 2025-05-07T19:43:02.1102450Z 2025-05-07T19:43:02.1102528Z processor : 78 2025-05-07T19:43:02.1102619Z vendor_id : GenuineIntel 2025-05-07T19:43:02.1102694Z cpu family : 6 2025-05-07T19:43:02.1102766Z model : 85 2025-05-07T19:43:02.1102919Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.1103002Z stepping : 7 2025-05-07T19:43:02.1103081Z microcode : 0x5003901 2025-05-07T19:43:02.1103156Z cpu MHz : 3150.383 2025-05-07T19:43:02.1103411Z cache size : 36608 KB 2025-05-07T19:43:02.1103487Z physical id : 1 2025-05-07T19:43:02.1103559Z siblings : 48 2025-05-07T19:43:02.1103631Z core id : 6 2025-05-07T19:43:02.1103711Z cpu cores : 24 2025-05-07T19:43:02.1103782Z apicid : 77 2025-05-07T19:43:02.1103859Z initial apicid : 77 2025-05-07T19:43:02.1103930Z fpu : yes 2025-05-07T19:43:02.1104014Z fpu_exception : yes 2025-05-07T19:43:02.1104204Z cpuid level : 13 2025-05-07T19:43:02.1104270Z wp : yes 2025-05-07T19:43:02.1106274Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.1106627Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.1106705Z bogomips : 5999.99 2025-05-07T19:43:02.1106775Z clflush size : 64 2025-05-07T19:43:02.1106847Z cache_alignment : 64 2025-05-07T19:43:02.1107026Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.1107109Z power management: 2025-05-07T19:43:02.1107113Z 2025-05-07T19:43:02.1107185Z processor : 79 2025-05-07T19:43:02.1107265Z vendor_id : GenuineIntel 2025-05-07T19:43:02.1107342Z cpu family : 6 2025-05-07T19:43:02.1107409Z model : 85 2025-05-07T19:43:02.1107551Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.1107623Z stepping : 7 2025-05-07T19:43:02.1107708Z microcode : 0x5003901 2025-05-07T19:43:02.1107781Z cpu MHz : 2999.998 2025-05-07T19:43:02.1107855Z cache size : 36608 KB 2025-05-07T19:43:02.1107938Z physical id : 1 2025-05-07T19:43:02.1108007Z siblings : 48 2025-05-07T19:43:02.1108076Z core id : 7 2025-05-07T19:43:02.1108143Z cpu cores : 24 2025-05-07T19:43:02.1108218Z apicid : 79 2025-05-07T19:43:02.1108293Z initial apicid : 79 2025-05-07T19:43:02.1108362Z fpu : yes 2025-05-07T19:43:02.1108433Z fpu_exception : yes 2025-05-07T19:43:02.1108510Z cpuid level : 13 2025-05-07T19:43:02.1108579Z wp : yes 2025-05-07T19:43:02.1110606Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.1110966Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.1111041Z bogomips : 5999.99 2025-05-07T19:43:02.1111111Z clflush size : 64 2025-05-07T19:43:02.1111264Z cache_alignment : 64 2025-05-07T19:43:02.1111383Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.1111462Z power management: 2025-05-07T19:43:02.1111466Z 2025-05-07T19:43:02.1111550Z processor : 80 2025-05-07T19:43:02.1111811Z vendor_id : GenuineIntel 2025-05-07T19:43:02.1111886Z cpu family : 6 2025-05-07T19:43:02.1111968Z model : 85 2025-05-07T19:43:02.1112122Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.1112200Z stepping : 7 2025-05-07T19:43:02.1112279Z microcode : 0x5003901 2025-05-07T19:43:02.1112419Z cpu MHz : 3116.598 2025-05-07T19:43:02.1112498Z cache size : 36608 KB 2025-05-07T19:43:02.1112575Z physical id : 1 2025-05-07T19:43:02.1112646Z siblings : 48 2025-05-07T19:43:02.1112785Z core id : 8 2025-05-07T19:43:02.1112859Z cpu cores : 24 2025-05-07T19:43:02.1112934Z apicid : 81 2025-05-07T19:43:02.1113025Z initial apicid : 81 2025-05-07T19:43:02.1113095Z fpu : yes 2025-05-07T19:43:02.1113177Z fpu_exception : yes 2025-05-07T19:43:02.1113253Z cpuid level : 13 2025-05-07T19:43:02.1113331Z wp : yes 2025-05-07T19:43:02.1115490Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.1115875Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.1115955Z bogomips : 5999.99 2025-05-07T19:43:02.1116033Z clflush size : 64 2025-05-07T19:43:02.1116114Z cache_alignment : 64 2025-05-07T19:43:02.1116244Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.1116325Z power management: 2025-05-07T19:43:02.1116380Z 2025-05-07T19:43:02.1116457Z processor : 81 2025-05-07T19:43:02.1116548Z vendor_id : GenuineIntel 2025-05-07T19:43:02.1116624Z cpu family : 6 2025-05-07T19:43:02.1116696Z model : 85 2025-05-07T19:43:02.1116850Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.1116934Z stepping : 7 2025-05-07T19:43:02.1117014Z microcode : 0x5003901 2025-05-07T19:43:02.1117088Z cpu MHz : 3134.860 2025-05-07T19:43:02.1117178Z cache size : 36608 KB 2025-05-07T19:43:02.1117254Z physical id : 1 2025-05-07T19:43:02.1117326Z siblings : 48 2025-05-07T19:43:02.1117398Z core id : 9 2025-05-07T19:43:02.1117485Z cpu cores : 24 2025-05-07T19:43:02.1117557Z apicid : 83 2025-05-07T19:43:02.1117635Z initial apicid : 83 2025-05-07T19:43:02.1117716Z fpu : yes 2025-05-07T19:43:02.1117797Z fpu_exception : yes 2025-05-07T19:43:02.1117873Z cpuid level : 13 2025-05-07T19:43:02.1117946Z wp : yes 2025-05-07T19:43:02.1120095Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.1120472Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.1120560Z bogomips : 5999.99 2025-05-07T19:43:02.1120638Z clflush size : 64 2025-05-07T19:43:02.1120719Z cache_alignment : 64 2025-05-07T19:43:02.1120843Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.1120931Z power management: 2025-05-07T19:43:02.1120935Z 2025-05-07T19:43:02.1121013Z processor : 82 2025-05-07T19:43:02.1121099Z vendor_id : GenuineIntel 2025-05-07T19:43:02.1121179Z cpu family : 6 2025-05-07T19:43:02.1121254Z model : 85 2025-05-07T19:43:02.1121408Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.1121485Z stepping : 7 2025-05-07T19:43:02.1121571Z microcode : 0x5003901 2025-05-07T19:43:02.1121646Z cpu MHz : 2999.998 2025-05-07T19:43:02.1121723Z cache size : 36608 KB 2025-05-07T19:43:02.1121806Z physical id : 1 2025-05-07T19:43:02.1121881Z siblings : 48 2025-05-07T19:43:02.1121953Z core id : 10 2025-05-07T19:43:02.1122028Z cpu cores : 24 2025-05-07T19:43:02.1122158Z apicid : 85 2025-05-07T19:43:02.1122238Z initial apicid : 85 2025-05-07T19:43:02.1122310Z fpu : yes 2025-05-07T19:43:02.1122393Z fpu_exception : yes 2025-05-07T19:43:02.1122466Z cpuid level : 13 2025-05-07T19:43:02.1122536Z wp : yes 2025-05-07T19:43:02.1124718Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.1125064Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.1125142Z bogomips : 5999.99 2025-05-07T19:43:02.1125221Z clflush size : 64 2025-05-07T19:43:02.1125296Z cache_alignment : 64 2025-05-07T19:43:02.1125411Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.1125483Z power management: 2025-05-07T19:43:02.1125487Z 2025-05-07T19:43:02.1125567Z processor : 83 2025-05-07T19:43:02.1125693Z vendor_id : GenuineIntel 2025-05-07T19:43:02.1125762Z cpu family : 6 2025-05-07T19:43:02.1125836Z model : 85 2025-05-07T19:43:02.1125978Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.1126048Z stepping : 7 2025-05-07T19:43:02.1126119Z microcode : 0x5003901 2025-05-07T19:43:02.1126198Z cpu MHz : 3178.307 2025-05-07T19:43:02.1126272Z cache size : 36608 KB 2025-05-07T19:43:02.1126356Z physical id : 1 2025-05-07T19:43:02.1126423Z siblings : 48 2025-05-07T19:43:02.1126490Z core id : 11 2025-05-07T19:43:02.1126570Z cpu cores : 24 2025-05-07T19:43:02.1126639Z apicid : 87 2025-05-07T19:43:02.1126718Z initial apicid : 87 2025-05-07T19:43:02.1126788Z fpu : yes 2025-05-07T19:43:02.1126871Z fpu_exception : yes 2025-05-07T19:43:02.1126942Z cpuid level : 13 2025-05-07T19:43:02.1127009Z wp : yes 2025-05-07T19:43:02.1129017Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.1129371Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.1129452Z bogomips : 5999.99 2025-05-07T19:43:02.1129528Z clflush size : 64 2025-05-07T19:43:02.1129601Z cache_alignment : 64 2025-05-07T19:43:02.1129716Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.1129799Z power management: 2025-05-07T19:43:02.1129803Z 2025-05-07T19:43:02.1129879Z processor : 84 2025-05-07T19:43:02.1129959Z vendor_id : GenuineIntel 2025-05-07T19:43:02.1130040Z cpu family : 6 2025-05-07T19:43:02.1130109Z model : 85 2025-05-07T19:43:02.1130250Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.1130320Z stepping : 7 2025-05-07T19:43:02.1130405Z microcode : 0x5003901 2025-05-07T19:43:02.1130479Z cpu MHz : 3290.339 2025-05-07T19:43:02.1130556Z cache size : 36608 KB 2025-05-07T19:43:02.1130637Z physical id : 1 2025-05-07T19:43:02.1130708Z siblings : 48 2025-05-07T19:43:02.1130778Z core id : 12 2025-05-07T19:43:02.1130847Z cpu cores : 24 2025-05-07T19:43:02.1130921Z apicid : 89 2025-05-07T19:43:02.1130997Z initial apicid : 89 2025-05-07T19:43:02.1131113Z fpu : yes 2025-05-07T19:43:02.1131186Z fpu_exception : yes 2025-05-07T19:43:02.1131269Z cpuid level : 13 2025-05-07T19:43:02.1131336Z wp : yes 2025-05-07T19:43:02.1133311Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.1133667Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.1133742Z bogomips : 5999.99 2025-05-07T19:43:02.1133811Z clflush size : 64 2025-05-07T19:43:02.1133899Z cache_alignment : 64 2025-05-07T19:43:02.1134011Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.1134084Z power management: 2025-05-07T19:43:02.1134088Z 2025-05-07T19:43:02.1134167Z processor : 85 2025-05-07T19:43:02.1134247Z vendor_id : GenuineIntel 2025-05-07T19:43:02.1134317Z cpu family : 6 2025-05-07T19:43:02.1134394Z model : 85 2025-05-07T19:43:02.1134588Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.1134658Z stepping : 7 2025-05-07T19:43:02.1134731Z microcode : 0x5003901 2025-05-07T19:43:02.1134810Z cpu MHz : 2999.998 2025-05-07T19:43:02.1134886Z cache size : 36608 KB 2025-05-07T19:43:02.1134958Z physical id : 1 2025-05-07T19:43:02.1135027Z siblings : 48 2025-05-07T19:43:02.1135105Z core id : 13 2025-05-07T19:43:02.1135176Z cpu cores : 24 2025-05-07T19:43:02.1135245Z apicid : 91 2025-05-07T19:43:02.1135332Z initial apicid : 91 2025-05-07T19:43:02.1135401Z fpu : yes 2025-05-07T19:43:02.1135472Z fpu_exception : yes 2025-05-07T19:43:02.1135546Z cpuid level : 13 2025-05-07T19:43:02.1135628Z wp : yes 2025-05-07T19:43:02.1137607Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.1137966Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.1138040Z bogomips : 5999.99 2025-05-07T19:43:02.1138110Z clflush size : 64 2025-05-07T19:43:02.1138183Z cache_alignment : 64 2025-05-07T19:43:02.1138313Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.1138386Z power management: 2025-05-07T19:43:02.1138390Z 2025-05-07T19:43:02.1138460Z processor : 86 2025-05-07T19:43:02.1138550Z vendor_id : GenuineIntel 2025-05-07T19:43:02.1138620Z cpu family : 6 2025-05-07T19:43:02.1138692Z model : 85 2025-05-07T19:43:02.1138838Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.1138916Z stepping : 7 2025-05-07T19:43:02.1138990Z microcode : 0x5003901 2025-05-07T19:43:02.1139059Z cpu MHz : 3143.626 2025-05-07T19:43:02.1139139Z cache size : 36608 KB 2025-05-07T19:43:02.1139210Z physical id : 1 2025-05-07T19:43:02.1139278Z siblings : 48 2025-05-07T19:43:02.1139346Z core id : 14 2025-05-07T19:43:02.1139421Z cpu cores : 24 2025-05-07T19:43:02.1139489Z apicid : 93 2025-05-07T19:43:02.1139562Z initial apicid : 93 2025-05-07T19:43:02.1139635Z fpu : yes 2025-05-07T19:43:02.1139707Z fpu_exception : yes 2025-05-07T19:43:02.1139777Z cpuid level : 13 2025-05-07T19:43:02.1139921Z wp : yes 2025-05-07T19:43:02.1141903Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.1142255Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.1142338Z bogomips : 5999.99 2025-05-07T19:43:02.1142411Z clflush size : 64 2025-05-07T19:43:02.1142485Z cache_alignment : 64 2025-05-07T19:43:02.1142598Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.1142685Z power management: 2025-05-07T19:43:02.1142689Z 2025-05-07T19:43:02.1142760Z processor : 87 2025-05-07T19:43:02.1142838Z vendor_id : GenuineIntel 2025-05-07T19:43:02.1142920Z cpu family : 6 2025-05-07T19:43:02.1142992Z model : 85 2025-05-07T19:43:02.1143135Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.1143273Z stepping : 7 2025-05-07T19:43:02.1143359Z microcode : 0x5003901 2025-05-07T19:43:02.1143430Z cpu MHz : 3124.010 2025-05-07T19:43:02.1143506Z cache size : 36608 KB 2025-05-07T19:43:02.1143590Z physical id : 1 2025-05-07T19:43:02.1143663Z siblings : 48 2025-05-07T19:43:02.1143730Z core id : 15 2025-05-07T19:43:02.1143798Z cpu cores : 24 2025-05-07T19:43:02.1143876Z apicid : 95 2025-05-07T19:43:02.1143951Z initial apicid : 95 2025-05-07T19:43:02.1144018Z fpu : yes 2025-05-07T19:43:02.1144108Z fpu_exception : yes 2025-05-07T19:43:02.1144185Z cpuid level : 13 2025-05-07T19:43:02.1144254Z wp : yes 2025-05-07T19:43:02.1146229Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.1146578Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.1146653Z bogomips : 5999.99 2025-05-07T19:43:02.1146731Z clflush size : 64 2025-05-07T19:43:02.1146804Z cache_alignment : 64 2025-05-07T19:43:02.1146917Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.1146994Z power management: 2025-05-07T19:43:02.1146998Z 2025-05-07T19:43:02.1147074Z processor : 88 2025-05-07T19:43:02.1147151Z vendor_id : GenuineIntel 2025-05-07T19:43:02.1147219Z cpu family : 6 2025-05-07T19:43:02.1147298Z model : 85 2025-05-07T19:43:02.1147438Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.1147509Z stepping : 7 2025-05-07T19:43:02.1147589Z microcode : 0x5003901 2025-05-07T19:43:02.1147666Z cpu MHz : 2999.998 2025-05-07T19:43:02.1147737Z cache size : 36608 KB 2025-05-07T19:43:02.1147808Z physical id : 1 2025-05-07T19:43:02.1147889Z siblings : 48 2025-05-07T19:43:02.1147959Z core id : 16 2025-05-07T19:43:02.1148029Z cpu cores : 24 2025-05-07T19:43:02.1148100Z apicid : 97 2025-05-07T19:43:02.1148179Z initial apicid : 97 2025-05-07T19:43:02.1148246Z fpu : yes 2025-05-07T19:43:02.1148318Z fpu_exception : yes 2025-05-07T19:43:02.1148393Z cpuid level : 13 2025-05-07T19:43:02.1148459Z wp : yes 2025-05-07T19:43:02.1150420Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.1150828Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.1150903Z bogomips : 5999.99 2025-05-07T19:43:02.1150972Z clflush size : 64 2025-05-07T19:43:02.1151059Z cache_alignment : 64 2025-05-07T19:43:02.1151249Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.1151327Z power management: 2025-05-07T19:43:02.1151336Z 2025-05-07T19:43:02.1151404Z processor : 89 2025-05-07T19:43:02.1151493Z vendor_id : GenuineIntel 2025-05-07T19:43:02.1151564Z cpu family : 6 2025-05-07T19:43:02.1151803Z model : 85 2025-05-07T19:43:02.1151969Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.1152044Z stepping : 7 2025-05-07T19:43:02.1152123Z microcode : 0x5003901 2025-05-07T19:43:02.1152262Z cpu MHz : 2999.998 2025-05-07T19:43:02.1152353Z cache size : 36608 KB 2025-05-07T19:43:02.1152433Z physical id : 1 2025-05-07T19:43:02.1152508Z siblings : 48 2025-05-07T19:43:02.1152593Z core id : 17 2025-05-07T19:43:02.1152667Z cpu cores : 24 2025-05-07T19:43:02.1152742Z apicid : 99 2025-05-07T19:43:02.1152821Z initial apicid : 99 2025-05-07T19:43:02.1152902Z fpu : yes 2025-05-07T19:43:02.1152984Z fpu_exception : yes 2025-05-07T19:43:02.1153060Z cpuid level : 13 2025-05-07T19:43:02.1153132Z wp : yes 2025-05-07T19:43:02.1155281Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.1155663Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.1155750Z bogomips : 5999.99 2025-05-07T19:43:02.1155831Z clflush size : 64 2025-05-07T19:43:02.1155912Z cache_alignment : 64 2025-05-07T19:43:02.1156043Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.1156121Z power management: 2025-05-07T19:43:02.1156125Z 2025-05-07T19:43:02.1156199Z processor : 90 2025-05-07T19:43:02.1156286Z vendor_id : GenuineIntel 2025-05-07T19:43:02.1156368Z cpu family : 6 2025-05-07T19:43:02.1156443Z model : 85 2025-05-07T19:43:02.1156596Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.1156680Z stepping : 7 2025-05-07T19:43:02.1156761Z microcode : 0x5003901 2025-05-07T19:43:02.1156838Z cpu MHz : 2999.998 2025-05-07T19:43:02.1156919Z cache size : 36608 KB 2025-05-07T19:43:02.1157004Z physical id : 1 2025-05-07T19:43:02.1157078Z siblings : 48 2025-05-07T19:43:02.1157147Z core id : 18 2025-05-07T19:43:02.1157225Z cpu cores : 24 2025-05-07T19:43:02.1157298Z apicid : 101 2025-05-07T19:43:02.1157378Z initial apicid : 101 2025-05-07T19:43:02.1157450Z fpu : yes 2025-05-07T19:43:02.1157542Z fpu_exception : yes 2025-05-07T19:43:02.1157619Z cpuid level : 13 2025-05-07T19:43:02.1157692Z wp : yes 2025-05-07T19:43:02.1159858Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.1160294Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.1160373Z bogomips : 5999.99 2025-05-07T19:43:02.1160457Z clflush size : 64 2025-05-07T19:43:02.1160539Z cache_alignment : 64 2025-05-07T19:43:02.1160660Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.1160740Z power management: 2025-05-07T19:43:02.1160753Z 2025-05-07T19:43:02.1160828Z processor : 91 2025-05-07T19:43:02.1160912Z vendor_id : GenuineIntel 2025-05-07T19:43:02.1160990Z cpu family : 6 2025-05-07T19:43:02.1161075Z model : 85 2025-05-07T19:43:02.1161231Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.1161308Z stepping : 7 2025-05-07T19:43:02.1161398Z microcode : 0x5003901 2025-05-07T19:43:02.1161475Z cpu MHz : 3141.875 2025-05-07T19:43:02.1161553Z cache size : 36608 KB 2025-05-07T19:43:02.1161761Z physical id : 1 2025-05-07T19:43:02.1161850Z siblings : 48 2025-05-07T19:43:02.1161925Z core id : 19 2025-05-07T19:43:02.1161999Z cpu cores : 24 2025-05-07T19:43:02.1162075Z apicid : 103 2025-05-07T19:43:02.1162168Z initial apicid : 103 2025-05-07T19:43:02.1162241Z fpu : yes 2025-05-07T19:43:02.1162320Z fpu_exception : yes 2025-05-07T19:43:02.1162405Z cpuid level : 13 2025-05-07T19:43:02.1162478Z wp : yes 2025-05-07T19:43:02.1164684Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.1165047Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.1165124Z bogomips : 5999.99 2025-05-07T19:43:02.1165195Z clflush size : 64 2025-05-07T19:43:02.1165281Z cache_alignment : 64 2025-05-07T19:43:02.1165399Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.1165473Z power management: 2025-05-07T19:43:02.1165477Z 2025-05-07T19:43:02.1165557Z processor : 92 2025-05-07T19:43:02.1165637Z vendor_id : GenuineIntel 2025-05-07T19:43:02.1165704Z cpu family : 6 2025-05-07T19:43:02.1165776Z model : 85 2025-05-07T19:43:02.1165927Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.1165998Z stepping : 7 2025-05-07T19:43:02.1166074Z microcode : 0x5003901 2025-05-07T19:43:02.1166146Z cpu MHz : 3450.944 2025-05-07T19:43:02.1166233Z cache size : 36608 KB 2025-05-07T19:43:02.1166305Z physical id : 1 2025-05-07T19:43:02.1166373Z siblings : 48 2025-05-07T19:43:02.1166446Z core id : 20 2025-05-07T19:43:02.1166516Z cpu cores : 24 2025-05-07T19:43:02.1166584Z apicid : 105 2025-05-07T19:43:02.1166657Z initial apicid : 105 2025-05-07T19:43:02.1166733Z fpu : yes 2025-05-07T19:43:02.1166806Z fpu_exception : yes 2025-05-07T19:43:02.1166876Z cpuid level : 13 2025-05-07T19:43:02.1166950Z wp : yes 2025-05-07T19:43:02.1168925Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.1169328Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.1169408Z bogomips : 5999.99 2025-05-07T19:43:02.1169478Z clflush size : 64 2025-05-07T19:43:02.1169551Z cache_alignment : 64 2025-05-07T19:43:02.1169674Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.1169748Z power management: 2025-05-07T19:43:02.1169752Z 2025-05-07T19:43:02.1169820Z processor : 93 2025-05-07T19:43:02.1169898Z vendor_id : GenuineIntel 2025-05-07T19:43:02.1169972Z cpu family : 6 2025-05-07T19:43:02.1170038Z model : 85 2025-05-07T19:43:02.1170179Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.1170253Z stepping : 7 2025-05-07T19:43:02.1170325Z microcode : 0x5003901 2025-05-07T19:43:02.1170392Z cpu MHz : 3159.185 2025-05-07T19:43:02.1170464Z cache size : 36608 KB 2025-05-07T19:43:02.1170541Z physical id : 1 2025-05-07T19:43:02.1170606Z siblings : 48 2025-05-07T19:43:02.1170671Z core id : 21 2025-05-07T19:43:02.1171426Z cpu cores : 24 2025-05-07T19:43:02.1171500Z apicid : 107 2025-05-07T19:43:02.1171574Z initial apicid : 107 2025-05-07T19:43:02.1171640Z fpu : yes 2025-05-07T19:43:02.1171718Z fpu_exception : yes 2025-05-07T19:43:02.1171790Z cpuid level : 13 2025-05-07T19:43:02.1171854Z wp : yes 2025-05-07T19:43:02.1173844Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.1174210Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.1174291Z bogomips : 5999.99 2025-05-07T19:43:02.1174386Z clflush size : 64 2025-05-07T19:43:02.1174465Z cache_alignment : 64 2025-05-07T19:43:02.1174586Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.1174684Z power management: 2025-05-07T19:43:02.1174688Z 2025-05-07T19:43:02.1174766Z processor : 94 2025-05-07T19:43:02.1174853Z vendor_id : GenuineIntel 2025-05-07T19:43:02.1174930Z cpu family : 6 2025-05-07T19:43:02.1175023Z model : 85 2025-05-07T19:43:02.1175173Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.1175254Z stepping : 7 2025-05-07T19:43:02.1175348Z microcode : 0x5003901 2025-05-07T19:43:02.1175425Z cpu MHz : 3131.101 2025-05-07T19:43:02.1175504Z cache size : 36608 KB 2025-05-07T19:43:02.1175583Z physical id : 1 2025-05-07T19:43:02.1175675Z siblings : 48 2025-05-07T19:43:02.1175751Z core id : 22 2025-05-07T19:43:02.1175827Z cpu cores : 24 2025-05-07T19:43:02.1175920Z apicid : 109 2025-05-07T19:43:02.1176002Z initial apicid : 109 2025-05-07T19:43:02.1176072Z fpu : yes 2025-05-07T19:43:02.1176150Z fpu_exception : yes 2025-05-07T19:43:02.1176237Z cpuid level : 13 2025-05-07T19:43:02.1176308Z wp : yes 2025-05-07T19:43:02.1178283Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.1178723Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.1178801Z bogomips : 5999.99 2025-05-07T19:43:02.1178877Z clflush size : 64 2025-05-07T19:43:02.1178966Z cache_alignment : 64 2025-05-07T19:43:02.1179090Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.1179171Z power management: 2025-05-07T19:43:02.1179175Z 2025-05-07T19:43:02.1179263Z processor : 95 2025-05-07T19:43:02.1179345Z vendor_id : GenuineIntel 2025-05-07T19:43:02.1179419Z cpu family : 6 2025-05-07T19:43:02.1179494Z model : 85 2025-05-07T19:43:02.1179653Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.1179730Z stepping : 7 2025-05-07T19:43:02.1179814Z microcode : 0x5003901 2025-05-07T19:43:02.1179896Z cpu MHz : 3205.241 2025-05-07T19:43:02.1179970Z cache size : 36608 KB 2025-05-07T19:43:02.1180043Z physical id : 1 2025-05-07T19:43:02.1180115Z siblings : 48 2025-05-07T19:43:02.1180201Z core id : 23 2025-05-07T19:43:02.1180273Z cpu cores : 24 2025-05-07T19:43:02.1180346Z apicid : 111 2025-05-07T19:43:02.1180494Z initial apicid : 111 2025-05-07T19:43:02.1180563Z fpu : yes 2025-05-07T19:43:02.1180640Z fpu_exception : yes 2025-05-07T19:43:02.1180714Z cpuid level : 13 2025-05-07T19:43:02.1180797Z wp : yes 2025-05-07T19:43:02.1182926Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.1183294Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.1183374Z bogomips : 5999.99 2025-05-07T19:43:02.1183447Z clflush size : 64 2025-05-07T19:43:02.1183526Z cache_alignment : 64 2025-05-07T19:43:02.1183654Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.1183730Z power management: 2025-05-07T19:43:02.1183734Z 2025-05-07T19:43:02.1183738Z 2025-05-07T19:43:02.1183843Z ################################################################################ 2025-05-07T19:43:02.1183941Z [INFO] Print PCI info ... 2025-05-07T19:43:02.1184019Z + lspci -v 2025-05-07T19:43:02.1184024Z 2025-05-07T19:43:02.1184188Z 00:00.0 Host bridge: Intel Corporation 440FX - 82441FX PMC [Natoma] 2025-05-07T19:43:02.1184312Z Subsystem: Amazon.com, Inc. Device 1237 2025-05-07T19:43:02.1184421Z Flags: bus master, medium devsel, latency 0 2025-05-07T19:43:02.1184425Z 2025-05-07T19:43:02.1184606Z 00:01.0 ISA bridge: Intel Corporation 82371SB PIIX3 ISA [Natoma/Triton II] 2025-05-07T19:43:02.1184683Z Physical Slot: 1 2025-05-07T19:43:02.1184796Z Flags: bus master, fast devsel, latency 0 2025-05-07T19:43:02.1184800Z 2025-05-07T19:43:02.1185034Z 00:01.3 Non-VGA unclassified device: Intel Corporation 82371AB/EB/MB PIIX4 ACPI (rev 08) 2025-05-07T19:43:02.1185112Z Physical Slot: 1 2025-05-07T19:43:02.1185236Z Flags: bus master, fast devsel, latency 0, IRQ 9 2025-05-07T19:43:02.1185240Z 2025-05-07T19:43:02.1185484Z 00:03.0 VGA compatible controller: Amazon.com, Inc. Device 1111 (prog-if 00 [VGA controller]) 2025-05-07T19:43:02.1185556Z Physical Slot: 3 2025-05-07T19:43:02.1185666Z Flags: bus master, fast devsel, latency 0 2025-05-07T19:43:02.1185793Z Memory at c0000000 (32-bit, prefetchable) [size=4M] 2025-05-07T19:43:02.1185961Z Expansion ROM at 000c0000 [disabled] [size=128K] 2025-05-07T19:43:02.1185965Z 2025-05-07T19:43:02.1186256Z 00:04.0 Non-Volatile memory controller: Amazon.com, Inc. NVMe EBS Controller (prog-if 02 [NVM Express]) 2025-05-07T19:43:02.1186352Z Subsystem: Amazon.com, Inc. Device 0000 2025-05-07T19:43:02.1186424Z Physical Slot: 4 2025-05-07T19:43:02.1186546Z Flags: bus master, fast devsel, latency 0, IRQ 11 2025-05-07T19:43:02.1186692Z Memory at c0514000 (32-bit, non-prefetchable) [size=16K] 2025-05-07T19:43:02.1186782Z Capabilities: 2025-05-07T19:43:02.1186867Z Kernel driver in use: nvme 2025-05-07T19:43:02.1186871Z 2025-05-07T19:43:02.1187073Z 00:05.0 Ethernet controller: Amazon.com, Inc. Elastic Network Adapter (ENA) 2025-05-07T19:43:02.1187147Z Physical Slot: 5 2025-05-07T19:43:02.1187247Z Flags: bus master, fast devsel, latency 0 2025-05-07T19:43:02.1187398Z Memory at c0510000 (32-bit, non-prefetchable) [size=16K] 2025-05-07T19:43:02.1187522Z Memory at c0400000 (32-bit, prefetchable) [size=1M] 2025-05-07T19:43:02.1187652Z Memory at c0500000 (32-bit, non-prefetchable) [size=64K] 2025-05-07T19:43:02.1187738Z Capabilities: 2025-05-07T19:43:02.1187834Z Kernel driver in use: ena 2025-05-07T19:43:02.1187838Z 2025-05-07T19:43:02.1187842Z 2025-05-07T19:43:02.1187993Z ################################################################################ 2025-05-07T19:43:02.1188091Z [INFO] Print Linux distribution info ... 2025-05-07T19:43:02.1188173Z + uname -a 2025-05-07T19:43:02.1188178Z 2025-05-07T19:43:02.1188538Z Linux 3a46c8861204 6.1.130-139.222.amzn2023.x86_64 #1 SMP PREEMPT_DYNAMIC Tue Mar 11 01:10:58 UTC 2025 x86_64 x86_64 x86_64 GNU/Linux 2025-05-07T19:43:02.1188542Z 2025-05-07T19:43:02.1188609Z + uname -m 2025-05-07T19:43:02.1188613Z 2025-05-07T19:43:02.1188685Z x86_64 2025-05-07T19:43:02.1188689Z 2025-05-07T19:43:02.1188763Z + cat /proc/version 2025-05-07T19:43:02.1188767Z 2025-05-07T19:43:02.1189314Z Linux version 6.1.130-139.222.amzn2023.x86_64 (mockbuild@ip-10-0-55-76) (gcc (GCC) 11.5.0 20240719 (Red Hat 11.5.0-5), GNU ld version 2.39-6.amzn2023.0.11) #1 SMP PREEMPT_DYNAMIC Tue Mar 11 01:10:58 UTC 2025 2025-05-07T19:43:02.1189331Z 2025-05-07T19:43:02.1189407Z + cat /etc/os-release 2025-05-07T19:43:02.1189411Z 2025-05-07T19:43:02.1189486Z NAME="Amazon Linux" 2025-05-07T19:43:02.1189564Z VERSION="2023" 2025-05-07T19:43:02.1189648Z ID="amzn" 2025-05-07T19:43:02.1189721Z ID_LIKE="fedora" 2025-05-07T19:43:02.1189797Z VERSION_ID="2023" 2025-05-07T19:43:02.1189897Z PLATFORM_ID="platform:al2023" 2025-05-07T19:43:02.1189997Z PRETTY_NAME="Amazon Linux 2023.7.20250428" 2025-05-07T19:43:02.1190067Z ANSI_COLOR="0;33" 2025-05-07T19:43:02.1190177Z CPE_NAME="cpe:2.3:o:amazon:amazon_linux:2023" 2025-05-07T19:43:02.1190353Z HOME_URL="https://aws.amazon.com/linux/amazon-linux-2023/" 2025-05-07T19:43:02.1190652Z DOCUMENTATION_URL="https://docs.aws.amazon.com/linux/" 2025-05-07T19:43:02.1190799Z SUPPORT_URL="https://aws.amazon.com/premiumsupport/" 2025-05-07T19:43:02.1191231Z BUG_REPORT_URL="https://github.com/amazonlinux/amazon-linux-2023" 2025-05-07T19:43:02.1191321Z VENDOR_NAME="AWS" 2025-05-07T19:43:02.1191428Z VENDOR_URL="https://aws.amazon.com/" 2025-05-07T19:43:02.1191518Z SUPPORT_END="2029-06-30" 2025-05-07T19:43:02.1191523Z 2025-05-07T19:43:02.1223865Z ##[group]Run . $PRELUDE; print_gpu_info 2025-05-07T19:43:02.1224018Z . $PRELUDE; print_gpu_info 2025-05-07T19:43:02.1224500Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:43:02.1224570Z env: 2025-05-07T19:43:02.1224682Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:43:02.1224774Z BUILD_ENV: build_binary 2025-05-07T19:43:02.1224863Z BUILD_TARGET: default 2025-05-07T19:43:02.1224943Z BUILD_VARIANT: cuda 2025-05-07T19:43:02.1225036Z BUILD_CUDA_VERSION: 11.8.0 2025-05-07T19:43:02.1225128Z ##[endgroup] 2025-05-07T19:43:02.5590252Z ################################################################################ 2025-05-07T19:43:02.5590871Z [INFO] Printing general display info ... 2025-05-07T19:43:02.5620994Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:43:02.6521490Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:43:02.6532064Z /usr/bin/sudo 2025-05-07T19:43:02.6546154Z which: no apt-get in (/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin) 2025-05-07T19:43:02.6555738Z /usr/bin/yum 2025-05-07T19:43:02.6556774Z [INSTALL] Updating system repositories ... 2025-05-07T19:43:02.6583376Z [EXEC] [ATTEMPT 0/3] + sudo yum update -y 2025-05-07T19:43:02.8811173Z Last metadata expiration check: 0:00:17 ago on Wed May 7 19:42:45 2025. 2025-05-07T19:43:02.9771349Z Dependencies resolved. 2025-05-07T19:43:02.9985988Z Nothing to do. 2025-05-07T19:43:02.9986665Z Complete! 2025-05-07T19:43:03.0329817Z [INSTALL] Installing system package(s): hostname lshw ... 2025-05-07T19:43:03.0354269Z [EXEC] [ATTEMPT 0/3] + sudo yum install -y hostname lshw 2025-05-07T19:43:03.2595119Z Last metadata expiration check: 0:00:18 ago on Wed May 7 19:42:45 2025. 2025-05-07T19:43:03.3122059Z Dependencies resolved. 2025-05-07T19:43:03.3290219Z ================================================================================ 2025-05-07T19:43:03.3291089Z Package Arch Version Repository Size 2025-05-07T19:43:03.3291527Z ================================================================================ 2025-05-07T19:43:03.3291848Z Installing: 2025-05-07T19:43:03.3292186Z hostname x86_64 3.23-4.amzn2023.0.3 amazonlinux 28 k 2025-05-07T19:43:03.3292651Z lshw x86_64 B.02.19.2-7.amzn2023.0.3 amazonlinux 319 k 2025-05-07T19:43:03.3292954Z 2025-05-07T19:43:03.3293063Z Transaction Summary 2025-05-07T19:43:03.3293400Z ================================================================================ 2025-05-07T19:43:03.3293716Z Install 2 Packages 2025-05-07T19:43:03.3293854Z 2025-05-07T19:43:03.3293973Z Total download size: 347 k 2025-05-07T19:43:03.3294250Z Installed size: 883 k 2025-05-07T19:43:03.3294502Z Downloading Packages: 2025-05-07T19:43:03.4353711Z (1/2): hostname-3.23-4.amzn2023.0.3.x86_64.rpm 1.8 MB/s | 28 kB 00:00 2025-05-07T19:43:03.4444476Z (2/2): lshw-B.02.19.2-7.amzn2023.0.3.x86_64.rpm 12 MB/s | 319 kB 00:00 2025-05-07T19:43:03.4458707Z -------------------------------------------------------------------------------- 2025-05-07T19:43:03.4459866Z Total 2.9 MB/s | 347 kB 00:00 2025-05-07T19:43:03.4669981Z Running transaction check 2025-05-07T19:43:03.4720950Z Transaction check succeeded. 2025-05-07T19:43:03.4721929Z Running transaction test 2025-05-07T19:43:03.4873286Z Transaction test succeeded. 2025-05-07T19:43:03.4874200Z Running transaction 2025-05-07T19:43:03.5145852Z Preparing : 1/1 2025-05-07T19:43:03.5219867Z Installing : lshw-B.02.19.2-7.amzn2023.0.3.x86_64 1/2 2025-05-07T19:43:03.5255289Z Installing : hostname-3.23-4.amzn2023.0.3.x86_64 2/2 2025-05-07T19:43:04.7156227Z Running scriptlet: hostname-3.23-4.amzn2023.0.3.x86_64 2/2 2025-05-07T19:43:04.7157773Z Verifying : hostname-3.23-4.amzn2023.0.3.x86_64 1/2 2025-05-07T19:43:04.7526108Z Verifying : lshw-B.02.19.2-7.amzn2023.0.3.x86_64 2/2 2025-05-07T19:43:04.7527900Z 2025-05-07T19:43:04.7528146Z Installed: 2025-05-07T19:43:04.7529126Z hostname-3.23-4.amzn2023.0.3.x86_64 lshw-B.02.19.2-7.amzn2023.0.3.x86_64 2025-05-07T19:43:04.7530119Z 2025-05-07T19:43:04.7530350Z Complete! 2025-05-07T19:43:04.7917267Z + hostname 2025-05-07T19:43:04.7917947Z 2025-05-07T19:43:04.7937799Z 3a46c8861204 2025-05-07T19:43:04.7939051Z 2025-05-07T19:43:04.7939253Z + sudo lshw -C display 2025-05-07T19:43:04.7939569Z 2025-05-07T19:43:04.9909754Z *-display UNCLAIMED 2025-05-07T19:43:04.9910229Z description: VGA compatible controller 2025-05-07T19:43:04.9911251Z product: Amazon.com, Inc. 2025-05-07T19:43:04.9911788Z vendor: Amazon.com, Inc. 2025-05-07T19:43:04.9912069Z physical id: 3 2025-05-07T19:43:04.9912407Z bus info: pci@0000:00:03.0 2025-05-07T19:43:04.9912673Z version: 00 2025-05-07T19:43:04.9912921Z width: 32 bits 2025-05-07T19:43:04.9913144Z clock: 33MHz 2025-05-07T19:43:04.9913417Z capabilities: vga_controller bus_master 2025-05-07T19:43:04.9913740Z configuration: latency=0 2025-05-07T19:43:04.9914089Z resources: memory:c0000000-c03fffff memory:c0000-dffff 2025-05-07T19:43:04.9933529Z 2025-05-07T19:43:04.9934106Z ################################################################################ 2025-05-07T19:43:04.9935481Z [INFO] Printing NVIDIA GPU info ... 2025-05-07T19:43:05.0044908Z lspci: Unable to load libkmod resources: error -2 2025-05-07T19:43:05.0073881Z which: no nvidia-smi in (/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin) 2025-05-07T19:43:05.0074420Z [CHECK] nvidia-smi not found 2025-05-07T19:43:05.0074734Z ################################################################################ 2025-05-07T19:43:05.0075073Z [INFO] Printing AMD GPU info ... 2025-05-07T19:43:05.0185523Z lspci: Unable to load libkmod resources: error -2 2025-05-07T19:43:05.0213948Z which: no rocminfo in (/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin) 2025-05-07T19:43:05.0214483Z [CHECK] rocminfo not found 2025-05-07T19:43:05.0227610Z which: no rocm-smi in (/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin) 2025-05-07T19:43:05.0228625Z [CHECK] rocm-smi not found 2025-05-07T19:43:05.0298287Z ##[group]Run . $PRELUDE; setup_miniconda $HOME/miniconda 2025-05-07T19:43:05.0298828Z . $PRELUDE; setup_miniconda $HOME/miniconda 2025-05-07T19:43:05.0299408Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:43:05.0299787Z env: 2025-05-07T19:43:05.0300036Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:43:05.0300417Z BUILD_ENV: build_binary 2025-05-07T19:43:05.0300682Z BUILD_TARGET: default 2025-05-07T19:43:05.0301001Z BUILD_VARIANT: cuda 2025-05-07T19:43:05.0301291Z BUILD_CUDA_VERSION: 11.8.0 2025-05-07T19:43:05.0301567Z ##[endgroup] 2025-05-07T19:43:05.4789250Z ################################################################################ 2025-05-07T19:43:05.4790168Z # Setup Miniconda 2025-05-07T19:43:05.4790737Z # 2025-05-07T19:43:05.4813478Z # [2025-05-07T19:43:05.480Z] + setup_miniconda /github/home/miniconda 2025-05-07T19:43:05.4813955Z ################################################################################ 2025-05-07T19:43:05.4814384Z 2025-05-07T19:43:05.4833455Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:43:05.5667423Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:43:05.5667898Z + mkdir -p /github/home/miniconda 2025-05-07T19:43:05.5668114Z 2025-05-07T19:43:05.5680460Z 2025-05-07T19:43:05.5680676Z [SETUP] Downloading the Miniconda installer ... 2025-05-07T19:43:05.5708563Z [EXEC] [ATTEMPT 0/3] + wget -q https://repo.anaconda.com/miniconda/Miniconda3-latest-Linux-x86_64.sh -O miniconda.sh 2025-05-07T19:43:06.4844228Z [SETUP] Installing Miniconda ... 2025-05-07T19:43:06.4844770Z + bash miniconda.sh -b -p /github/home/miniconda -u 2025-05-07T19:43:06.4845045Z 2025-05-07T19:43:06.4986589Z PREFIX=/github/home/miniconda 2025-05-07T19:43:06.8580143Z Unpacking payload ... 2025-05-07T19:43:07.3353393Z entry_point.py:256: DeprecationWarning: Python 3.14 will, by default, filter extracted tar archives and reject files or modify their metadata. Use the filter argument to control this behavior. 2025-05-07T19:43:08.0066589Z entry_point.py:256: DeprecationWarning: Python 3.14 will, by default, filter extracted tar archives and reject files or modify their metadata. Use the filter argument to control this behavior. 2025-05-07T19:43:09.8701280Z 2025-05-07T19:43:09.8701821Z Installing base environment... 2025-05-07T19:43:09.8702108Z 2025-05-07T19:43:10.8572425Z Preparing transaction: ...working... done 2025-05-07T19:43:13.7693724Z Executing transaction: ...working... done 2025-05-07T19:43:14.3167092Z entry_point.py:256: DeprecationWarning: Python 3.14 will, by default, filter extracted tar archives and reject files or modify their metadata. Use the filter argument to control this behavior. 2025-05-07T19:43:14.3834393Z installation finished. 2025-05-07T19:43:14.3837676Z 2025-05-07T19:43:14.3838113Z + rm -f miniconda.sh 2025-05-07T19:43:14.3838277Z 2025-05-07T19:43:14.4001691Z 2025-05-07T19:43:14.4001893Z [SETUP] Reloading the bash configuration ... 2025-05-07T19:43:14.4002316Z + /github/home/miniconda/bin/conda init bash 2025-05-07T19:43:14.4002536Z 2025-05-07T19:43:14.7602542Z no change /github/home/miniconda/condabin/conda 2025-05-07T19:43:14.7603713Z no change /github/home/miniconda/bin/conda 2025-05-07T19:43:14.7604738Z no change /github/home/miniconda/bin/conda-env 2025-05-07T19:43:14.7605809Z no change /github/home/miniconda/bin/activate 2025-05-07T19:43:14.7606882Z no change /github/home/miniconda/bin/deactivate 2025-05-07T19:43:14.7608049Z no change /github/home/miniconda/etc/profile.d/conda.sh 2025-05-07T19:43:14.7609308Z no change /github/home/miniconda/etc/fish/conf.d/conda.fish 2025-05-07T19:43:14.7610615Z no change /github/home/miniconda/shell/condabin/Conda.psm1 2025-05-07T19:43:14.7611204Z no change /github/home/miniconda/shell/condabin/conda-hook.ps1 2025-05-07T19:43:14.7611712Z no change /github/home/miniconda/lib/python3.13/site-packages/xontrib/conda.xsh 2025-05-07T19:43:14.7612561Z no change /github/home/miniconda/etc/profile.d/conda.csh 2025-05-07T19:43:14.7612945Z modified /github/home/.bashrc 2025-05-07T19:43:14.7613166Z 2025-05-07T19:43:14.7613372Z ==> For changes to take effect, close and re-open your current shell. <== 2025-05-07T19:43:14.7613672Z 2025-05-07T19:43:14.8133628Z 2025-05-07T19:43:14.8134035Z + . /github/home/.bashrc 2025-05-07T19:43:14.8134243Z 2025-05-07T19:43:15.5949342Z 2025-05-07T19:43:15.5950491Z [SETUP] Installing libmamba-solver (required since Anaconda 2024.02-1) and libarchive ... 2025-05-07T19:43:15.5974337Z [EXEC] [ATTEMPT 0/3] + conda install --solver=classic -c conda-forge --override-channels -y conda-libmamba-solver libmamba libmambapy libarchive 2025-05-07T19:43:27.2727359Z Collecting package metadata (current_repodata.json): - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / done 2025-05-07T19:43:28.7331559Z Solving environment: \ | / - \ | / - \ | / done 2025-05-07T19:43:28.8221249Z 2025-05-07T19:43:28.8221829Z ## Package Plan ## 2025-05-07T19:43:28.8222017Z 2025-05-07T19:43:28.8222356Z environment location: /github/home/miniconda 2025-05-07T19:43:28.8222654Z 2025-05-07T19:43:28.8222773Z added / updated specs: 2025-05-07T19:43:28.8223081Z - conda-libmamba-solver 2025-05-07T19:43:28.8224092Z - libarchive 2025-05-07T19:43:28.8224348Z - libmamba 2025-05-07T19:43:28.8224571Z - libmambapy 2025-05-07T19:43:28.8224727Z 2025-05-07T19:43:28.8224731Z 2025-05-07T19:43:28.8224857Z The following packages will be downloaded: 2025-05-07T19:43:28.8225070Z 2025-05-07T19:43:28.8225218Z package | build 2025-05-07T19:43:28.8225545Z ---------------------------|----------------- 2025-05-07T19:43:28.8225989Z ca-certificates-2025.4.26 | hbd8a1cb_0 149 KB conda-forge 2025-05-07T19:43:28.8226639Z certifi-2025.4.26 | pyhd8ed1ab_0 154 KB conda-forge 2025-05-07T19:43:28.8227123Z conda-25.3.1 | py313h78bf25f_1 1.1 MB conda-forge 2025-05-07T19:43:28.8227618Z conda-libmamba-solver-25.4.0| pyhd8ed1ab_0 41 KB conda-forge 2025-05-07T19:43:28.8228474Z ------------------------------------------------------------ 2025-05-07T19:43:28.8228872Z Total: 1.4 MB 2025-05-07T19:43:28.8229100Z 2025-05-07T19:43:28.8229227Z The following packages will be UPDATED: 2025-05-07T19:43:28.8229476Z 2025-05-07T19:43:28.8234961Z ca-certificates pkgs/main/linux-64::ca-certificates-2~ --> conda-forge/noarch::ca-certificates-2025.4.26-hbd8a1cb_0 2025-05-07T19:43:28.8235892Z conda pkgs/main::conda-25.3.1-py313h06a4308~ --> conda-forge::conda-25.3.1-py313h78bf25f_1 2025-05-07T19:43:28.8236317Z 2025-05-07T19:43:28.8236557Z The following packages will be SUPERSEDED by a higher-priority channel: 2025-05-07T19:43:28.8236928Z 2025-05-07T19:43:28.8237271Z certifi pkgs/main/linux-64::certifi-2025.4.26~ --> conda-forge/noarch::certifi-2025.4.26-pyhd8ed1ab_0 2025-05-07T19:43:28.8238279Z conda-libmamba-so~ pkgs/main::conda-libmamba-solver-25.4~ --> conda-forge::conda-libmamba-solver-25.4.0-pyhd8ed1ab_0 2025-05-07T19:43:28.8238787Z 2025-05-07T19:43:28.8238791Z 2025-05-07T19:43:28.8238795Z 2025-05-07T19:43:28.8238948Z Downloading and Extracting Packages: ...working... 2025-05-07T19:43:28.8239360Z conda-25.3.1 | 1.1 MB | | 0% 2025-05-07T19:43:28.8239595Z 2025-05-07T19:43:28.8239907Z certifi-2025.4.26 | 154 KB | | 0%  2025-05-07T19:43:28.8240298Z 2025-05-07T19:43:28.8240301Z 2025-05-07T19:43:28.8240541Z ca-certificates-2025 | 149 KB | | 0%  2025-05-07T19:43:28.8240802Z 2025-05-07T19:43:28.8240805Z 2025-05-07T19:43:28.8241066Z 2025-05-07T19:43:28.8827198Z conda-libmamba-solve | 41 KB | | 0%  2025-05-07T19:43:28.8827894Z 2025-05-07T19:43:28.8827899Z 2025-05-07T19:43:28.8915397Z ca-certificates-2025 | 149 KB | ########## | 100%  2025-05-07T19:43:28.8915718Z 2025-05-07T19:43:28.8915723Z 2025-05-07T19:43:28.8939719Z ca-certificates-2025 | 149 KB | ########## | 100%  2025-05-07T19:43:28.8940683Z 2025-05-07T19:43:28.8940699Z 2025-05-07T19:43:28.8940712Z 2025-05-07T19:43:28.9034447Z conda-libmamba-solve | 41 KB | ########## | 100%  2025-05-07T19:43:28.9036868Z 2025-05-07T19:43:28.9166385Z certifi-2025.4.26 | 154 KB | ########## | 100%  2025-05-07T19:43:28.9166765Z 2025-05-07T19:43:28.9167040Z 2025-05-07T19:43:28.9167051Z 2025-05-07T19:43:28.9216241Z conda-libmamba-solve | 41 KB | ########## | 100%  2025-05-07T19:43:28.9217139Z 2025-05-07T19:43:28.9262175Z certifi-2025.4.26 | 154 KB | ########## | 100%  2025-05-07T19:43:28.9310040Z conda-25.3.1 | 1.1 MB | #######7 | 78% 2025-05-07T19:43:29.0230968Z conda-25.3.1 | 1.1 MB | ########## | 100% 2025-05-07T19:43:29.0233033Z conda-25.3.1 | 1.1 MB | ########## | 100% 2025-05-07T19:43:29.0233418Z 2025-05-07T19:43:29.0233628Z 2025-05-07T19:43:29.0233908Z  2025-05-07T19:43:29.0234146Z 2025-05-07T19:43:29.0234167Z 2025-05-07T19:43:29.0234339Z  2025-05-07T19:43:29.0234553Z 2025-05-07T19:43:29.0234557Z 2025-05-07T19:43:29.0234561Z 2025-05-07T19:43:29.0234761Z  done 2025-05-07T19:43:29.1248539Z Preparing transaction: \ done 2025-05-07T19:43:29.2258634Z Verifying transaction: / done 2025-05-07T19:43:30.5291444Z Executing transaction: \ | / - \ | / - \ | / - \ done 2025-05-07T19:43:32.0909721Z [SETUP] Updating Miniconda base packages ... 2025-05-07T19:43:32.0933029Z [EXEC] [ATTEMPT 0/3] + conda update -n base -c defaults --update-deps -y conda 2025-05-07T19:43:32.7886200Z Channels: 2025-05-07T19:43:32.7886690Z - defaults 2025-05-07T19:43:32.7886925Z Platform: linux-64 2025-05-07T19:43:33.8873021Z Collecting package metadata (repodata.json): - \ | / - \ | done 2025-05-07T19:43:34.0169574Z Solving environment: - \ Channels: 2025-05-07T19:43:34.3018395Z - defaults 2025-05-07T19:43:34.3019064Z Platform: linux-64 2025-05-07T19:43:34.3020531Z Collecting package metadata (repodata.json): / - \ | done 2025-05-07T19:43:34.5254247Z Solving environment: - \ | / done 2025-05-07T19:43:34.6201397Z done 2025-05-07T19:43:34.6851149Z 2025-05-07T19:43:34.6851495Z ## Package Plan ## 2025-05-07T19:43:34.6851701Z 2025-05-07T19:43:34.6851845Z environment location: /github/home/miniconda 2025-05-07T19:43:34.6852089Z 2025-05-07T19:43:34.6852186Z added / updated specs: 2025-05-07T19:43:34.6852468Z - conda 2025-05-07T19:43:34.6852589Z 2025-05-07T19:43:34.6852593Z 2025-05-07T19:43:34.6852741Z The following packages will be downloaded: 2025-05-07T19:43:34.6852959Z 2025-05-07T19:43:34.6853073Z package | build 2025-05-07T19:43:34.6853410Z ---------------------------|----------------- 2025-05-07T19:43:34.6853765Z pip-25.1 | pyhc872135_2 1.3 MB 2025-05-07T19:43:34.6854165Z tzdata-2025b | h04d1e81_0 116 KB 2025-05-07T19:43:34.6854557Z ------------------------------------------------------------ 2025-05-07T19:43:34.6854897Z Total: 1.4 MB 2025-05-07T19:43:34.6855109Z 2025-05-07T19:43:34.6855238Z The following packages will be UPDATED: 2025-05-07T19:43:34.6855448Z 2025-05-07T19:43:34.6855759Z pip pkgs/main/linux-64::pip-25.0-py313h06~ --> pkgs/main/noarch::pip-25.1-pyhc872135_2 2025-05-07T19:43:34.6856604Z tzdata 2025a-h04d1e81_0 --> 2025b-h04d1e81_0 2025-05-07T19:43:34.6856875Z 2025-05-07T19:43:34.6856878Z 2025-05-07T19:43:34.6856882Z 2025-05-07T19:43:34.6857043Z Downloading and Extracting Packages: ...working... 2025-05-07T19:43:34.6857417Z pip-25.1 | 1.3 MB | | 0% 2025-05-07T19:43:34.6857689Z 2025-05-07T19:43:34.7253316Z tzdata-2025b | 116 KB | | 0%  2025-05-07T19:43:34.7253613Z 2025-05-07T19:43:34.8088758Z tzdata-2025b | 116 KB | ########## | 100%  2025-05-07T19:43:34.8290100Z pip-25.1 | 1.3 MB | 1 | 1% 2025-05-07T19:43:34.8296909Z pip-25.1 | 1.3 MB | ########## | 100% 2025-05-07T19:43:34.8297154Z 2025-05-07T19:43:34.8299614Z tzdata-2025b | 116 KB | ########## | 100%  2025-05-07T19:43:34.8300345Z 2025-05-07T19:43:34.9148898Z tzdata-2025b | 116 KB | ########## | 100%  2025-05-07T19:43:34.9149358Z pip-25.1 | 1.3 MB | ########## | 100% 2025-05-07T19:43:34.9152703Z pip-25.1 | 1.3 MB | ########## | 100% 2025-05-07T19:43:34.9153035Z 2025-05-07T19:43:34.9153239Z 2025-05-07T19:43:34.9153780Z  done 2025-05-07T19:43:35.0163783Z Preparing transaction: \ done 2025-05-07T19:43:35.1170506Z Verifying transaction: / done 2025-05-07T19:43:37.1204728Z Executing transaction: \ | / - \ | / - \ | / - \ | / - \ | / - done 2025-05-07T19:43:37.6529252Z [SETUP] Cleaning up Conda packages ... 2025-05-07T19:43:37.6529665Z + conda clean --packages --tarball -y 2025-05-07T19:43:37.6529877Z 2025-05-07T19:43:38.0954440Z Will remove 99 (117.8 MB) tarball(s). 2025-05-07T19:43:38.0955360Z Will remove 11 (16.0 MB) package(s). 2025-05-07T19:43:38.1502046Z 2025-05-07T19:43:38.1511354Z + conda clean --all -y 2025-05-07T19:43:38.1511570Z 2025-05-07T19:43:38.5946169Z There are no unused tarball(s) to remove. 2025-05-07T19:43:38.5947161Z Will remove 1 index cache(s). 2025-05-07T19:43:38.5947979Z There are no unused package(s) to remove. 2025-05-07T19:43:38.5948894Z There are no tempfile(s) to remove. 2025-05-07T19:43:38.5949422Z There are no logfile(s) to remove. 2025-05-07T19:43:38.6481561Z 2025-05-07T19:43:38.6484714Z + conda info 2025-05-07T19:43:38.6485270Z 2025-05-07T19:43:39.2095271Z 2025-05-07T19:43:39.2096005Z active environment : base 2025-05-07T19:43:39.2096986Z active env location : /github/home/miniconda 2025-05-07T19:43:39.2097932Z shell level : 1 2025-05-07T19:43:39.2098741Z user config file : /github/home/.condarc 2025-05-07T19:43:39.2099840Z populated config files : /github/home/miniconda/.condarc 2025-05-07T19:43:39.2100904Z conda version : 25.3.1 2025-05-07T19:43:39.2101475Z conda-build version : not installed 2025-05-07T19:43:39.2101775Z python version : 3.13.2.final.0 2025-05-07T19:43:39.2102074Z solver : libmamba (default) 2025-05-07T19:43:39.2102401Z virtual packages : __archspec=1=cascadelake 2025-05-07T19:43:39.2102706Z __conda=25.3.1=0 2025-05-07T19:43:39.2102990Z __glibc=2.34=0 2025-05-07T19:43:39.2103276Z __linux=6.1.130=0 2025-05-07T19:43:39.2103540Z __unix=0=0 2025-05-07T19:43:39.2103882Z base environment : /github/home/miniconda (writable) 2025-05-07T19:43:39.2104260Z conda av data dir : /github/home/miniconda/etc/conda 2025-05-07T19:43:39.2104606Z conda av metadata url : None 2025-05-07T19:43:39.2104957Z channel URLs : https://repo.anaconda.com/pkgs/main/linux-64 2025-05-07T19:43:39.2105386Z https://repo.anaconda.com/pkgs/main/noarch 2025-05-07T19:43:39.2105753Z https://repo.anaconda.com/pkgs/r/linux-64 2025-05-07T19:43:39.2106130Z https://repo.anaconda.com/pkgs/r/noarch 2025-05-07T19:43:39.2106750Z package cache : /github/home/miniconda/pkgs 2025-05-07T19:43:39.2107082Z /github/home/.conda/pkgs 2025-05-07T19:43:39.2107436Z envs directories : /github/home/miniconda/envs 2025-05-07T19:43:39.2107755Z /github/home/.conda/envs 2025-05-07T19:43:39.2108060Z platform : linux-64 2025-05-07T19:43:39.2108895Z user-agent : conda/25.3.1 requests/2.32.3 CPython/3.13.2 Linux/6.1.130-139.222.amzn2023.x86_64 amzn/2023.7.20250428 glibc/2.34 solver/libmamba conda-libmamba-solver/25.4.0 libmambapy/2.0.5 aau/0.7.0 c/. s/. e/. 2025-05-07T19:43:39.2109737Z UID:GID : 0:0 2025-05-07T19:43:39.2109998Z netrc file : None 2025-05-07T19:43:39.2110241Z offline mode : False 2025-05-07T19:43:39.2110403Z 2025-05-07T19:43:39.2676807Z 2025-05-07T19:43:39.2677736Z [SETUP] Exporting Miniconda variables ... 2025-05-07T19:43:39.2679381Z [SETUP] Saving Miniconda variables to /__w/_temp/_runner_file_commands/add_path_3185666f-07bd-4d1c-968f-762cbf14a1d5 ... 2025-05-07T19:43:39.2680086Z [SETUP] Successfully set up Miniconda at /github/home/miniconda 2025-05-07T19:43:39.2832327Z ##[group]Run . $PRELUDE; create_conda_environment $BUILD_ENV 3.13 2025-05-07T19:43:39.2832883Z . $PRELUDE; create_conda_environment $BUILD_ENV 3.13 2025-05-07T19:43:39.2833907Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:43:39.2834225Z env: 2025-05-07T19:43:39.2834472Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:43:39.2834777Z BUILD_ENV: build_binary 2025-05-07T19:43:39.2835039Z BUILD_TARGET: default 2025-05-07T19:43:39.2835288Z BUILD_VARIANT: cuda 2025-05-07T19:43:39.2835524Z BUILD_CUDA_VERSION: 11.8.0 2025-05-07T19:43:39.2835793Z ##[endgroup] 2025-05-07T19:43:39.7220326Z ################################################################################ 2025-05-07T19:43:39.7220735Z # Create Conda Environment 2025-05-07T19:43:39.7220985Z # 2025-05-07T19:43:39.7232945Z # [2025-05-07T19:43:39.722Z] + create_conda_environment build_binary 3.13 2025-05-07T19:43:39.7233438Z ################################################################################ 2025-05-07T19:43:39.7233667Z 2025-05-07T19:43:39.7259802Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:43:39.8161625Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:43:39.8162295Z [SETUP] Listing existing Conda environments ... 2025-05-07T19:43:39.8162623Z + conda info --envs 2025-05-07T19:43:39.8162765Z 2025-05-07T19:43:40.3923856Z 2025-05-07T19:43:40.3924453Z # conda environments: 2025-05-07T19:43:40.3925171Z # 2025-05-07T19:43:40.3925790Z base /github/home/miniconda 2025-05-07T19:43:40.3926428Z 2025-05-07T19:43:40.4495862Z 2025-05-07T19:43:40.4496722Z [SETUP] Deleting the prefix directory if it exists ... 2025-05-07T19:43:42.0826117Z + rm -rf /github/home/miniconda/envs/build_binary 2025-05-07T19:43:42.0826421Z 2025-05-07T19:43:42.0844689Z 2025-05-07T19:43:42.0860263Z [SETUP] Creating new Conda environment (Python 3.13) ... 2025-05-07T19:43:42.0883604Z [EXEC] [ATTEMPT 0/3] + conda create -y -n build_binary python=3.13 2025-05-07T19:43:42.6495709Z Channels: 2025-05-07T19:43:42.6495988Z - defaults 2025-05-07T19:43:42.6496219Z Platform: linux-64 2025-05-07T19:43:44.0252117Z Collecting package metadata (repodata.json): - \ | / - \ | / - done 2025-05-07T19:43:44.1261090Z Solving environment: | done 2025-05-07T19:43:44.1548032Z 2025-05-07T19:43:44.1548252Z ## Package Plan ## 2025-05-07T19:43:44.1548474Z 2025-05-07T19:43:44.1548756Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:43:44.1549122Z 2025-05-07T19:43:44.1549225Z added / updated specs: 2025-05-07T19:43:44.1549490Z - python=3.13 2025-05-07T19:43:44.1549624Z 2025-05-07T19:43:44.1549629Z 2025-05-07T19:43:44.1549751Z The following packages will be downloaded: 2025-05-07T19:43:44.1549988Z 2025-05-07T19:43:44.1550103Z package | build 2025-05-07T19:43:44.1550437Z ---------------------------|----------------- 2025-05-07T19:43:44.1550822Z _libgcc_mutex-0.1 | main 3 KB 2025-05-07T19:43:44.1551346Z _openmp_mutex-5.1 | 1_gnu 21 KB 2025-05-07T19:43:44.1551774Z ca-certificates-2025.2.25 | h06a4308_0 129 KB 2025-05-07T19:43:44.1552218Z python_abi-3.13 | 0_cp313 6 KB 2025-05-07T19:43:44.1552594Z ------------------------------------------------------------ 2025-05-07T19:43:44.1552947Z Total: 159 KB 2025-05-07T19:43:44.1553160Z 2025-05-07T19:43:44.1553305Z The following NEW packages will be INSTALLED: 2025-05-07T19:43:44.1553534Z 2025-05-07T19:43:44.1553735Z _libgcc_mutex pkgs/main/linux-64::_libgcc_mutex-0.1-main 2025-05-07T19:43:44.1554201Z _openmp_mutex pkgs/main/linux-64::_openmp_mutex-5.1-1_gnu 2025-05-07T19:43:44.1554625Z bzip2 pkgs/main/linux-64::bzip2-1.0.8-h5eee18b_6 2025-05-07T19:43:44.1555371Z ca-certificates pkgs/main/linux-64::ca-certificates-2025.2.25-h06a4308_0 2025-05-07T19:43:44.1555880Z expat pkgs/main/linux-64::expat-2.7.1-h6a678d5_0 2025-05-07T19:43:44.1556334Z ld_impl_linux-64 pkgs/main/linux-64::ld_impl_linux-64-2.40-h12ee557_0 2025-05-07T19:43:44.1556827Z libffi pkgs/main/linux-64::libffi-3.4.4-h6a678d5_1 2025-05-07T19:43:44.1557262Z libgcc-ng pkgs/main/linux-64::libgcc-ng-11.2.0-h1234567_1 2025-05-07T19:43:44.1557717Z libgomp pkgs/main/linux-64::libgomp-11.2.0-h1234567_1 2025-05-07T19:43:44.1558168Z libmpdec pkgs/main/linux-64::libmpdec-4.0.0-h5eee18b_0 2025-05-07T19:43:44.1558637Z libstdcxx-ng pkgs/main/linux-64::libstdcxx-ng-11.2.0-h1234567_1 2025-05-07T19:43:44.1559112Z libuuid pkgs/main/linux-64::libuuid-1.41.5-h5eee18b_0 2025-05-07T19:43:44.1559539Z ncurses pkgs/main/linux-64::ncurses-6.4-h6a678d5_0 2025-05-07T19:43:44.1559978Z openssl pkgs/main/linux-64::openssl-3.0.16-h5eee18b_0 2025-05-07T19:43:44.1560389Z pip pkgs/main/noarch::pip-25.1-pyhc872135_2 2025-05-07T19:43:44.1560826Z python pkgs/main/linux-64::python-3.13.2-hf623796_100_cp313 2025-05-07T19:43:44.1561289Z python_abi pkgs/main/linux-64::python_abi-3.13-0_cp313 2025-05-07T19:43:44.1561844Z readline pkgs/main/linux-64::readline-8.2-h5eee18b_0 2025-05-07T19:43:44.1562337Z setuptools pkgs/main/linux-64::setuptools-78.1.1-py313h06a4308_0 2025-05-07T19:43:44.1562811Z sqlite pkgs/main/linux-64::sqlite-3.45.3-h5eee18b_0 2025-05-07T19:43:44.1563222Z tk pkgs/main/linux-64::tk-8.6.14-h39e8969_0 2025-05-07T19:43:44.1563627Z tzdata pkgs/main/noarch::tzdata-2025b-h04d1e81_0 2025-05-07T19:43:44.1564045Z wheel pkgs/main/linux-64::wheel-0.45.1-py313h06a4308_0 2025-05-07T19:43:44.1564451Z xz pkgs/main/linux-64::xz-5.6.4-h5eee18b_1 2025-05-07T19:43:44.1564820Z zlib pkgs/main/linux-64::zlib-1.2.13-h5eee18b_1 2025-05-07T19:43:44.1565086Z 2025-05-07T19:43:44.1565091Z 2025-05-07T19:43:44.1565095Z 2025-05-07T19:43:44.1565238Z Downloading and Extracting Packages: ...working... 2025-05-07T19:43:44.1565632Z ca-certificates-2025 | 129 KB | | 0% 2025-05-07T19:43:44.1565881Z 2025-05-07T19:43:44.1566191Z _openmp_mutex-5.1 | 21 KB | | 0%  2025-05-07T19:43:44.1566453Z 2025-05-07T19:43:44.1566457Z 2025-05-07T19:43:44.1581042Z python_abi-3.13 | 6 KB | | 0%  2025-05-07T19:43:44.1581853Z 2025-05-07T19:43:44.1581867Z 2025-05-07T19:43:44.1581878Z 2025-05-07T19:43:44.1983483Z _libgcc_mutex-0.1 | 3 KB | | 0%  2025-05-07T19:43:44.1983774Z 2025-05-07T19:43:44.2022869Z _openmp_mutex-5.1 | 21 KB | ########## | 100%  2025-05-07T19:43:44.2023229Z 2025-05-07T19:43:44.2023237Z 2025-05-07T19:43:44.2023242Z 2025-05-07T19:43:44.2093706Z _libgcc_mutex-0.1 | 3 KB | ########## | 100%  2025-05-07T19:43:44.2094571Z 2025-05-07T19:43:44.2094635Z 2025-05-07T19:43:44.2094648Z 2025-05-07T19:43:44.2095308Z _libgcc_mutex-0.1 | 3 KB | ########## | 100%  2025-05-07T19:43:44.2193905Z ca-certificates-2025 | 129 KB | ########## | 100% 2025-05-07T19:43:44.2235053Z ca-certificates-2025 | 129 KB | ########## | 100% 2025-05-07T19:43:44.2235883Z 2025-05-07T19:43:44.2452878Z _openmp_mutex-5.1 | 21 KB | ########## | 100%  2025-05-07T19:43:44.2453179Z 2025-05-07T19:43:44.2453393Z 2025-05-07T19:43:44.2503788Z python_abi-3.13 | 6 KB | ########## | 100%  2025-05-07T19:43:44.2504452Z 2025-05-07T19:43:44.2504486Z 2025-05-07T19:43:44.2512270Z python_abi-3.13 | 6 KB | ########## | 100%  2025-05-07T19:43:44.2512828Z 2025-05-07T19:43:44.2513053Z 2025-05-07T19:43:44.2513248Z  2025-05-07T19:43:44.2513503Z 2025-05-07T19:43:44.2513507Z 2025-05-07T19:43:44.2514034Z  2025-05-07T19:43:44.2514264Z 2025-05-07T19:43:44.2514268Z 2025-05-07T19:43:44.2514272Z 2025-05-07T19:43:44.2514494Z  done 2025-05-07T19:43:44.4626613Z Preparing transaction: - \ done 2025-05-07T19:43:46.0144358Z Verifying transaction: / - \ | / - \ | / - \ | / - done 2025-05-07T19:43:48.2304165Z Executing transaction: | / - \ | / - \ | / - \ | / - \ | / - \ | / done 2025-05-07T19:43:48.2341892Z # 2025-05-07T19:43:48.2342679Z # To activate this environment, use 2025-05-07T19:43:48.2343014Z # 2025-05-07T19:43:48.2343231Z # $ conda activate build_binary 2025-05-07T19:43:48.2343720Z # 2025-05-07T19:43:48.2343954Z # To deactivate an active environment, use 2025-05-07T19:43:48.2344267Z # 2025-05-07T19:43:48.2344486Z # $ conda deactivate 2025-05-07T19:43:48.2344682Z 2025-05-07T19:43:48.3183503Z [SETUP] Upgrading PIP to latest ... 2025-05-07T19:43:48.3208788Z [EXEC] [ATTEMPT 0/3] + conda run -n build_binary pip install --upgrade pip 2025-05-07T19:43:51.2787158Z WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager, possibly rendering your system unusable. It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv. Use the --root-user-action option if you know what you are doing and want to suppress this warning. 2025-05-07T19:43:51.2789488Z Requirement already satisfied: pip in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (25.1) 2025-05-07T19:43:51.2790102Z Collecting pip 2025-05-07T19:43:51.2790411Z Downloading pip-25.1.1-py3-none-any.whl.metadata (3.6 kB) 2025-05-07T19:43:51.2791035Z Downloading pip-25.1.1-py3-none-any.whl (1.8 MB) 2025-05-07T19:43:51.2792076Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 1.8/1.8 MB 72.0 MB/s eta 0:00:00 2025-05-07T19:43:51.2792489Z Installing collected packages: pip 2025-05-07T19:43:51.2792784Z Attempting uninstall: pip 2025-05-07T19:43:51.2793079Z Found existing installation: pip 25.1 2025-05-07T19:43:51.2793386Z Uninstalling pip-25.1: 2025-05-07T19:43:51.2793689Z Successfully uninstalled pip-25.1 2025-05-07T19:43:51.2794014Z Successfully installed pip-25.1.1 2025-05-07T19:43:51.2794206Z 2025-05-07T19:43:51.3513635Z 2025-05-07T19:43:51.3513858Z [SETUP] Upgrading pyOpenSSL ... 2025-05-07T19:43:51.3547633Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y pyOpenSSL>22.1.0 2025-05-07T19:43:52.0116152Z Channels: 2025-05-07T19:43:52.0116789Z - conda-forge 2025-05-07T19:43:52.0117430Z Platform: linux-64 2025-05-07T19:44:01.6780323Z Collecting package metadata (repodata.json): - \ | / - \ | / - \ | / - \ | / - \ done 2025-05-07T19:44:03.5620548Z Solving environment: / - \ | / done 2025-05-07T19:44:03.6100686Z 2025-05-07T19:44:03.6101332Z ## Package Plan ## 2025-05-07T19:44:03.6101848Z 2025-05-07T19:44:03.6102467Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:44:03.6102845Z 2025-05-07T19:44:03.6102957Z added / updated specs: 2025-05-07T19:44:03.6103271Z - pyopenssl[version='>22.1.0'] 2025-05-07T19:44:03.6103508Z 2025-05-07T19:44:03.6103513Z 2025-05-07T19:44:03.6103656Z The following packages will be downloaded: 2025-05-07T19:44:03.6103895Z 2025-05-07T19:44:03.6104043Z package | build 2025-05-07T19:44:03.6104393Z ---------------------------|----------------- 2025-05-07T19:44:03.6104814Z cffi-1.17.1 | py313hfab6e84_0 289 KB conda-forge 2025-05-07T19:44:03.6105296Z cryptography-44.0.3 | py313h6556f6e_0 1.5 MB conda-forge 2025-05-07T19:44:03.6105791Z libgcc-15.1.0 | h767d61c_2 810 KB conda-forge 2025-05-07T19:44:03.6106231Z libgcc-ng-15.1.0 | h69a702a_2 34 KB conda-forge 2025-05-07T19:44:03.6107059Z libgomp-15.1.0 | h767d61c_2 442 KB conda-forge 2025-05-07T19:44:03.6107525Z openssl-3.5.0 | h7b32b05_1 3.0 MB conda-forge 2025-05-07T19:44:03.6107971Z pycparser-2.22 | pyh29332c3_1 108 KB conda-forge 2025-05-07T19:44:03.6108441Z pyopenssl-25.0.0 | pyhd8ed1ab_0 120 KB conda-forge 2025-05-07T19:44:03.6108923Z typing-extensions-4.13.2 | h0e9735f_0 88 KB conda-forge 2025-05-07T19:44:03.6109454Z typing_extensions-4.13.2 | pyh29332c3_0 51 KB conda-forge 2025-05-07T19:44:03.6109926Z ------------------------------------------------------------ 2025-05-07T19:44:03.6110289Z Total: 6.4 MB 2025-05-07T19:44:03.6110510Z 2025-05-07T19:44:03.6110658Z The following NEW packages will be INSTALLED: 2025-05-07T19:44:03.6110887Z 2025-05-07T19:44:03.6111103Z cffi conda-forge/linux-64::cffi-1.17.1-py313hfab6e84_0 2025-05-07T19:44:03.6111764Z cryptography conda-forge/linux-64::cryptography-44.0.3-py313h6556f6e_0 2025-05-07T19:44:03.6112316Z libgcc conda-forge/linux-64::libgcc-15.1.0-h767d61c_2 2025-05-07T19:44:03.6116285Z pycparser conda-forge/noarch::pycparser-2.22-pyh29332c3_1 2025-05-07T19:44:03.6116809Z pyopenssl conda-forge/noarch::pyopenssl-25.0.0-pyhd8ed1ab_0 2025-05-07T19:44:03.6117366Z typing-extensions conda-forge/noarch::typing-extensions-4.13.2-h0e9735f_0 2025-05-07T19:44:03.6118107Z typing_extensions conda-forge/noarch::typing_extensions-4.13.2-pyh29332c3_0 2025-05-07T19:44:03.6118459Z 2025-05-07T19:44:03.6118594Z The following packages will be UPDATED: 2025-05-07T19:44:03.6118799Z 2025-05-07T19:44:03.6119201Z ca-certificates pkgs/main/linux-64::ca-certificates-2~ --> conda-forge/noarch::ca-certificates-2025.4.26-hbd8a1cb_0 2025-05-07T19:44:03.6120013Z libgcc-ng pkgs/main::libgcc-ng-11.2.0-h1234567_1 --> conda-forge::libgcc-ng-15.1.0-h69a702a_2 2025-05-07T19:44:03.6120694Z libgomp pkgs/main::libgomp-11.2.0-h1234567_1 --> conda-forge::libgomp-15.1.0-h767d61c_2 2025-05-07T19:44:03.6121371Z openssl pkgs/main::openssl-3.0.16-h5eee18b_0 --> conda-forge::openssl-3.5.0-h7b32b05_1 2025-05-07T19:44:03.6121746Z 2025-05-07T19:44:03.6121769Z 2025-05-07T19:44:03.6121773Z 2025-05-07T19:44:03.6121917Z Downloading and Extracting Packages: ...working... 2025-05-07T19:44:03.6122327Z openssl-3.5.0 | 3.0 MB | | 0% 2025-05-07T19:44:03.6122577Z 2025-05-07T19:44:03.6126500Z cryptography-44.0.3 | 1.5 MB | | 0%  2025-05-07T19:44:03.6126877Z 2025-05-07T19:44:03.6127430Z 2025-05-07T19:44:03.6146594Z libgcc-15.1.0 | 810 KB | | 0%  2025-05-07T19:44:03.6146999Z 2025-05-07T19:44:03.6147110Z 2025-05-07T19:44:03.6147114Z 2025-05-07T19:44:03.6147898Z libgomp-15.1.0 | 442 KB | | 0%  2025-05-07T19:44:03.6148280Z 2025-05-07T19:44:03.6148285Z 2025-05-07T19:44:03.6148289Z 2025-05-07T19:44:03.6148303Z 2025-05-07T19:44:03.6160982Z cffi-1.17.1 | 289 KB | | 0%  2025-05-07T19:44:03.6161760Z 2025-05-07T19:44:03.6161772Z 2025-05-07T19:44:03.6161783Z 2025-05-07T19:44:03.6161821Z 2025-05-07T19:44:03.6162202Z 2025-05-07T19:44:03.6163645Z pyopenssl-25.0.0 | 120 KB | | 0%  2025-05-07T19:44:03.6164463Z 2025-05-07T19:44:03.6164501Z 2025-05-07T19:44:03.6164512Z 2025-05-07T19:44:03.6164522Z 2025-05-07T19:44:03.6164533Z 2025-05-07T19:44:03.6164543Z 2025-05-07T19:44:03.6165284Z pycparser-2.22 | 108 KB | | 0%  2025-05-07T19:44:03.6166095Z 2025-05-07T19:44:03.6166106Z 2025-05-07T19:44:03.6166117Z 2025-05-07T19:44:03.6166127Z 2025-05-07T19:44:03.6166138Z 2025-05-07T19:44:03.6166148Z 2025-05-07T19:44:03.6166159Z 2025-05-07T19:44:03.6167289Z typing-extensions-4. | 88 KB | | 0%  2025-05-07T19:44:03.6168195Z 2025-05-07T19:44:03.6168206Z 2025-05-07T19:44:03.6168217Z 2025-05-07T19:44:03.6168227Z 2025-05-07T19:44:03.6168237Z 2025-05-07T19:44:03.6168248Z 2025-05-07T19:44:03.6168271Z 2025-05-07T19:44:03.6168283Z 2025-05-07T19:44:03.6169093Z typing_extensions-4. | 51 KB | | 0%  2025-05-07T19:44:03.6169971Z 2025-05-07T19:44:03.6169982Z 2025-05-07T19:44:03.6169993Z 2025-05-07T19:44:03.6170003Z 2025-05-07T19:44:03.6170013Z 2025-05-07T19:44:03.6170023Z 2025-05-07T19:44:03.6170034Z 2025-05-07T19:44:03.6170045Z 2025-05-07T19:44:03.6170055Z 2025-05-07T19:44:03.6806255Z libgcc-ng-15.1.0 | 34 KB | | 0%  2025-05-07T19:44:03.6806803Z 2025-05-07T19:44:03.6806858Z 2025-05-07T19:44:03.6806863Z 2025-05-07T19:44:03.7071274Z libgomp-15.1.0 | 442 KB | ########## | 100%  2025-05-07T19:44:03.7071632Z 2025-05-07T19:44:03.7071637Z 2025-05-07T19:44:03.7106374Z libgcc-15.1.0 | 810 KB | ########## | 100%  2025-05-07T19:44:03.7113732Z openssl-3.5.0 | 3.0 MB | #####6 | 56% 2025-05-07T19:44:03.7114824Z 2025-05-07T19:44:03.7153968Z cryptography-44.0.3 | 1.5 MB | ##5 | 26%  2025-05-07T19:44:03.7154543Z 2025-05-07T19:44:03.7154548Z 2025-05-07T19:44:03.7154552Z 2025-05-07T19:44:03.7154556Z 2025-05-07T19:44:03.7260049Z cffi-1.17.1 | 289 KB | ##2 | 22%  2025-05-07T19:44:03.7260852Z 2025-05-07T19:44:03.7260867Z 2025-05-07T19:44:03.7260878Z 2025-05-07T19:44:03.7260889Z 2025-05-07T19:44:03.7292569Z cffi-1.17.1 | 289 KB | ########## | 100%  2025-05-07T19:44:03.7292858Z 2025-05-07T19:44:03.7292902Z 2025-05-07T19:44:03.7292906Z 2025-05-07T19:44:03.7292934Z 2025-05-07T19:44:03.7292938Z 2025-05-07T19:44:03.7341479Z pyopenssl-25.0.0 | 120 KB | #3 | 13%  2025-05-07T19:44:03.7341964Z 2025-05-07T19:44:03.7342052Z 2025-05-07T19:44:03.7342075Z 2025-05-07T19:44:03.7342080Z 2025-05-07T19:44:03.7342085Z 2025-05-07T19:44:03.7576898Z pyopenssl-25.0.0 | 120 KB | ########## | 100%  2025-05-07T19:44:03.7577499Z 2025-05-07T19:44:03.7577563Z 2025-05-07T19:44:03.7577608Z 2025-05-07T19:44:03.7580533Z libgomp-15.1.0 | 442 KB | ########## | 100%  2025-05-07T19:44:03.7580821Z 2025-05-07T19:44:03.7580824Z 2025-05-07T19:44:03.7580888Z 2025-05-07T19:44:03.7627520Z libgomp-15.1.0 | 442 KB | ########## | 100%  2025-05-07T19:44:03.7628369Z 2025-05-07T19:44:03.7628384Z 2025-05-07T19:44:03.7628395Z 2025-05-07T19:44:03.7628405Z 2025-05-07T19:44:03.7628416Z 2025-05-07T19:44:03.7628426Z 2025-05-07T19:44:03.7658402Z pycparser-2.22 | 108 KB | #4 | 15%  2025-05-07T19:44:03.7658733Z 2025-05-07T19:44:03.7658738Z 2025-05-07T19:44:03.7658742Z 2025-05-07T19:44:03.7658746Z 2025-05-07T19:44:03.7658750Z 2025-05-07T19:44:03.7658753Z 2025-05-07T19:44:03.7712623Z pycparser-2.22 | 108 KB | ########## | 100%  2025-05-07T19:44:03.7713000Z 2025-05-07T19:44:03.7786035Z cryptography-44.0.3 | 1.5 MB | ########## | 100%  2025-05-07T19:44:03.7786354Z 2025-05-07T19:44:03.7786371Z 2025-05-07T19:44:03.7786375Z 2025-05-07T19:44:03.7786379Z 2025-05-07T19:44:03.7786382Z 2025-05-07T19:44:03.7786386Z 2025-05-07T19:44:03.7786389Z 2025-05-07T19:44:03.7820284Z typing-extensions-4. | 88 KB | #8 | 18%  2025-05-07T19:44:03.7820627Z 2025-05-07T19:44:03.7820632Z 2025-05-07T19:44:03.7820636Z 2025-05-07T19:44:03.7820640Z 2025-05-07T19:44:03.7820643Z 2025-05-07T19:44:03.7820647Z 2025-05-07T19:44:03.7820650Z 2025-05-07T19:44:03.7827300Z typing-extensions-4. | 88 KB | ########## | 100%  2025-05-07T19:44:03.7827605Z 2025-05-07T19:44:03.7827609Z 2025-05-07T19:44:03.7827613Z 2025-05-07T19:44:03.7827616Z 2025-05-07T19:44:03.7827620Z 2025-05-07T19:44:03.7827634Z 2025-05-07T19:44:03.7827637Z 2025-05-07T19:44:03.7827840Z 2025-05-07T19:44:03.7834246Z typing_extensions-4. | 51 KB | ###1 | 31%  2025-05-07T19:44:03.7853749Z openssl-3.5.0 | 3.0 MB | ########## | 100% 2025-05-07T19:44:03.7854064Z 2025-05-07T19:44:03.7854174Z 2025-05-07T19:44:03.7854178Z 2025-05-07T19:44:03.7854268Z 2025-05-07T19:44:03.7854276Z 2025-05-07T19:44:03.7854281Z 2025-05-07T19:44:03.7854314Z 2025-05-07T19:44:03.7854318Z 2025-05-07T19:44:03.7980362Z typing_extensions-4. | 51 KB | ########## | 100%  2025-05-07T19:44:03.7980700Z 2025-05-07T19:44:03.7980705Z 2025-05-07T19:44:03.7980708Z 2025-05-07T19:44:03.7980712Z 2025-05-07T19:44:03.7980715Z 2025-05-07T19:44:03.8030000Z pyopenssl-25.0.0 | 120 KB | ########## | 100%  2025-05-07T19:44:03.8030312Z 2025-05-07T19:44:03.8030316Z 2025-05-07T19:44:03.8030320Z 2025-05-07T19:44:03.8030327Z 2025-05-07T19:44:03.8088299Z cffi-1.17.1 | 289 KB | ########## | 100%  2025-05-07T19:44:03.8088594Z 2025-05-07T19:44:03.8088788Z 2025-05-07T19:44:03.8089217Z libgcc-15.1.0 | 810 KB | ########## | 100%  2025-05-07T19:44:03.8089534Z 2025-05-07T19:44:03.8089538Z 2025-05-07T19:44:03.8236190Z libgcc-15.1.0 | 810 KB | ########## | 100%  2025-05-07T19:44:03.8236493Z 2025-05-07T19:44:03.8236498Z 2025-05-07T19:44:03.8236502Z 2025-05-07T19:44:03.8236505Z 2025-05-07T19:44:03.8236509Z 2025-05-07T19:44:03.8236512Z 2025-05-07T19:44:03.8236516Z 2025-05-07T19:44:03.8441380Z typing-extensions-4. | 88 KB | ########## | 100%  2025-05-07T19:44:03.8441742Z 2025-05-07T19:44:03.8441746Z 2025-05-07T19:44:03.8441750Z 2025-05-07T19:44:03.8441754Z 2025-05-07T19:44:03.8441757Z 2025-05-07T19:44:03.8441761Z 2025-05-07T19:44:03.8441764Z 2025-05-07T19:44:03.8441768Z 2025-05-07T19:44:03.8441772Z 2025-05-07T19:44:03.8459951Z libgcc-ng-15.1.0 | 34 KB | ####7 | 47%  2025-05-07T19:44:03.8460297Z 2025-05-07T19:44:03.8460314Z 2025-05-07T19:44:03.8460318Z 2025-05-07T19:44:03.8460322Z 2025-05-07T19:44:03.8460326Z 2025-05-07T19:44:03.8460330Z 2025-05-07T19:44:03.8460333Z 2025-05-07T19:44:03.8460337Z 2025-05-07T19:44:03.8460347Z 2025-05-07T19:44:03.9025991Z libgcc-ng-15.1.0 | 34 KB | ########## | 100%  2025-05-07T19:44:03.9026370Z 2025-05-07T19:44:03.9026415Z 2025-05-07T19:44:03.9026419Z 2025-05-07T19:44:03.9026423Z 2025-05-07T19:44:03.9026450Z 2025-05-07T19:44:03.9026454Z 2025-05-07T19:44:03.9026764Z pycparser-2.22 | 108 KB | ########## | 100%  2025-05-07T19:44:03.9027058Z 2025-05-07T19:44:03.9027061Z 2025-05-07T19:44:03.9027071Z 2025-05-07T19:44:03.9027074Z 2025-05-07T19:44:03.9027078Z 2025-05-07T19:44:03.9027082Z 2025-05-07T19:44:03.9157927Z pycparser-2.22 | 108 KB | ########## | 100%  2025-05-07T19:44:03.9158248Z 2025-05-07T19:44:03.9158252Z 2025-05-07T19:44:03.9158256Z 2025-05-07T19:44:03.9158272Z 2025-05-07T19:44:03.9158277Z 2025-05-07T19:44:03.9158280Z 2025-05-07T19:44:03.9158284Z 2025-05-07T19:44:03.9158296Z 2025-05-07T19:44:03.9161236Z typing_extensions-4. | 51 KB | ########## | 100%  2025-05-07T19:44:03.9161562Z 2025-05-07T19:44:03.9161566Z 2025-05-07T19:44:03.9161570Z 2025-05-07T19:44:03.9161574Z 2025-05-07T19:44:03.9161577Z 2025-05-07T19:44:03.9161580Z 2025-05-07T19:44:03.9161584Z 2025-05-07T19:44:03.9161594Z 2025-05-07T19:44:03.9508717Z typing_extensions-4. | 51 KB | ########## | 100%  2025-05-07T19:44:03.9509070Z 2025-05-07T19:44:03.9509075Z 2025-05-07T19:44:03.9509079Z 2025-05-07T19:44:03.9509082Z 2025-05-07T19:44:03.9509087Z 2025-05-07T19:44:03.9509090Z 2025-05-07T19:44:03.9509094Z 2025-05-07T19:44:03.9509097Z 2025-05-07T19:44:03.9509101Z 2025-05-07T19:44:03.9509358Z libgcc-ng-15.1.0 | 34 KB | ########## | 100%  2025-05-07T19:44:03.9509657Z 2025-05-07T19:44:03.9509661Z 2025-05-07T19:44:03.9509858Z 2025-05-07T19:44:03.9509863Z 2025-05-07T19:44:03.9509867Z 2025-05-07T19:44:03.9509870Z 2025-05-07T19:44:03.9509874Z 2025-05-07T19:44:03.9509877Z 2025-05-07T19:44:03.9509881Z 2025-05-07T19:44:03.9952604Z libgcc-ng-15.1.0 | 34 KB | ########## | 100%  2025-05-07T19:44:04.0133650Z openssl-3.5.0 | 3.0 MB | ########## | 100% 2025-05-07T19:44:04.0134001Z 2025-05-07T19:44:04.0134726Z cryptography-44.0.3 | 1.5 MB | ########## | 100%  2025-05-07T19:44:04.0135027Z 2025-05-07T19:44:04.0146123Z cryptography-44.0.3 | 1.5 MB | ########## | 100%  2025-05-07T19:44:04.0147180Z 2025-05-07T19:44:04.0147789Z 2025-05-07T19:44:04.0148251Z  2025-05-07T19:44:04.0148837Z 2025-05-07T19:44:04.0148849Z 2025-05-07T19:44:04.0149331Z  2025-05-07T19:44:04.0149937Z 2025-05-07T19:44:04.0149978Z 2025-05-07T19:44:04.0149990Z 2025-05-07T19:44:04.0150478Z  2025-05-07T19:44:04.0151107Z 2025-05-07T19:44:04.0151117Z 2025-05-07T19:44:04.0151689Z 2025-05-07T19:44:04.0151701Z 2025-05-07T19:44:04.0152229Z  2025-05-07T19:44:04.0152856Z 2025-05-07T19:44:04.0152866Z 2025-05-07T19:44:04.0152877Z 2025-05-07T19:44:04.0152887Z 2025-05-07T19:44:04.0152898Z 2025-05-07T19:44:04.0153412Z  2025-05-07T19:44:04.0153904Z 2025-05-07T19:44:04.0153908Z 2025-05-07T19:44:04.0153911Z 2025-05-07T19:44:04.0153915Z 2025-05-07T19:44:04.0153918Z 2025-05-07T19:44:04.0153922Z 2025-05-07T19:44:04.0154116Z  2025-05-07T19:44:04.0154336Z 2025-05-07T19:44:04.0154339Z 2025-05-07T19:44:04.0154343Z 2025-05-07T19:44:04.0154346Z 2025-05-07T19:44:04.0154355Z 2025-05-07T19:44:04.0154358Z 2025-05-07T19:44:04.0154362Z 2025-05-07T19:44:04.0154543Z  2025-05-07T19:44:04.0154778Z 2025-05-07T19:44:04.0154786Z 2025-05-07T19:44:04.0154790Z 2025-05-07T19:44:04.0154793Z 2025-05-07T19:44:04.0154796Z 2025-05-07T19:44:04.0154800Z 2025-05-07T19:44:04.0154803Z 2025-05-07T19:44:04.0154807Z 2025-05-07T19:44:04.0154988Z  2025-05-07T19:44:04.0155223Z 2025-05-07T19:44:04.0155227Z 2025-05-07T19:44:04.0155231Z 2025-05-07T19:44:04.0155234Z 2025-05-07T19:44:04.0155237Z 2025-05-07T19:44:04.0155241Z 2025-05-07T19:44:04.0155244Z 2025-05-07T19:44:04.0155248Z 2025-05-07T19:44:04.0155251Z 2025-05-07T19:44:04.0155443Z  done 2025-05-07T19:44:04.1153763Z Preparing transaction: \ done 2025-05-07T19:44:04.2164665Z Verifying transaction: / done 2025-05-07T19:44:05.6197078Z Executing transaction: \ | / - \ | / - \ | / - \ | done 2025-05-07T19:44:05.7192538Z [SETUP] Testing pyOpenSSL import ... 2025-05-07T19:44:07.3914830Z [CHECK] Python (sub-)package 'OpenSSL' found ... 2025-05-07T19:44:07.3922591Z [SETUP] Installing libxcrypt ... 2025-05-07T19:44:07.3950092Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y libxcrypt 2025-05-07T19:44:08.0506613Z Channels: 2025-05-07T19:44:08.0506969Z - conda-forge 2025-05-07T19:44:08.0507202Z Platform: linux-64 2025-05-07T19:44:11.1099510Z Collecting package metadata (repodata.json): - \ | / done 2025-05-07T19:44:11.5373512Z Solving environment: \ done 2025-05-07T19:44:11.5818987Z 2025-05-07T19:44:11.5819364Z ## Package Plan ## 2025-05-07T19:44:11.5819609Z 2025-05-07T19:44:11.5819836Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:44:11.5820164Z 2025-05-07T19:44:11.5820261Z added / updated specs: 2025-05-07T19:44:11.5820920Z - libxcrypt 2025-05-07T19:44:11.5821062Z 2025-05-07T19:44:11.5821067Z 2025-05-07T19:44:11.5821204Z The following packages will be downloaded: 2025-05-07T19:44:11.5821425Z 2025-05-07T19:44:11.5821554Z package | build 2025-05-07T19:44:11.5821890Z ---------------------------|----------------- 2025-05-07T19:44:11.5822267Z libxcrypt-4.4.36 | hd590300_1 98 KB conda-forge 2025-05-07T19:44:11.5822695Z ------------------------------------------------------------ 2025-05-07T19:44:11.5823038Z Total: 98 KB 2025-05-07T19:44:11.5823282Z 2025-05-07T19:44:11.5823409Z The following NEW packages will be INSTALLED: 2025-05-07T19:44:11.5823635Z 2025-05-07T19:44:11.5823884Z libxcrypt conda-forge/linux-64::libxcrypt-4.4.36-hd590300_1 2025-05-07T19:44:11.5824180Z 2025-05-07T19:44:11.5824184Z 2025-05-07T19:44:11.5824187Z 2025-05-07T19:44:11.5824332Z Downloading and Extracting Packages: ...working... 2025-05-07T19:44:11.7212982Z libxcrypt-4.4.36 | 98 KB | | 0% 2025-05-07T19:44:11.7233742Z libxcrypt-4.4.36 | 98 KB | #6 | 16% 2025-05-07T19:44:11.7346945Z libxcrypt-4.4.36 | 98 KB | ########## | 100% 2025-05-07T19:44:11.7350486Z libxcrypt-4.4.36 | 98 KB | ########## | 100% 2025-05-07T19:44:11.7350846Z 2025-05-07T19:44:11.7351125Z done 2025-05-07T19:44:11.8361500Z Preparing transaction: / done 2025-05-07T19:44:11.9371758Z Verifying transaction: \ done 2025-05-07T19:44:12.0383338Z Executing transaction: / done 2025-05-07T19:44:15.2954934Z [SETUP] Copying over ... 2025-05-07T19:44:15.2955700Z + cp /github/home/miniconda/envs/build_binary/include/crypt.h /github/home/miniconda/envs/build_binary/include/python3.13/crypt.h 2025-05-07T19:44:15.2956290Z 2025-05-07T19:44:15.2994486Z 2025-05-07T19:44:16.8770689Z [SETUP] Installed Python version: Python 3.13.2 2025-05-07T19:44:16.8771166Z [SETUP] Successfully created Conda environment: build_binary 2025-05-07T19:44:16.8836020Z ##[group]Run . $PRELUDE; install_cxx_compiler $BUILD_ENV gcc 2025-05-07T19:44:16.8836529Z . $PRELUDE; install_cxx_compiler $BUILD_ENV gcc 2025-05-07T19:44:16.8837079Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:44:16.8837411Z env: 2025-05-07T19:44:16.8837631Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:44:16.8837940Z BUILD_ENV: build_binary 2025-05-07T19:44:16.8838181Z BUILD_TARGET: default 2025-05-07T19:44:16.8838422Z BUILD_VARIANT: cuda 2025-05-07T19:44:16.8838653Z BUILD_CUDA_VERSION: 11.8.0 2025-05-07T19:44:16.8838911Z ##[endgroup] 2025-05-07T19:44:17.3217509Z ################################################################################ 2025-05-07T19:44:17.3217927Z # Install C/C++ Compilers 2025-05-07T19:44:17.3218192Z # 2025-05-07T19:44:17.3233387Z # [2025-05-07T19:44:17.322Z] + install_cxx_compiler build_binary gcc 2025-05-07T19:44:17.3233980Z ################################################################################ 2025-05-07T19:44:17.3234249Z 2025-05-07T19:44:17.3259349Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:44:17.4113227Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:44:17.4125416Z [INSTALL] Installing GLIBC (architecture = 64) ... 2025-05-07T19:44:17.4152614Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y sysroot_linux-64=2.17 2025-05-07T19:44:18.0826920Z Channels: 2025-05-07T19:44:18.0827632Z - conda-forge 2025-05-07T19:44:18.0828275Z Platform: linux-64 2025-05-07T19:44:21.1063251Z Collecting package metadata (repodata.json): - \ | / done 2025-05-07T19:44:21.5287121Z Solving environment: \ done 2025-05-07T19:44:21.5754382Z 2025-05-07T19:44:21.5754849Z ## Package Plan ## 2025-05-07T19:44:21.5755401Z 2025-05-07T19:44:21.5755980Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:44:21.5756881Z 2025-05-07T19:44:21.5757164Z added / updated specs: 2025-05-07T19:44:21.5757970Z - sysroot_linux-64=2.17 2025-05-07T19:44:21.5758464Z 2025-05-07T19:44:21.5758476Z 2025-05-07T19:44:21.5758887Z The following packages will be downloaded: 2025-05-07T19:44:21.5759530Z 2025-05-07T19:44:21.5759865Z package | build 2025-05-07T19:44:21.5760829Z ---------------------------|----------------- 2025-05-07T19:44:21.5762082Z kernel-headers_linux-64-3.10.0| he073ed8_18 921 KB conda-forge 2025-05-07T19:44:21.5763428Z sysroot_linux-64-2.17 | h0157908_18 14.5 MB conda-forge 2025-05-07T19:44:21.5763874Z ------------------------------------------------------------ 2025-05-07T19:44:21.5764382Z Total: 15.4 MB 2025-05-07T19:44:21.5764600Z 2025-05-07T19:44:21.5764766Z The following NEW packages will be INSTALLED: 2025-05-07T19:44:21.5765001Z 2025-05-07T19:44:21.5765423Z kernel-headers_li~ conda-forge/noarch::kernel-headers_linux-64-3.10.0-he073ed8_18 2025-05-07T19:44:21.5766020Z sysroot_linux-64 conda-forge/noarch::sysroot_linux-64-2.17-h0157908_18 2025-05-07T19:44:21.5766625Z 2025-05-07T19:44:21.5766630Z 2025-05-07T19:44:21.5766633Z 2025-05-07T19:44:21.5766808Z Downloading and Extracting Packages: ...working... 2025-05-07T19:44:21.5767196Z sysroot_linux-64-2.1 | 14.5 MB | | 0% 2025-05-07T19:44:21.5767429Z 2025-05-07T19:44:21.7696035Z kernel-headers_linux | 921 KB | | 0%  2025-05-07T19:44:21.7696856Z 2025-05-07T19:44:21.7774272Z kernel-headers_linux | 921 KB | 1 | 2%  2025-05-07T19:44:21.7775133Z 2025-05-07T19:44:21.8017468Z kernel-headers_linux | 921 KB | ########## | 100%  2025-05-07T19:44:21.9033080Z sysroot_linux-64-2.1 | 14.5 MB | | 0% 2025-05-07T19:44:21.9609572Z sysroot_linux-64-2.1 | 14.5 MB | #####5 | 55% 2025-05-07T19:44:21.9610358Z 2025-05-07T19:44:21.9611173Z kernel-headers_linux | 921 KB | ########## | 100%  2025-05-07T19:44:21.9611945Z 2025-05-07T19:44:21.9988765Z kernel-headers_linux | 921 KB | ########## | 100%  2025-05-07T19:44:22.4348141Z sysroot_linux-64-2.1 | 14.5 MB | ########## | 100% 2025-05-07T19:44:22.4349391Z sysroot_linux-64-2.1 | 14.5 MB | ########## | 100% 2025-05-07T19:44:22.4350906Z sysroot_linux-64-2.1 | 14.5 MB | ########## | 100% 2025-05-07T19:44:22.4352088Z 2025-05-07T19:44:22.4352687Z 2025-05-07T19:44:22.4353376Z  done 2025-05-07T19:44:22.5359154Z Preparing transaction: / done 2025-05-07T19:44:22.7373815Z Verifying transaction: \ | done 2025-05-07T19:44:22.8384076Z Executing transaction: - done 2025-05-07T19:44:22.9234053Z [CHECK] LD_LIBRARY_PATH = 2025-05-07T19:44:22.9234936Z [CHECK] CONDA_PREFIX is not set. 2025-05-07T19:44:24.5381704Z [CHECK] libstdc++.so.6 found in CONDA_PREFIX PATH (symbolic link): /github/home/miniconda/envs/build_binary/lib/libstdc++.so.6 2025-05-07T19:44:24.5397346Z [INSTALL] Installing GCC (11.4.0, 64) through Conda ... 2025-05-07T19:44:24.5424603Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y gxx_linux-64=11.4.0 2025-05-07T19:44:25.2397424Z Channels: 2025-05-07T19:44:25.2398088Z - conda-forge 2025-05-07T19:44:25.2398706Z Platform: linux-64 2025-05-07T19:44:28.2298189Z Collecting package metadata (repodata.json): - \ | / done 2025-05-07T19:44:29.3474978Z Solving environment: \ | / done 2025-05-07T19:44:29.3957503Z 2025-05-07T19:44:29.3958121Z ## Package Plan ## 2025-05-07T19:44:29.3958300Z 2025-05-07T19:44:29.3958792Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:44:29.3959106Z 2025-05-07T19:44:29.3959215Z added / updated specs: 2025-05-07T19:44:29.3959537Z - gxx_linux-64=11.4.0 2025-05-07T19:44:29.3959700Z 2025-05-07T19:44:29.3959704Z 2025-05-07T19:44:29.3959911Z The following packages will be downloaded: 2025-05-07T19:44:29.3960137Z 2025-05-07T19:44:29.3960278Z package | build 2025-05-07T19:44:29.3960626Z ---------------------------|----------------- 2025-05-07T19:44:29.3961056Z binutils_impl_linux-64-2.40| ha1999f0_7 6.0 MB conda-forge 2025-05-07T19:44:29.3961572Z binutils_linux-64-2.40 | hb3c18ed_4 28 KB conda-forge 2025-05-07T19:44:29.3962072Z gcc_impl_linux-64-11.4.0 | h00c12a0_13 53.0 MB conda-forge 2025-05-07T19:44:29.3962535Z gcc_linux-64-11.4.0 | ha077dfb_4 31 KB conda-forge 2025-05-07T19:44:29.3963011Z gxx_impl_linux-64-11.4.0 | h634f3ee_13 11.2 MB conda-forge 2025-05-07T19:44:29.3963575Z gxx_linux-64-11.4.0 | h35bfe5d_4 29 KB conda-forge 2025-05-07T19:44:29.3964026Z ld_impl_linux-64-2.40 | hf3520f5_7 691 KB conda-forge 2025-05-07T19:44:29.3964607Z libgcc-devel_linux-64-11.4.0| h8f596e0_113 2.3 MB conda-forge 2025-05-07T19:44:29.3965355Z libsanitizer-11.4.0 | h5763a12_13 3.5 MB conda-forge 2025-05-07T19:44:29.3965832Z libstdcxx-15.1.0 | h8f9b012_2 3.7 MB conda-forge 2025-05-07T19:44:29.3966623Z libstdcxx-devel_linux-64-11.4.0| h8f596e0_113 11.1 MB conda-forge 2025-05-07T19:44:29.3967146Z libstdcxx-ng-15.1.0 | h4852527_2 34 KB conda-forge 2025-05-07T19:44:29.3967570Z ------------------------------------------------------------ 2025-05-07T19:44:29.3967944Z Total: 91.6 MB 2025-05-07T19:44:29.3968166Z 2025-05-07T19:44:29.3968318Z The following NEW packages will be INSTALLED: 2025-05-07T19:44:29.3968546Z 2025-05-07T19:44:29.3968845Z binutils_impl_lin~ conda-forge/linux-64::binutils_impl_linux-64-2.40-ha1999f0_7 2025-05-07T19:44:29.3969451Z binutils_linux-64 conda-forge/linux-64::binutils_linux-64-2.40-hb3c18ed_4 2025-05-07T19:44:29.3970233Z gcc_impl_linux-64 conda-forge/linux-64::gcc_impl_linux-64-11.4.0-h00c12a0_13 2025-05-07T19:44:29.3970792Z gcc_linux-64 conda-forge/linux-64::gcc_linux-64-11.4.0-ha077dfb_4 2025-05-07T19:44:29.3971379Z gxx_impl_linux-64 conda-forge/linux-64::gxx_impl_linux-64-11.4.0-h634f3ee_13 2025-05-07T19:44:29.3971902Z gxx_linux-64 conda-forge/linux-64::gxx_linux-64-11.4.0-h35bfe5d_4 2025-05-07T19:44:29.3972470Z libgcc-devel_linu~ conda-forge/noarch::libgcc-devel_linux-64-11.4.0-h8f596e0_113 2025-05-07T19:44:29.3973051Z libsanitizer conda-forge/linux-64::libsanitizer-11.4.0-h5763a12_13 2025-05-07T19:44:29.3973575Z libstdcxx conda-forge/linux-64::libstdcxx-15.1.0-h8f9b012_2 2025-05-07T19:44:29.3974156Z libstdcxx-devel_l~ conda-forge/noarch::libstdcxx-devel_linux-64-11.4.0-h8f596e0_113 2025-05-07T19:44:29.3974540Z 2025-05-07T19:44:29.3974659Z The following packages will be UPDATED: 2025-05-07T19:44:29.3974894Z 2025-05-07T19:44:29.3975222Z ld_impl_linux-64 pkgs/main::ld_impl_linux-64-2.40-h12e~ --> conda-forge::ld_impl_linux-64-2.40-hf3520f5_7 2025-05-07T19:44:29.3976103Z libstdcxx-ng pkgs/main::libstdcxx-ng-11.2.0-h12345~ --> conda-forge::libstdcxx-ng-15.1.0-h4852527_2 2025-05-07T19:44:29.3976529Z 2025-05-07T19:44:29.3976533Z 2025-05-07T19:44:29.3976537Z 2025-05-07T19:44:29.3976682Z Downloading and Extracting Packages: ...working... 2025-05-07T19:44:29.3977074Z gcc_impl_linux-64-11 | 53.0 MB | | 0% 2025-05-07T19:44:29.3977421Z 2025-05-07T19:44:29.3977713Z gxx_impl_linux-64-11 | 11.2 MB | | 0%  2025-05-07T19:44:29.3978144Z 2025-05-07T19:44:29.3978147Z 2025-05-07T19:44:29.3978372Z libstdcxx-devel_linu | 11.1 MB | | 0%  2025-05-07T19:44:29.3978807Z 2025-05-07T19:44:29.3978810Z 2025-05-07T19:44:29.3978850Z 2025-05-07T19:44:29.3979089Z binutils_impl_linux- | 6.0 MB | | 0%  2025-05-07T19:44:29.3979361Z 2025-05-07T19:44:29.3979365Z 2025-05-07T19:44:29.3979369Z 2025-05-07T19:44:29.3985752Z 2025-05-07T19:44:29.4022596Z libstdcxx-15.1.0 | 3.7 MB | | 0%  2025-05-07T19:44:29.4023115Z 2025-05-07T19:44:29.4023122Z 2025-05-07T19:44:29.4023134Z 2025-05-07T19:44:29.4023138Z 2025-05-07T19:44:29.4023142Z 2025-05-07T19:44:29.4023434Z libsanitizer-11.4.0 | 3.5 MB | | 0%  2025-05-07T19:44:29.4023740Z 2025-05-07T19:44:29.4023743Z 2025-05-07T19:44:29.4023747Z 2025-05-07T19:44:29.4023751Z 2025-05-07T19:44:29.4023754Z 2025-05-07T19:44:29.4023758Z 2025-05-07T19:44:29.4024027Z libgcc-devel_linux-6 | 2.3 MB | | 0%  2025-05-07T19:44:29.4024344Z 2025-05-07T19:44:29.4024347Z 2025-05-07T19:44:29.4024351Z 2025-05-07T19:44:29.4024355Z 2025-05-07T19:44:29.4024359Z 2025-05-07T19:44:29.4024362Z 2025-05-07T19:44:29.4024366Z 2025-05-07T19:44:29.4024616Z ld_impl_linux-64-2.4 | 691 KB | | 0%  2025-05-07T19:44:29.4024918Z 2025-05-07T19:44:29.4024921Z 2025-05-07T19:44:29.4024925Z 2025-05-07T19:44:29.4024928Z 2025-05-07T19:44:29.4024932Z 2025-05-07T19:44:29.4024958Z 2025-05-07T19:44:29.4024963Z 2025-05-07T19:44:29.4024966Z 2025-05-07T19:44:29.4025516Z libstdcxx-ng-15.1.0 | 34 KB | | 0%  2025-05-07T19:44:29.4025839Z 2025-05-07T19:44:29.4025843Z 2025-05-07T19:44:29.4025847Z 2025-05-07T19:44:29.4025850Z 2025-05-07T19:44:29.4025854Z 2025-05-07T19:44:29.4025857Z 2025-05-07T19:44:29.4025861Z 2025-05-07T19:44:29.4025865Z 2025-05-07T19:44:29.4025868Z 2025-05-07T19:44:29.4026135Z gcc_linux-64-11.4.0 | 31 KB | | 0%  2025-05-07T19:44:29.4026426Z 2025-05-07T19:44:29.4026452Z 2025-05-07T19:44:29.4026455Z 2025-05-07T19:44:29.4026459Z 2025-05-07T19:44:29.4026462Z 2025-05-07T19:44:29.4026466Z 2025-05-07T19:44:29.4026469Z 2025-05-07T19:44:29.4026473Z 2025-05-07T19:44:29.4026476Z 2025-05-07T19:44:29.4026480Z 2025-05-07T19:44:29.4026747Z gxx_linux-64-11.4.0 | 29 KB | | 0%  2025-05-07T19:44:29.4027056Z 2025-05-07T19:44:29.4027182Z 2025-05-07T19:44:29.4027187Z 2025-05-07T19:44:29.4027191Z 2025-05-07T19:44:29.4027202Z 2025-05-07T19:44:29.4027206Z 2025-05-07T19:44:29.4027209Z 2025-05-07T19:44:29.4027212Z 2025-05-07T19:44:29.4027216Z 2025-05-07T19:44:29.4027219Z 2025-05-07T19:44:29.4027223Z 2025-05-07T19:44:29.4998835Z binutils_linux-64-2. | 28 KB | | 0%  2025-05-07T19:44:29.4999201Z 2025-05-07T19:44:29.4999515Z 2025-05-07T19:44:29.4999534Z 2025-05-07T19:44:29.5002012Z 2025-05-07T19:44:29.7001418Z libstdcxx-15.1.0 | 3.7 MB | #3 | 13%  2025-05-07T19:44:29.7002170Z 2025-05-07T19:44:29.7002197Z 2025-05-07T19:44:29.7002276Z 2025-05-07T19:44:29.7002290Z 2025-05-07T19:44:29.7041781Z libstdcxx-15.1.0 | 3.7 MB | ##6 | 26%  2025-05-07T19:44:29.7433259Z gcc_impl_linux-64-11 | 53.0 MB | | 0% 2025-05-07T19:44:29.7434029Z 2025-05-07T19:44:29.7434059Z 2025-05-07T19:44:29.7434071Z 2025-05-07T19:44:29.7434117Z 2025-05-07T19:44:29.7458622Z libstdcxx-15.1.0 | 3.7 MB | ########## | 100%  2025-05-07T19:44:29.7458960Z 2025-05-07T19:44:29.7458973Z 2025-05-07T19:44:29.7534256Z libstdcxx-devel_linu | 11.1 MB | | 0%  2025-05-07T19:44:29.7534561Z 2025-05-07T19:44:29.7824121Z gxx_impl_linux-64-11 | 11.2 MB | | 0%  2025-05-07T19:44:29.7824913Z 2025-05-07T19:44:29.7824926Z 2025-05-07T19:44:29.7824937Z 2025-05-07T19:44:29.7824949Z 2025-05-07T19:44:29.7824960Z 2025-05-07T19:44:29.8041442Z libsanitizer-11.4.0 | 3.5 MB | | 0%  2025-05-07T19:44:29.8305008Z gcc_impl_linux-64-11 | 53.0 MB | ##2 | 22% 2025-05-07T19:44:29.8305508Z 2025-05-07T19:44:29.8305547Z 2025-05-07T19:44:29.8305585Z 2025-05-07T19:44:29.8460610Z binutils_impl_linux- | 6.0 MB | | 0%  2025-05-07T19:44:29.8461247Z 2025-05-07T19:44:29.8461252Z 2025-05-07T19:44:29.8534945Z libstdcxx-devel_linu | 11.1 MB | #######1 | 72%  2025-05-07T19:44:29.8535786Z 2025-05-07T19:44:29.8909946Z gxx_impl_linux-64-11 | 11.2 MB | ###9 | 40%  2025-05-07T19:44:29.8910817Z 2025-05-07T19:44:29.8910831Z 2025-05-07T19:44:29.8910842Z 2025-05-07T19:44:29.8910852Z 2025-05-07T19:44:29.8910863Z 2025-05-07T19:44:29.8911829Z libsanitizer-11.4.0 | 3.5 MB | ########## | 100%  2025-05-07T19:44:29.8912682Z 2025-05-07T19:44:29.8912692Z 2025-05-07T19:44:29.8912703Z 2025-05-07T19:44:29.8912870Z 2025-05-07T19:44:29.8912874Z 2025-05-07T19:44:29.9119096Z libsanitizer-11.4.0 | 3.5 MB | ########## | 100%  2025-05-07T19:44:29.9304419Z gcc_impl_linux-64-11 | 53.0 MB | ###5 | 36% 2025-05-07T19:44:29.9304825Z 2025-05-07T19:44:29.9304920Z 2025-05-07T19:44:29.9304925Z 2025-05-07T19:44:29.9364416Z binutils_impl_linux- | 6.0 MB | #########3 | 93%  2025-05-07T19:44:29.9365272Z 2025-05-07T19:44:29.9365288Z 2025-05-07T19:44:29.9365300Z 2025-05-07T19:44:29.9365311Z 2025-05-07T19:44:29.9365355Z 2025-05-07T19:44:29.9365380Z 2025-05-07T19:44:29.9535786Z libgcc-devel_linux-6 | 2.3 MB | | 1%  2025-05-07T19:44:29.9536360Z 2025-05-07T19:44:29.9730443Z gxx_impl_linux-64-11 | 11.2 MB | ########9 | 89%  2025-05-07T19:44:29.9730746Z 2025-05-07T19:44:29.9730887Z 2025-05-07T19:44:29.9730891Z 2025-05-07T19:44:29.9730895Z 2025-05-07T19:44:29.9736394Z libstdcxx-15.1.0 | 3.7 MB | ########## | 100%  2025-05-07T19:44:29.9737270Z 2025-05-07T19:44:29.9737284Z 2025-05-07T19:44:29.9737295Z 2025-05-07T19:44:29.9737322Z 2025-05-07T19:44:29.9954112Z libstdcxx-15.1.0 | 3.7 MB | ########## | 100%  2025-05-07T19:44:29.9954437Z 2025-05-07T19:44:29.9954442Z 2025-05-07T19:44:29.9954446Z 2025-05-07T19:44:29.9954450Z 2025-05-07T19:44:29.9954454Z 2025-05-07T19:44:29.9954457Z 2025-05-07T19:44:30.0104503Z libgcc-devel_linux-6 | 2.3 MB | ########## | 100%  2025-05-07T19:44:30.0104846Z 2025-05-07T19:44:30.0105070Z 2025-05-07T19:44:30.0105075Z 2025-05-07T19:44:30.0121171Z binutils_impl_linux- | 6.0 MB | ########## | 100%  2025-05-07T19:44:30.0367768Z gcc_impl_linux-64-11 | 53.0 MB | #####2 | 53% 2025-05-07T19:44:30.0368241Z 2025-05-07T19:44:30.0368306Z 2025-05-07T19:44:30.0368312Z 2025-05-07T19:44:30.0368316Z 2025-05-07T19:44:30.0368320Z 2025-05-07T19:44:30.0368323Z 2025-05-07T19:44:30.0368327Z 2025-05-07T19:44:30.0460490Z ld_impl_linux-64-2.4 | 691 KB | 2 | 2%  2025-05-07T19:44:30.0461404Z 2025-05-07T19:44:30.0461419Z 2025-05-07T19:44:30.0518140Z libstdcxx-devel_linu | 11.1 MB | ########## | 100%  2025-05-07T19:44:30.0518443Z 2025-05-07T19:44:30.0518447Z 2025-05-07T19:44:30.0518451Z 2025-05-07T19:44:30.0518468Z 2025-05-07T19:44:30.0518472Z 2025-05-07T19:44:30.0518475Z 2025-05-07T19:44:30.0518480Z 2025-05-07T19:44:30.0518483Z 2025-05-07T19:44:30.0540505Z libstdcxx-ng-15.1.0 | 34 KB | ####7 | 47%  2025-05-07T19:44:30.0540837Z 2025-05-07T19:44:30.0540842Z 2025-05-07T19:44:30.0540846Z 2025-05-07T19:44:30.0540870Z 2025-05-07T19:44:30.0540874Z 2025-05-07T19:44:30.0540877Z 2025-05-07T19:44:30.0540881Z 2025-05-07T19:44:30.0541773Z ld_impl_linux-64-2.4 | 691 KB | ########## | 100%  2025-05-07T19:44:30.0542061Z 2025-05-07T19:44:30.0542073Z 2025-05-07T19:44:30.0542077Z 2025-05-07T19:44:30.0542081Z 2025-05-07T19:44:30.0542098Z 2025-05-07T19:44:30.0542101Z 2025-05-07T19:44:30.0542105Z 2025-05-07T19:44:30.0542108Z 2025-05-07T19:44:30.0873457Z libstdcxx-ng-15.1.0 | 34 KB | ########## | 100%  2025-05-07T19:44:30.0873794Z 2025-05-07T19:44:30.0873800Z 2025-05-07T19:44:30.0873803Z 2025-05-07T19:44:30.0873807Z 2025-05-07T19:44:30.0873826Z 2025-05-07T19:44:30.0873829Z 2025-05-07T19:44:30.0873833Z 2025-05-07T19:44:30.0873836Z 2025-05-07T19:44:30.0873840Z 2025-05-07T19:44:30.0880364Z gcc_linux-64-11.4.0 | 31 KB | #####2 | 52%  2025-05-07T19:44:30.0880655Z 2025-05-07T19:44:30.0880659Z 2025-05-07T19:44:30.0880678Z 2025-05-07T19:44:30.0880694Z 2025-05-07T19:44:30.0880698Z 2025-05-07T19:44:30.0880702Z 2025-05-07T19:44:30.0880705Z 2025-05-07T19:44:30.0880709Z 2025-05-07T19:44:30.0880712Z 2025-05-07T19:44:30.0880715Z 2025-05-07T19:44:30.0881057Z 2025-05-07T19:44:30.0894643Z binutils_linux-64-2. | 28 KB | #####6 | 56%  2025-05-07T19:44:30.0894993Z 2025-05-07T19:44:30.0894997Z 2025-05-07T19:44:30.0895001Z 2025-05-07T19:44:30.0895005Z 2025-05-07T19:44:30.0895008Z 2025-05-07T19:44:30.0895011Z 2025-05-07T19:44:30.0895015Z 2025-05-07T19:44:30.0895019Z 2025-05-07T19:44:30.0897220Z 2025-05-07T19:44:30.0902507Z gcc_linux-64-11.4.0 | 31 KB | ########## | 100%  2025-05-07T19:44:30.0902848Z 2025-05-07T19:44:30.0902852Z 2025-05-07T19:44:30.0902856Z 2025-05-07T19:44:30.0902861Z 2025-05-07T19:44:30.0902865Z 2025-05-07T19:44:30.0902887Z 2025-05-07T19:44:30.0902891Z 2025-05-07T19:44:30.0902894Z 2025-05-07T19:44:30.0902898Z 2025-05-07T19:44:30.0903076Z 2025-05-07T19:44:30.0903411Z 2025-05-07T19:44:30.0919011Z binutils_linux-64-2. | 28 KB | ########## | 100%  2025-05-07T19:44:30.0919370Z 2025-05-07T19:44:30.0919383Z 2025-05-07T19:44:30.0919387Z 2025-05-07T19:44:30.0919391Z 2025-05-07T19:44:30.0919394Z 2025-05-07T19:44:30.0919398Z 2025-05-07T19:44:30.0919401Z 2025-05-07T19:44:30.0919405Z 2025-05-07T19:44:30.0919409Z 2025-05-07T19:44:30.0919412Z 2025-05-07T19:44:30.0934574Z gxx_linux-64-11.4.0 | 29 KB | #####5 | 55%  2025-05-07T19:44:30.0934944Z 2025-05-07T19:44:30.0934949Z 2025-05-07T19:44:30.0934954Z 2025-05-07T19:44:30.0934959Z 2025-05-07T19:44:30.0934963Z 2025-05-07T19:44:30.0934968Z 2025-05-07T19:44:30.0934971Z 2025-05-07T19:44:30.0934975Z 2025-05-07T19:44:30.0934978Z 2025-05-07T19:44:30.0934982Z 2025-05-07T19:44:30.1025786Z gxx_linux-64-11.4.0 | 29 KB | ########## | 100%  2025-05-07T19:44:30.1026122Z 2025-05-07T19:44:30.1123755Z gxx_impl_linux-64-11 | 11.2 MB | ########## | 100%  2025-05-07T19:44:30.1182086Z gcc_impl_linux-64-11 | 53.0 MB | ######9 | 70% 2025-05-07T19:44:30.1182575Z 2025-05-07T19:44:30.1182625Z 2025-05-07T19:44:30.1182632Z 2025-05-07T19:44:30.1182677Z 2025-05-07T19:44:30.1182683Z 2025-05-07T19:44:30.1182689Z 2025-05-07T19:44:30.1183052Z libgcc-devel_linux-6 | 2.3 MB | ########## | 100%  2025-05-07T19:44:30.1183356Z 2025-05-07T19:44:30.1183361Z 2025-05-07T19:44:30.1183372Z 2025-05-07T19:44:30.1183375Z 2025-05-07T19:44:30.1183395Z 2025-05-07T19:44:30.1183398Z 2025-05-07T19:44:30.1259912Z libgcc-devel_linux-6 | 2.3 MB | ########## | 100%  2025-05-07T19:44:30.1260249Z 2025-05-07T19:44:30.1260254Z 2025-05-07T19:44:30.1260258Z 2025-05-07T19:44:30.1260261Z 2025-05-07T19:44:30.1260265Z 2025-05-07T19:44:30.1736108Z libsanitizer-11.4.0 | 3.5 MB | ########## | 100%  2025-05-07T19:44:30.1736471Z 2025-05-07T19:44:30.1736491Z 2025-05-07T19:44:30.1736495Z 2025-05-07T19:44:30.1736498Z 2025-05-07T19:44:30.1736502Z 2025-05-07T19:44:30.1736524Z 2025-05-07T19:44:30.1736528Z 2025-05-07T19:44:30.1736824Z ld_impl_linux-64-2.4 | 691 KB | ########## | 100%  2025-05-07T19:44:30.1737149Z 2025-05-07T19:44:30.1737153Z 2025-05-07T19:44:30.1737157Z 2025-05-07T19:44:30.1737160Z 2025-05-07T19:44:30.1737164Z 2025-05-07T19:44:30.1737167Z 2025-05-07T19:44:30.1737171Z 2025-05-07T19:44:30.2125331Z ld_impl_linux-64-2.4 | 691 KB | ########## | 100%  2025-05-07T19:44:30.2170088Z gcc_impl_linux-64-11 | 53.0 MB | ########7 | 87% 2025-05-07T19:44:30.2170375Z 2025-05-07T19:44:30.2170472Z 2025-05-07T19:44:30.2170476Z 2025-05-07T19:44:30.2170658Z 2025-05-07T19:44:30.2170662Z 2025-05-07T19:44:30.2170667Z 2025-05-07T19:44:30.2170689Z 2025-05-07T19:44:30.2170710Z 2025-05-07T19:44:30.2171546Z libstdcxx-ng-15.1.0 | 34 KB | ########## | 100%  2025-05-07T19:44:30.2171898Z 2025-05-07T19:44:30.2171903Z 2025-05-07T19:44:30.2171907Z 2025-05-07T19:44:30.2171910Z 2025-05-07T19:44:30.2171914Z 2025-05-07T19:44:30.2171918Z 2025-05-07T19:44:30.2171921Z 2025-05-07T19:44:30.2171925Z 2025-05-07T19:44:30.2545256Z libstdcxx-ng-15.1.0 | 34 KB | ########## | 100%  2025-05-07T19:44:30.2545593Z 2025-05-07T19:44:30.2545598Z 2025-05-07T19:44:30.2545601Z 2025-05-07T19:44:30.2545606Z 2025-05-07T19:44:30.2545609Z 2025-05-07T19:44:30.2545613Z 2025-05-07T19:44:30.2545616Z 2025-05-07T19:44:30.2545620Z 2025-05-07T19:44:30.2545623Z 2025-05-07T19:44:30.2546400Z gcc_linux-64-11.4.0 | 31 KB | ########## | 100%  2025-05-07T19:44:30.2546761Z 2025-05-07T19:44:30.2546767Z 2025-05-07T19:44:30.2546772Z 2025-05-07T19:44:30.2546777Z 2025-05-07T19:44:30.2546782Z 2025-05-07T19:44:30.2546787Z 2025-05-07T19:44:30.2546822Z 2025-05-07T19:44:30.2546827Z 2025-05-07T19:44:30.2546831Z 2025-05-07T19:44:30.2891401Z gcc_linux-64-11.4.0 | 31 KB | ########## | 100%  2025-05-07T19:44:30.2892019Z 2025-05-07T19:44:30.2892096Z 2025-05-07T19:44:30.2892100Z 2025-05-07T19:44:30.2892103Z 2025-05-07T19:44:30.2892107Z 2025-05-07T19:44:30.2892206Z 2025-05-07T19:44:30.2892223Z 2025-05-07T19:44:30.2892231Z 2025-05-07T19:44:30.2892237Z 2025-05-07T19:44:30.2892243Z 2025-05-07T19:44:30.2892251Z 2025-05-07T19:44:30.2892975Z binutils_linux-64-2. | 28 KB | ########## | 100%  2025-05-07T19:44:30.2893313Z 2025-05-07T19:44:30.2893320Z 2025-05-07T19:44:30.2893327Z 2025-05-07T19:44:30.2893334Z 2025-05-07T19:44:30.2893339Z 2025-05-07T19:44:30.2893346Z 2025-05-07T19:44:30.2893351Z 2025-05-07T19:44:30.2893355Z 2025-05-07T19:44:30.2893359Z 2025-05-07T19:44:30.2893362Z 2025-05-07T19:44:30.2893369Z 2025-05-07T19:44:30.3211583Z binutils_linux-64-2. | 28 KB | ########## | 100%  2025-05-07T19:44:30.3211946Z 2025-05-07T19:44:30.3211951Z 2025-05-07T19:44:30.3211977Z 2025-05-07T19:44:30.3211981Z 2025-05-07T19:44:30.3211984Z 2025-05-07T19:44:30.3211988Z 2025-05-07T19:44:30.3211991Z 2025-05-07T19:44:30.3211995Z 2025-05-07T19:44:30.3211999Z 2025-05-07T19:44:30.3212002Z 2025-05-07T19:44:30.3212291Z gxx_linux-64-11.4.0 | 29 KB | ########## | 100%  2025-05-07T19:44:30.3212577Z 2025-05-07T19:44:30.3212581Z 2025-05-07T19:44:30.3212584Z 2025-05-07T19:44:30.3212588Z 2025-05-07T19:44:30.3212591Z 2025-05-07T19:44:30.3212595Z 2025-05-07T19:44:30.3212598Z 2025-05-07T19:44:30.3212602Z 2025-05-07T19:44:30.3212605Z 2025-05-07T19:44:30.3212608Z 2025-05-07T19:44:30.4264490Z gxx_linux-64-11.4.0 | 29 KB | ########## | 100%  2025-05-07T19:44:30.4264818Z 2025-05-07T19:44:30.4264823Z 2025-05-07T19:44:30.4264827Z 2025-05-07T19:44:30.5425041Z binutils_impl_linux- | 6.0 MB | ########## | 100%  2025-05-07T19:44:30.5425349Z 2025-05-07T19:44:30.5901917Z gxx_impl_linux-64-11 | 11.2 MB | ########## | 100%  2025-05-07T19:44:30.6799445Z gcc_impl_linux-64-11 | 53.0 MB | ########## | 100% 2025-05-07T19:44:30.6800190Z 2025-05-07T19:44:30.6800205Z 2025-05-07T19:44:31.1678694Z libstdcxx-devel_linu | 11.1 MB | ########## | 100%  2025-05-07T19:44:31.1686707Z gcc_impl_linux-64-11 | 53.0 MB | ########## | 100% 2025-05-07T19:44:31.1687806Z 2025-05-07T19:44:31.1688415Z 2025-05-07T19:44:31.1689043Z  2025-05-07T19:44:31.1689649Z 2025-05-07T19:44:31.1689661Z 2025-05-07T19:44:31.1690166Z  2025-05-07T19:44:31.1691246Z 2025-05-07T19:44:31.1691257Z 2025-05-07T19:44:31.1691268Z 2025-05-07T19:44:31.1691831Z  2025-05-07T19:44:31.1692454Z 2025-05-07T19:44:31.1692499Z 2025-05-07T19:44:31.1692510Z 2025-05-07T19:44:31.1692521Z 2025-05-07T19:44:31.1693047Z  2025-05-07T19:44:31.1693700Z 2025-05-07T19:44:31.1693711Z 2025-05-07T19:44:31.1693721Z 2025-05-07T19:44:31.1693732Z 2025-05-07T19:44:31.1693742Z 2025-05-07T19:44:31.1694243Z  2025-05-07T19:44:31.1694874Z 2025-05-07T19:44:31.1695031Z 2025-05-07T19:44:31.1695034Z 2025-05-07T19:44:31.1695058Z 2025-05-07T19:44:31.1695062Z 2025-05-07T19:44:31.1695065Z 2025-05-07T19:44:31.1695252Z  2025-05-07T19:44:31.1695477Z 2025-05-07T19:44:31.1695480Z 2025-05-07T19:44:31.1695484Z 2025-05-07T19:44:31.1695488Z 2025-05-07T19:44:31.1695491Z 2025-05-07T19:44:31.1695495Z 2025-05-07T19:44:31.1695499Z 2025-05-07T19:44:31.1695699Z  2025-05-07T19:44:31.1695935Z 2025-05-07T19:44:31.1695938Z 2025-05-07T19:44:31.1695942Z 2025-05-07T19:44:31.1696211Z 2025-05-07T19:44:31.1696214Z 2025-05-07T19:44:31.1696259Z 2025-05-07T19:44:31.1696280Z 2025-05-07T19:44:31.1696284Z 2025-05-07T19:44:31.1696482Z  2025-05-07T19:44:31.1696709Z 2025-05-07T19:44:31.1696713Z 2025-05-07T19:44:31.1696716Z 2025-05-07T19:44:31.1696720Z 2025-05-07T19:44:31.1696724Z 2025-05-07T19:44:31.1696728Z 2025-05-07T19:44:31.1696732Z 2025-05-07T19:44:31.1696735Z 2025-05-07T19:44:31.1696756Z 2025-05-07T19:44:31.1696953Z  2025-05-07T19:44:31.1697184Z 2025-05-07T19:44:31.1697188Z 2025-05-07T19:44:31.1697191Z 2025-05-07T19:44:31.1697195Z 2025-05-07T19:44:31.1697198Z 2025-05-07T19:44:31.1697202Z 2025-05-07T19:44:31.1697206Z 2025-05-07T19:44:31.1697210Z 2025-05-07T19:44:31.1697213Z 2025-05-07T19:44:31.1697344Z 2025-05-07T19:44:31.1697544Z  2025-05-07T19:44:31.1697786Z 2025-05-07T19:44:31.1697789Z 2025-05-07T19:44:31.1697793Z 2025-05-07T19:44:31.1697797Z 2025-05-07T19:44:31.1697800Z 2025-05-07T19:44:31.1697804Z 2025-05-07T19:44:31.1697807Z 2025-05-07T19:44:31.1697811Z 2025-05-07T19:44:31.1697815Z 2025-05-07T19:44:31.1697836Z 2025-05-07T19:44:31.1697840Z 2025-05-07T19:44:31.1698057Z  done 2025-05-07T19:44:31.2705258Z Preparing transaction: \ done 2025-05-07T19:44:31.5721576Z Verifying transaction: / - \ done 2025-05-07T19:44:31.6737603Z Executing transaction: / done 2025-05-07T19:44:31.7608191Z [INSTALL] Setting the C/C++ compiler symlinks ... 2025-05-07T19:44:35.4660573Z + ln -sf /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-cc /github/home/miniconda/envs/build_binary/bin/cc 2025-05-07T19:44:35.4662299Z 2025-05-07T19:44:35.4678722Z 2025-05-07T19:44:35.4699606Z + ln -sf /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-cc /github/home/miniconda/envs/build_binary/bin/gcc 2025-05-07T19:44:35.4701324Z 2025-05-07T19:44:35.4713523Z 2025-05-07T19:44:35.4739605Z + ln -sf /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ /github/home/miniconda/envs/build_binary/bin/c++ 2025-05-07T19:44:35.4741372Z 2025-05-07T19:44:35.4755727Z 2025-05-07T19:44:35.4779667Z + ln -sf /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ /github/home/miniconda/envs/build_binary/bin/g++ 2025-05-07T19:44:35.4780287Z 2025-05-07T19:44:35.4793107Z 2025-05-07T19:44:37.2594010Z /github/home/miniconda/envs/build_binary/bin/cc 2025-05-07T19:44:37.2594772Z 2025-05-07T19:44:37.3334707Z [CHECK] Binary cc found in PATH 2025-05-07T19:44:39.1023682Z /github/home/miniconda/envs/build_binary/bin/gcc 2025-05-07T19:44:39.1023986Z 2025-05-07T19:44:39.1587161Z [CHECK] Binary gcc found in PATH 2025-05-07T19:44:40.9564696Z /github/home/miniconda/envs/build_binary/bin/c++ 2025-05-07T19:44:40.9565036Z 2025-05-07T19:44:41.0386724Z [CHECK] Binary c++ found in PATH 2025-05-07T19:44:42.8118572Z /github/home/miniconda/envs/build_binary/bin/g++ 2025-05-07T19:44:42.8119381Z 2025-05-07T19:44:42.8852207Z [CHECK] Binary g++ found in PATH 2025-05-07T19:44:42.8853901Z [INFO] Printing out all preprocessor defines in the C compiler ... 2025-05-07T19:44:42.8854855Z + conda run -n build_binary cc -dM -E - 2025-05-07T19:44:42.8855154Z 2025-05-07T19:44:44.6754009Z #define __DBL_MIN_EXP__ (-1021) 2025-05-07T19:44:44.6754973Z #define __UINT_LEAST16_MAX__ 0xffff 2025-05-07T19:44:44.6756100Z #define __ATOMIC_ACQUIRE 2 2025-05-07T19:44:44.6756869Z #define __FLT128_MAX_10_EXP__ 4932 2025-05-07T19:44:44.6757833Z #define __FLT_MIN__ 1.17549435082228750796873653722224568e-38F 2025-05-07T19:44:44.6758891Z #define __GCC_IEC_559_COMPLEX 2 2025-05-07T19:44:44.6759705Z #define __UINT_LEAST8_TYPE__ unsigned char 2025-05-07T19:44:44.6760648Z #define __SIZEOF_FLOAT80__ 16 2025-05-07T19:44:44.6761396Z #define __INTMAX_C(c) c ## L 2025-05-07T19:44:44.6762709Z #define __CHAR_BIT__ 8 2025-05-07T19:44:44.6763366Z #define __UINT8_MAX__ 0xff 2025-05-07T19:44:44.6764085Z #define __SCHAR_WIDTH__ 8 2025-05-07T19:44:44.6764790Z #define __WINT_MAX__ 0xffffffffU 2025-05-07T19:44:44.6765578Z #define __FLT32_MIN_EXP__ (-125) 2025-05-07T19:44:44.6766222Z #define __ORDER_LITTLE_ENDIAN__ 1234 2025-05-07T19:44:44.6766520Z #define __SIZE_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:44.6766837Z #define __WCHAR_MAX__ 0x7fffffff 2025-05-07T19:44:44.6767124Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_1 1 2025-05-07T19:44:44.6767464Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_2 1 2025-05-07T19:44:44.6767787Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_4 1 2025-05-07T19:44:44.6768213Z #define __DBL_DENORM_MIN__ ((double)4.94065645841246544176568792868221372e-324L) 2025-05-07T19:44:44.6768637Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_8 1 2025-05-07T19:44:44.6769123Z #define __GCC_ATOMIC_CHAR_LOCK_FREE 2 2025-05-07T19:44:44.6769430Z #define __GCC_IEC_559 2 2025-05-07T19:44:44.6769683Z #define __FLT32X_DECIMAL_DIG__ 17 2025-05-07T19:44:44.6769975Z #define __FLT_EVAL_METHOD__ 0 2025-05-07T19:44:44.6770236Z #define __FLT64_DECIMAL_DIG__ 17 2025-05-07T19:44:44.6770531Z #define __GCC_ATOMIC_CHAR32_T_LOCK_FREE 2 2025-05-07T19:44:44.6770865Z #define __UINT_FAST64_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:44.6771212Z #define __SIG_ATOMIC_TYPE__ int 2025-05-07T19:44:44.6771483Z #define __DBL_MIN_10_EXP__ (-307) 2025-05-07T19:44:44.6771781Z #define __FINITE_MATH_ONLY__ 0 2025-05-07T19:44:44.6772049Z #define __FLT32X_MAX_EXP__ 1024 2025-05-07T19:44:44.6772338Z #define __FLT32_HAS_DENORM__ 1 2025-05-07T19:44:44.6772613Z #define __UINT_FAST8_MAX__ 0xff 2025-05-07T19:44:44.6772870Z #define __FLT32_MAX_10_EXP__ 38 2025-05-07T19:44:44.6773146Z #define __DEC64_MAX_EXP__ 385 2025-05-07T19:44:44.6773396Z #define __INT8_C(c) c 2025-05-07T19:44:44.6773643Z #define __INT_LEAST8_WIDTH__ 8 2025-05-07T19:44:44.6773939Z #define __UINT_LEAST64_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:44.6774274Z #define __SHRT_MAX__ 0x7fff 2025-05-07T19:44:44.6774589Z #define __LDBL_MAX__ 1.18973149535723176502126385303097021e+4932L 2025-05-07T19:44:44.6774965Z #define __FLT64X_MAX_10_EXP__ 4932 2025-05-07T19:44:44.6775378Z #define __LDBL_IS_IEC_60559__ 2 2025-05-07T19:44:44.6775631Z #define __FLT64X_HAS_QUIET_NAN__ 1 2025-05-07T19:44:44.6775913Z #define __UINT_LEAST8_MAX__ 0xff 2025-05-07T19:44:44.6776176Z #define __GCC_ATOMIC_BOOL_LOCK_FREE 2 2025-05-07T19:44:44.6776567Z #define __FLT128_DENORM_MIN__ 6.47517511943802511092443895822764655e-4966F128 2025-05-07T19:44:44.6776966Z #define __UINTMAX_TYPE__ long unsigned int 2025-05-07T19:44:44.6777257Z #define __linux 1 2025-05-07T19:44:44.6777470Z #define __DEC32_EPSILON__ 1E-6DF 2025-05-07T19:44:44.6777750Z #define __FLT_EVAL_METHOD_TS_18661_3__ 0 2025-05-07T19:44:44.6778019Z #define __unix 1 2025-05-07T19:44:44.6778248Z #define __UINT32_MAX__ 0xffffffffU 2025-05-07T19:44:44.6778539Z #define __FLT128_MIN_EXP__ (-16381) 2025-05-07T19:44:44.6778800Z #define __WINT_MIN__ 0U 2025-05-07T19:44:44.6779054Z #define __FLT128_MIN_10_EXP__ (-4931) 2025-05-07T19:44:44.6779329Z #define __FLT32X_IS_IEC_60559__ 2 2025-05-07T19:44:44.6779605Z #define __INT_LEAST16_WIDTH__ 16 2025-05-07T19:44:44.6779856Z #define __SCHAR_MAX__ 0x7f 2025-05-07T19:44:44.6780113Z #define __FLT128_MANT_DIG__ 113 2025-05-07T19:44:44.6780385Z #define __WCHAR_MIN__ (-__WCHAR_MAX__ - 1) 2025-05-07T19:44:44.6780691Z #define __INT64_C(c) c ## L 2025-05-07T19:44:44.6780944Z #define __GCC_ATOMIC_POINTER_LOCK_FREE 2 2025-05-07T19:44:44.6781248Z #define __FLT32X_MANT_DIG__ 53 2025-05-07T19:44:44.6781516Z #define __USER_LABEL_PREFIX__ 2025-05-07T19:44:44.6781852Z #define __FLT64X_EPSILON__ 1.08420217248550443400745280086994171e-19F64x 2025-05-07T19:44:44.6782231Z #define __STDC_HOSTED__ 1 2025-05-07T19:44:44.6782469Z #define __DEC64_MIN_EXP__ (-382) 2025-05-07T19:44:44.6782731Z #define __DBL_DIG__ 15 2025-05-07T19:44:44.6782954Z #define __FLT32_DIG__ 6 2025-05-07T19:44:44.6783257Z #define __FLT_EPSILON__ 1.19209289550781250000000000000000000e-7F 2025-05-07T19:44:44.6783695Z #define __SHRT_WIDTH__ 16 2025-05-07T19:44:44.6783947Z #define __FLT32_IS_IEC_60559__ 2 2025-05-07T19:44:44.6784277Z #define __LDBL_MIN__ 3.36210314311209350626267781732175260e-4932L 2025-05-07T19:44:44.6784606Z #define __STDC_UTF_16__ 1 2025-05-07T19:44:44.6784856Z #define __DBL_IS_IEC_60559__ 2 2025-05-07T19:44:44.6785107Z #define __DEC32_MAX__ 9.999999E96DF 2025-05-07T19:44:44.6785485Z #define __FLT64X_DENORM_MIN__ 3.64519953188247460252840593361941982e-4951F64x 2025-05-07T19:44:44.6785869Z #define __FLT32X_HAS_INFINITY__ 1 2025-05-07T19:44:44.6786169Z #define __INT32_MAX__ 0x7fffffff 2025-05-07T19:44:44.6786413Z #define __unix__ 1 2025-05-07T19:44:44.6786642Z #define __INT_WIDTH__ 32 2025-05-07T19:44:44.6786874Z #define __SIZEOF_LONG__ 8 2025-05-07T19:44:44.6787126Z #define __STDC_IEC_559__ 1 2025-05-07T19:44:44.6787441Z #define __STDC_ISO_10646__ 201103L 2025-05-07T19:44:44.6787716Z #define __UINT16_C(c) c 2025-05-07T19:44:44.6787965Z #define __DECIMAL_DIG__ 21 2025-05-07T19:44:44.6788219Z #define __STDC_IEC_559_COMPLEX__ 1 2025-05-07T19:44:44.6788580Z #define __FLT64_EPSILON__ 2.22044604925031308084726333618164062e-16F64 2025-05-07T19:44:44.6788939Z #define __gnu_linux__ 1 2025-05-07T19:44:44.6789193Z #define __FLT128_IS_IEC_60559__ 2 2025-05-07T19:44:44.6789461Z #define __FLT64X_MIN_10_EXP__ (-4931) 2025-05-07T19:44:44.6789760Z #define __LDBL_HAS_QUIET_NAN__ 1 2025-05-07T19:44:44.6790019Z #define __FLT64_MANT_DIG__ 53 2025-05-07T19:44:44.6790294Z #define __FLT64X_MANT_DIG__ 64 2025-05-07T19:44:44.6790741Z #define __GNUC__ 11 2025-05-07T19:44:44.6791237Z #define __pie__ 2 2025-05-07T19:44:44.6791492Z #define __MMX__ 1 2025-05-07T19:44:44.6791720Z #define __FLT_HAS_DENORM__ 1 2025-05-07T19:44:44.6792087Z #define __SIZEOF_LONG_DOUBLE__ 16 2025-05-07T19:44:44.6792375Z #define __BIGGEST_ALIGNMENT__ 16 2025-05-07T19:44:44.6792679Z #define __FLT64_MAX_10_EXP__ 308 2025-05-07T19:44:44.6793034Z #define __DBL_MAX__ ((double)1.79769313486231570814527423731704357e+308L) 2025-05-07T19:44:44.6793473Z #define __INT_FAST32_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:44.6793794Z #define __DBL_HAS_INFINITY__ 1 2025-05-07T19:44:44.6794072Z #define __SIZEOF_FLOAT__ 4 2025-05-07T19:44:44.6794337Z #define __HAVE_SPECULATION_SAFE_VALUE 1 2025-05-07T19:44:44.6794650Z #define __DEC32_MIN_EXP__ (-94) 2025-05-07T19:44:44.6794931Z #define __INTPTR_WIDTH__ 64 2025-05-07T19:44:44.6795192Z #define __FLT64X_HAS_INFINITY__ 1 2025-05-07T19:44:44.6795495Z #define __UINT_LEAST32_MAX__ 0xffffffffU 2025-05-07T19:44:44.6795794Z #define __FLT32X_HAS_DENORM__ 1 2025-05-07T19:44:44.6796081Z #define __INT_FAST16_TYPE__ long int 2025-05-07T19:44:44.6796363Z #define __MMX_WITH_SSE__ 1 2025-05-07T19:44:44.6796629Z #define __LDBL_HAS_DENORM__ 1 2025-05-07T19:44:44.6796897Z #define __FLT128_HAS_INFINITY__ 1 2025-05-07T19:44:44.6797189Z #define __DEC32_MIN__ 1E-95DF 2025-05-07T19:44:44.6797460Z #define __DBL_MAX_EXP__ 1024 2025-05-07T19:44:44.6797743Z #define __WCHAR_WIDTH__ 32 2025-05-07T19:44:44.6798086Z #define __FLT32_MAX__ 3.40282346638528859811704183484516925e+38F32 2025-05-07T19:44:44.6798461Z #define __DEC128_EPSILON__ 1E-33DL 2025-05-07T19:44:44.6798752Z #define __SSE2_MATH__ 1 2025-05-07T19:44:44.6798999Z #define __ATOMIC_HLE_RELEASE 131072 2025-05-07T19:44:44.6799319Z #define __PTRDIFF_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:44.6799616Z #define __amd64 1 2025-05-07T19:44:44.6799858Z #define __STDC_NO_THREADS__ 1 2025-05-07T19:44:44.6800130Z #define __ATOMIC_HLE_ACQUIRE 65536 2025-05-07T19:44:44.6800450Z #define __LONG_LONG_MAX__ 0x7fffffffffffffffLL 2025-05-07T19:44:44.6800781Z #define __SIZEOF_SIZE_T__ 8 2025-05-07T19:44:44.6801044Z #define __FLT64X_MIN_EXP__ (-16381) 2025-05-07T19:44:44.6801341Z #define __SIZEOF_WINT_T__ 4 2025-05-07T19:44:44.6801601Z #define __LONG_LONG_WIDTH__ 64 2025-05-07T19:44:44.6801884Z #define __FLT32_MAX_EXP__ 128 2025-05-07T19:44:44.6802154Z #define __GXX_ABI_VERSION 1016 2025-05-07T19:44:44.6802435Z #define __FLT_MIN_EXP__ (-125) 2025-05-07T19:44:44.6802707Z #define __GCC_HAVE_DWARF2_CFI_ASM 1 2025-05-07T19:44:44.6803168Z #define __INT16_MAX__ 0x7fff 2025-05-07T19:44:44.6803419Z #define __x86_64 1 2025-05-07T19:44:44.6803676Z #define __INT_FAST64_TYPE__ long int 2025-05-07T19:44:44.6804308Z #define __FLT64_DENORM_MIN__ 4.94065645841246544176568792868221372e-324F64 2025-05-07T19:44:44.6804760Z #define __DBL_MIN__ ((double)2.22507385850720138309023271733240406e-308L) 2025-05-07T19:44:44.6805222Z #define __FLT128_EPSILON__ 1.92592994438723585305597794258492732e-34F128 2025-05-07T19:44:44.6805676Z #define __FLT64X_NORM_MAX__ 1.18973149535723176502126385303097021e+4932F64x 2025-05-07T19:44:44.6806067Z #define __SIZEOF_POINTER__ 8 2025-05-07T19:44:44.6806308Z #define __LP64__ 1 2025-05-07T19:44:44.6806546Z #define __DBL_HAS_QUIET_NAN__ 1 2025-05-07T19:44:44.6806882Z #define __FLT32X_EPSILON__ 2.22044604925031308084726333618164062e-16F32x 2025-05-07T19:44:44.6807363Z #define __DECIMAL_BID_FORMAT__ 1 2025-05-07T19:44:44.6807643Z #define __FLT64_MIN_EXP__ (-1021) 2025-05-07T19:44:44.6807906Z #define __FLT64_MIN_10_EXP__ (-307) 2025-05-07T19:44:44.6808192Z #define __FLT64X_DECIMAL_DIG__ 21 2025-05-07T19:44:44.6808450Z #define __DEC128_MIN__ 1E-6143DL 2025-05-07T19:44:44.6808723Z #define __REGISTER_PREFIX__ 2025-05-07T19:44:44.6808967Z #define __UINT16_MAX__ 0xffff 2025-05-07T19:44:44.6809231Z #define __DBL_HAS_DENORM__ 1 2025-05-07T19:44:44.6809475Z #define __LDBL_HAS_INFINITY__ 1 2025-05-07T19:44:44.6809800Z #define __FLT32_MIN__ 1.17549435082228750796873653722224568e-38F32 2025-05-07T19:44:44.6810145Z #define __UINT8_TYPE__ unsigned char 2025-05-07T19:44:44.6810422Z #define __FLT_DIG__ 6 2025-05-07T19:44:44.6810654Z #define __NO_INLINE__ 1 2025-05-07T19:44:44.6810878Z #define __DEC_EVAL_METHOD__ 2 2025-05-07T19:44:44.6811198Z #define __DEC128_MAX__ 9.999999999999999999999999999999999E6144DL 2025-05-07T19:44:44.6811531Z #define __FLT_MANT_DIG__ 24 2025-05-07T19:44:44.6811787Z #define __LDBL_DECIMAL_DIG__ 21 2025-05-07T19:44:44.6812034Z #define __VERSION__ "11.4.0" 2025-05-07T19:44:44.6812293Z #define __UINT64_C(c) c ## UL 2025-05-07T19:44:44.6812535Z #define _STDC_PREDEF_H 1 2025-05-07T19:44:44.6812791Z #define __INT_LEAST32_MAX__ 0x7fffffff 2025-05-07T19:44:44.6813074Z #define __GCC_ATOMIC_INT_LOCK_FREE 2 2025-05-07T19:44:44.6813362Z #define __FLT128_MAX_EXP__ 16384 2025-05-07T19:44:44.6813642Z #define __FLT32_MANT_DIG__ 24 2025-05-07T19:44:44.6813925Z #define __FLOAT_WORD_ORDER__ __ORDER_LITTLE_ENDIAN__ 2025-05-07T19:44:44.6814250Z #define __FLT128_HAS_DENORM__ 1 2025-05-07T19:44:44.6814499Z #define __FLT32_DECIMAL_DIG__ 9 2025-05-07T19:44:44.6814758Z #define __FLT128_DIG__ 33 2025-05-07T19:44:44.6814979Z #define __INT32_C(c) c 2025-05-07T19:44:44.6815222Z #define __DEC64_EPSILON__ 1E-15DD 2025-05-07T19:44:44.6815485Z #define __ORDER_PDP_ENDIAN__ 3412 2025-05-07T19:44:44.6815765Z #define __DEC128_MIN_EXP__ (-6142) 2025-05-07T19:44:44.6816031Z #define __INT_FAST32_TYPE__ long int 2025-05-07T19:44:44.6816353Z #define __UINT_LEAST16_TYPE__ short unsigned int 2025-05-07T19:44:44.6816666Z #define unix 1 2025-05-07T19:44:44.6816889Z #define __SIZE_TYPE__ long unsigned int 2025-05-07T19:44:44.6817206Z #define __UINT64_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:44.6817495Z #define __FLT_IS_IEC_60559__ 2 2025-05-07T19:44:44.6817816Z #define __GNUC_WIDE_EXECUTION_CHARSET_NAME "UTF-32LE" 2025-05-07T19:44:44.6818137Z #define __FLT64X_DIG__ 18 2025-05-07T19:44:44.6818399Z #define __INT8_TYPE__ signed char 2025-05-07T19:44:44.6818652Z #define __ELF__ 1 2025-05-07T19:44:44.6818899Z #define __GCC_ASM_FLAG_OUTPUTS__ 1 2025-05-07T19:44:44.6819191Z #define __UINT32_TYPE__ unsigned int 2025-05-07T19:44:44.6819458Z #define __FLT_RADIX__ 2 2025-05-07T19:44:44.6819717Z #define __INT_LEAST16_TYPE__ short int 2025-05-07T19:44:44.6820065Z #define __LDBL_EPSILON__ 1.08420217248550443400745280086994171e-19L 2025-05-07T19:44:44.6820447Z #define __UINTMAX_C(c) c ## UL 2025-05-07T19:44:44.6820699Z #define __SSE_MATH__ 1 2025-05-07T19:44:44.6820936Z #define __k8 1 2025-05-07T19:44:44.6821219Z #define __FLT32X_MIN__ 2.22507385850720138309023271733240406e-308F32x 2025-05-07T19:44:44.6821678Z #define __SIG_ATOMIC_MAX__ 0x7fffffff 2025-05-07T19:44:44.6821964Z #define __GCC_ATOMIC_WCHAR_T_LOCK_FREE 2 2025-05-07T19:44:44.6822274Z #define __SIZEOF_PTRDIFF_T__ 8 2025-05-07T19:44:44.6822537Z #define __LDBL_DIG__ 18 2025-05-07T19:44:44.6822769Z #define __FLT64_IS_IEC_60559__ 2 2025-05-07T19:44:44.6823023Z #define __x86_64__ 1 2025-05-07T19:44:44.6823251Z #define __FLT32X_MIN_EXP__ (-1021) 2025-05-07T19:44:44.6823548Z #define __DEC32_SUBNORMAL_MIN__ 0.000001E-95DF 2025-05-07T19:44:44.6823865Z #define __INT_FAST16_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:44.6824168Z #define __FLT64_DIG__ 15 2025-05-07T19:44:44.6842624Z #define __UINT_FAST32_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:44.6843052Z #define __UINT_LEAST64_TYPE__ long unsigned int 2025-05-07T19:44:44.6843409Z #define __FLT_HAS_QUIET_NAN__ 1 2025-05-07T19:44:44.6843991Z #define __FLT_MAX_10_EXP__ 38 2025-05-07T19:44:44.6844265Z #define __LONG_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:44.6844586Z #define __FLT64X_HAS_DENORM__ 1 2025-05-07T19:44:44.6844943Z #define __DEC128_SUBNORMAL_MIN__ 0.000000000000000000000000000000001E-6143DL 2025-05-07T19:44:44.6845359Z #define __FLT_HAS_INFINITY__ 1 2025-05-07T19:44:44.6845641Z #define __GNUC_EXECUTION_CHARSET_NAME "UTF-8" 2025-05-07T19:44:44.6845989Z #define __UINT_FAST16_TYPE__ long unsigned int 2025-05-07T19:44:44.6846317Z #define __DEC64_MAX__ 9.999999999999999E384DD 2025-05-07T19:44:44.6846602Z #define __INT_FAST32_WIDTH__ 64 2025-05-07T19:44:44.6846890Z #define __CHAR16_TYPE__ short unsigned int 2025-05-07T19:44:44.6847187Z #define __PRAGMA_REDEFINE_EXTNAME 1 2025-05-07T19:44:44.6847472Z #define __SIZE_WIDTH__ 64 2025-05-07T19:44:44.6847701Z #define __SEG_FS 1 2025-05-07T19:44:44.6847947Z #define __INT_LEAST16_MAX__ 0x7fff 2025-05-07T19:44:44.6848214Z #define __DEC64_MANT_DIG__ 16 2025-05-07T19:44:44.6848502Z #define __INT64_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:44.6848784Z #define __SEG_GS 1 2025-05-07T19:44:44.6849107Z #define __FLT32_DENORM_MIN__ 1.40129846432481707092372958328991613e-45F32 2025-05-07T19:44:44.6849493Z #define __SIG_ATOMIC_WIDTH__ 32 2025-05-07T19:44:44.6849751Z #define __INT_LEAST64_TYPE__ long int 2025-05-07T19:44:44.6850042Z #define __INT16_TYPE__ short int 2025-05-07T19:44:44.6850306Z #define __INT_LEAST8_TYPE__ signed char 2025-05-07T19:44:44.6850609Z #define __STDC_VERSION__ 201710L 2025-05-07T19:44:44.6850865Z #define __SIZEOF_INT__ 4 2025-05-07T19:44:44.6851116Z #define __DEC32_MAX_EXP__ 97 2025-05-07T19:44:44.6851362Z #define __INT_FAST8_MAX__ 0x7f 2025-05-07T19:44:44.6851706Z #define __FLT128_MAX__ 1.18973149535723176508575932662800702e+4932F128 2025-05-07T19:44:44.6852095Z #define __INTPTR_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:44.6852373Z #define linux 1 2025-05-07T19:44:44.6852595Z #define __FLT64_HAS_QUIET_NAN__ 1 2025-05-07T19:44:44.6852855Z #define __FLT32_MIN_10_EXP__ (-37) 2025-05-07T19:44:44.6853135Z #define __FLT32X_DIG__ 15 2025-05-07T19:44:44.6853371Z #define __PTRDIFF_WIDTH__ 64 2025-05-07T19:44:44.6853639Z #define __LDBL_MANT_DIG__ 64 2025-05-07T19:44:44.6853885Z #define __FLT64_HAS_INFINITY__ 1 2025-05-07T19:44:44.6854236Z #define __FLT64X_MAX__ 1.18973149535723176502126385303097021e+4932F64x 2025-05-07T19:44:44.6854638Z #define __SIG_ATOMIC_MIN__ (-__SIG_ATOMIC_MAX__ - 1) 2025-05-07T19:44:44.6854976Z #define __code_model_small__ 1 2025-05-07T19:44:44.6855258Z #define __GCC_ATOMIC_LONG_LOCK_FREE 2 2025-05-07T19:44:44.6855529Z #define __DEC32_MANT_DIG__ 7 2025-05-07T19:44:44.6855786Z #define __k8__ 1 2025-05-07T19:44:44.6856001Z #define __INTPTR_TYPE__ long int 2025-05-07T19:44:44.6856294Z #define __UINT16_TYPE__ short unsigned int 2025-05-07T19:44:44.6856583Z #define __WCHAR_TYPE__ int 2025-05-07T19:44:44.6856819Z #define __pic__ 2 2025-05-07T19:44:44.6857056Z #define __UINTPTR_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:44.6857415Z #define __INT_FAST64_WIDTH__ 64 2025-05-07T19:44:44.6857708Z #define __INT_FAST64_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:44.6858023Z #define __GCC_ATOMIC_TEST_AND_SET_TRUEVAL 1 2025-05-07T19:44:44.6858491Z #define __FLT_NORM_MAX__ 3.40282346638528859811704183484516925e+38F 2025-05-07T19:44:44.6858841Z #define __FLT32_HAS_INFINITY__ 1 2025-05-07T19:44:44.6859122Z #define __FLT64X_MAX_EXP__ 16384 2025-05-07T19:44:44.6859416Z #define __UINT_FAST64_TYPE__ long unsigned int 2025-05-07T19:44:44.6859715Z #define __INT_MAX__ 0x7fffffff 2025-05-07T19:44:44.6859971Z #define __linux__ 1 2025-05-07T19:44:44.6860189Z #define __INT64_TYPE__ long int 2025-05-07T19:44:44.6860451Z #define __FLT_MAX_EXP__ 128 2025-05-07T19:44:44.6860698Z #define __ORDER_BIG_ENDIAN__ 4321 2025-05-07T19:44:44.6860983Z #define __DBL_MANT_DIG__ 53 2025-05-07T19:44:44.6861229Z #define __SIZEOF_FLOAT128__ 16 2025-05-07T19:44:44.6861525Z #define __INT_LEAST64_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:44.6861838Z #define __GCC_ATOMIC_CHAR16_T_LOCK_FREE 2 2025-05-07T19:44:44.6862197Z #define __DEC64_MIN__ 1E-383DD 2025-05-07T19:44:44.6862449Z #define __WINT_TYPE__ unsigned int 2025-05-07T19:44:44.6862742Z #define __UINT_LEAST32_TYPE__ unsigned int 2025-05-07T19:44:44.6863026Z #define __SIZEOF_SHORT__ 2 2025-05-07T19:44:44.6863357Z #define __FLT32_NORM_MAX__ 3.40282346638528859811704183484516925e+38F32 2025-05-07T19:44:44.6863718Z #define __SSE__ 1 2025-05-07T19:44:44.6863928Z #define __LDBL_MIN_EXP__ (-16381) 2025-05-07T19:44:44.6864266Z #define __FLT64_MAX__ 1.79769313486231570814527423731704357e+308F64 2025-05-07T19:44:44.6864595Z #define __amd64__ 1 2025-05-07T19:44:44.6864822Z #define __WINT_WIDTH__ 32 2025-05-07T19:44:44.6865058Z #define __INT_LEAST8_MAX__ 0x7f 2025-05-07T19:44:44.6865324Z #define __INT_LEAST64_WIDTH__ 64 2025-05-07T19:44:44.6865577Z #define __LDBL_MAX_EXP__ 16384 2025-05-07T19:44:44.6865843Z #define __FLT32X_MAX_10_EXP__ 308 2025-05-07T19:44:44.6866097Z #define __SIZEOF_INT128__ 16 2025-05-07T19:44:44.6866366Z #define __FLT64X_IS_IEC_60559__ 2 2025-05-07T19:44:44.6866649Z #define __LDBL_MAX_10_EXP__ 4932 2025-05-07T19:44:44.6866903Z #define __ATOMIC_RELAXED 0 2025-05-07T19:44:44.6867250Z #define __DBL_EPSILON__ ((double)2.22044604925031308084726333618164062e-16L) 2025-05-07T19:44:44.6867699Z #define __FLT128_MIN__ 3.36210314311209350626267781732175260e-4932F128 2025-05-07T19:44:44.6868058Z #define _LP64 1 2025-05-07T19:44:44.6868262Z #define __UINT8_C(c) c 2025-05-07T19:44:44.6868503Z #define __FLT64_MAX_EXP__ 1024 2025-05-07T19:44:44.6868752Z #define __INT_LEAST32_TYPE__ int 2025-05-07T19:44:44.6869023Z #define __SIZEOF_WCHAR_T__ 4 2025-05-07T19:44:44.6869304Z #define __UINT64_TYPE__ long unsigned int 2025-05-07T19:44:44.6869595Z #define __GNUC_PATCHLEVEL__ 0 2025-05-07T19:44:44.6869950Z #define __FLT128_NORM_MAX__ 1.18973149535723176508575932662800702e+4932F128 2025-05-07T19:44:44.6870400Z #define __FLT64_NORM_MAX__ 1.79769313486231570814527423731704357e+308F64 2025-05-07T19:44:44.6870778Z #define __FLT128_HAS_QUIET_NAN__ 1 2025-05-07T19:44:44.6871064Z #define __INTMAX_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:44.6871515Z #define __INT_FAST8_TYPE__ signed char 2025-05-07T19:44:44.6872076Z #define __FLT64X_MIN__ 3.36210314311209350626267781732175260e-4932F64x 2025-05-07T19:44:44.6872485Z #define __GNUC_STDC_INLINE__ 1 2025-05-07T19:44:44.6872773Z #define __FLT64_HAS_DENORM__ 1 2025-05-07T19:44:44.6873122Z #define __FLT32_EPSILON__ 1.19209289550781250000000000000000000e-7F32 2025-05-07T19:44:44.6873524Z #define __DBL_DECIMAL_DIG__ 17 2025-05-07T19:44:44.6873790Z #define __STDC_UTF_32__ 1 2025-05-07T19:44:44.6874059Z #define __INT_FAST8_WIDTH__ 8 2025-05-07T19:44:44.6874311Z #define __FXSR__ 1 2025-05-07T19:44:44.6874631Z #define __FLT32X_MAX__ 1.79769313486231570814527423731704357e+308F32x 2025-05-07T19:44:44.6875108Z #define __DBL_NORM_MAX__ ((double)1.79769313486231570814527423731704357e+308L) 2025-05-07T19:44:44.6875552Z #define __BYTE_ORDER__ __ORDER_LITTLE_ENDIAN__ 2025-05-07T19:44:44.6875883Z #define __INTMAX_WIDTH__ 64 2025-05-07T19:44:44.6876339Z #define __UINT32_C(c) c ## U 2025-05-07T19:44:44.6876702Z #define __FLT_DENORM_MIN__ 1.40129846432481707092372958328991613e-45F 2025-05-07T19:44:44.6877193Z #define __INT8_MAX__ 0x7f 2025-05-07T19:44:44.6877461Z #define __LONG_WIDTH__ 64 2025-05-07T19:44:44.6877701Z #define __PIC__ 2 2025-05-07T19:44:44.6877971Z #define __UINT_FAST32_TYPE__ long unsigned int 2025-05-07T19:44:44.6878382Z #define __FLT32X_NORM_MAX__ 1.79769313486231570814527423731704357e+308F32x 2025-05-07T19:44:44.6878794Z #define __CHAR32_TYPE__ unsigned int 2025-05-07T19:44:44.6879146Z #define __FLT_MAX__ 3.40282346638528859811704183484516925e+38F 2025-05-07T19:44:44.6879481Z #define __SSE2__ 1 2025-05-07T19:44:44.6879716Z #define __INT32_TYPE__ int 2025-05-07T19:44:44.6879965Z #define __SIZEOF_DOUBLE__ 8 2025-05-07T19:44:44.6880236Z #define __FLT_MIN_10_EXP__ (-37) 2025-05-07T19:44:44.6880574Z #define __FLT64_MIN__ 2.22507385850720138309023271733240406e-308F64 2025-05-07T19:44:44.6880952Z #define __INT_LEAST32_WIDTH__ 32 2025-05-07T19:44:44.6881284Z #define __INTMAX_TYPE__ long int 2025-05-07T19:44:44.6881569Z #define __DEC128_MAX_EXP__ 6145 2025-05-07T19:44:44.6881837Z #define __FLT32X_HAS_QUIET_NAN__ 1 2025-05-07T19:44:44.6882128Z #define __ATOMIC_CONSUME 1 2025-05-07T19:44:44.6882386Z #define __GNUC_MINOR__ 4 2025-05-07T19:44:44.6882631Z #define __INT_FAST16_WIDTH__ 64 2025-05-07T19:44:44.6882935Z #define __UINTMAX_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:44.6883235Z #define __PIE__ 2 2025-05-07T19:44:44.6883583Z #define __FLT32X_DENORM_MIN__ 4.94065645841246544176568792868221372e-324F32x 2025-05-07T19:44:44.6884100Z #define __DBL_MAX_10_EXP__ 308 2025-05-07T19:44:44.6884445Z #define __LDBL_DENORM_MIN__ 3.64519953188247460252840593361941982e-4951L 2025-05-07T19:44:44.6884795Z #define __INT16_C(c) c 2025-05-07T19:44:44.6885025Z #define __STDC__ 1 2025-05-07T19:44:44.6885242Z #define __PTRDIFF_TYPE__ long int 2025-05-07T19:44:44.6885517Z #define __ATOMIC_SEQ_CST 5 2025-05-07T19:44:44.6885773Z #define __FLT32X_MIN_10_EXP__ (-307) 2025-05-07T19:44:44.6886062Z #define __UINTPTR_TYPE__ long unsigned int 2025-05-07T19:44:44.6886408Z #define __DEC64_SUBNORMAL_MIN__ 0.000000000000001E-383DD 2025-05-07T19:44:44.6886731Z #define __DEC128_MANT_DIG__ 34 2025-05-07T19:44:44.6886996Z #define __LDBL_MIN_10_EXP__ (-4931) 2025-05-07T19:44:44.6887262Z #define __SIZEOF_LONG_LONG__ 8 2025-05-07T19:44:44.6887529Z #define __FLT128_DECIMAL_DIG__ 36 2025-05-07T19:44:44.6887795Z #define __GCC_ATOMIC_LLONG_LOCK_FREE 2 2025-05-07T19:44:44.6888084Z #define __FLT32_HAS_QUIET_NAN__ 1 2025-05-07T19:44:44.6888355Z #define __FLT_DECIMAL_DIG__ 9 2025-05-07T19:44:44.6888632Z #define __UINT_FAST16_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:44.6889019Z #define __LDBL_NORM_MAX__ 1.18973149535723176502126385303097021e+4932L 2025-05-07T19:44:44.6889383Z #define __GCC_ATOMIC_SHORT_LOCK_FREE 2 2025-05-07T19:44:44.6889681Z #define __UINT_FAST8_TYPE__ unsigned char 2025-05-07T19:44:44.6889961Z #define __ATOMIC_ACQ_REL 4 2025-05-07T19:44:44.6890209Z #define __ATOMIC_RELEASE 3 2025-05-07T19:44:44.6890361Z 2025-05-07T19:44:44.7481120Z 2025-05-07T19:44:44.7482108Z [INFO] Printing out all preprocessor defines in the C++ compiler ... 2025-05-07T19:44:44.7483112Z + conda run -n build_binary c++ -dM -E -x c++ - 2025-05-07T19:44:44.7483347Z 2025-05-07T19:44:46.5488101Z #define __DBL_MIN_EXP__ (-1021) 2025-05-07T19:44:46.5488536Z #define __cpp_attributes 200809L 2025-05-07T19:44:46.5488940Z #define __cpp_nontype_template_parameter_auto 201606L 2025-05-07T19:44:46.5489335Z #define __UINT_LEAST16_MAX__ 0xffff 2025-05-07T19:44:46.5489689Z #define __ATOMIC_ACQUIRE 2 2025-05-07T19:44:46.5489981Z #define __FLT128_MAX_10_EXP__ 4932 2025-05-07T19:44:46.5490365Z #define __FLT_MIN__ 1.17549435082228750796873653722224568e-38F 2025-05-07T19:44:46.5491029Z #define __GCC_IEC_559_COMPLEX 2 2025-05-07T19:44:46.5491338Z #define __cpp_aggregate_nsdmi 201304L 2025-05-07T19:44:46.5491699Z #define __UINT_LEAST8_TYPE__ unsigned char 2025-05-07T19:44:46.5492032Z #define __SIZEOF_FLOAT80__ 16 2025-05-07T19:44:46.5492341Z #define __INTMAX_C(c) c ## L 2025-05-07T19:44:46.5492658Z #define __CHAR_BIT__ 8 2025-05-07T19:44:46.5492944Z #define __UINT8_MAX__ 0xff 2025-05-07T19:44:46.5493536Z #define __SCHAR_WIDTH__ 8 2025-05-07T19:44:46.5493841Z #define __WINT_MAX__ 0xffffffffU 2025-05-07T19:44:46.5494140Z #define __FLT32_MIN_EXP__ (-125) 2025-05-07T19:44:46.5494462Z #define __cpp_static_assert 201411L 2025-05-07T19:44:46.5494796Z #define __ORDER_LITTLE_ENDIAN__ 1234 2025-05-07T19:44:46.5495123Z #define __SIZE_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:46.5495480Z #define __WCHAR_MAX__ 0x7fffffff 2025-05-07T19:44:46.5495793Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_1 1 2025-05-07T19:44:46.5496165Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_2 1 2025-05-07T19:44:46.5496510Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_4 1 2025-05-07T19:44:46.5496960Z #define __DBL_DENORM_MIN__ double(4.94065645841246544176568792868221372e-324L) 2025-05-07T19:44:46.5497413Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_8 1 2025-05-07T19:44:46.5497774Z #define __GCC_ATOMIC_CHAR_LOCK_FREE 2 2025-05-07T19:44:46.5498243Z #define __GCC_IEC_559 2 2025-05-07T19:44:46.5498511Z #define __FLT32X_DECIMAL_DIG__ 17 2025-05-07T19:44:46.5498845Z #define __FLT_EVAL_METHOD__ 0 2025-05-07T19:44:46.5499146Z #define __cpp_binary_literals 201304L 2025-05-07T19:44:46.5499487Z #define __FLT64_DECIMAL_DIG__ 17 2025-05-07T19:44:46.5499801Z #define __cpp_noexcept_function_type 201510L 2025-05-07T19:44:46.5500174Z #define __GCC_ATOMIC_CHAR32_T_LOCK_FREE 2 2025-05-07T19:44:46.5500518Z #define __cpp_variadic_templates 200704L 2025-05-07T19:44:46.5500911Z #define __UINT_FAST64_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:46.5501269Z #define __SIG_ATOMIC_TYPE__ int 2025-05-07T19:44:46.5501595Z #define __DBL_MIN_10_EXP__ (-307) 2025-05-07T19:44:46.5501920Z #define __FINITE_MATH_ONLY__ 0 2025-05-07T19:44:46.5502224Z #define __cpp_variable_templates 201304L 2025-05-07T19:44:46.5502570Z #define __FLT32X_MAX_EXP__ 1024 2025-05-07T19:44:46.5502856Z #define __FLT32_HAS_DENORM__ 1 2025-05-07T19:44:46.5503167Z #define __UINT_FAST8_MAX__ 0xff 2025-05-07T19:44:46.5503693Z #define __cpp_rvalue_reference 200610L 2025-05-07T19:44:46.5504048Z #define __cpp_nested_namespace_definitions 201411L 2025-05-07T19:44:46.5504409Z #define __DEC64_MAX_EXP__ 385 2025-05-07T19:44:46.5504692Z #define __INT8_C(c) c 2025-05-07T19:44:46.5504936Z #define __INT_LEAST8_WIDTH__ 8 2025-05-07T19:44:46.5505233Z #define __cpp_variadic_using 201611L 2025-05-07T19:44:46.5505560Z #define __UINT_LEAST64_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:46.5505927Z #define __INT_LEAST8_MAX__ 0x7f 2025-05-07T19:44:46.5506214Z #define __cpp_capture_star_this 201603L 2025-05-07T19:44:46.5506532Z #define __SHRT_MAX__ 0x7fff 2025-05-07T19:44:46.5506848Z #define __LDBL_MAX__ 1.18973149535723176502126385303097021e+4932L 2025-05-07T19:44:46.5507235Z #define __FLT64X_MAX_10_EXP__ 4932 2025-05-07T19:44:46.5507516Z #define __cpp_if_constexpr 201606L 2025-05-07T19:44:46.5507818Z #define __LDBL_IS_IEC_60559__ 2 2025-05-07T19:44:46.5508113Z #define __FLT64X_HAS_QUIET_NAN__ 1 2025-05-07T19:44:46.5508399Z #define __UINT_LEAST8_MAX__ 0xff 2025-05-07T19:44:46.5508701Z #define __GCC_ATOMIC_BOOL_LOCK_FREE 2 2025-05-07T19:44:46.5509102Z #define __FLT128_DENORM_MIN__ 6.47517511943802511092443895822764655e-4966F128 2025-05-07T19:44:46.5509548Z #define __UINTMAX_TYPE__ long unsigned int 2025-05-07T19:44:46.5509842Z #define __linux 1 2025-05-07T19:44:46.5510102Z #define __DEC32_EPSILON__ 1E-6DF 2025-05-07T19:44:46.5510387Z #define __FLT_EVAL_METHOD_TS_18661_3__ 0 2025-05-07T19:44:46.5510699Z #define __unix 1 2025-05-07T19:44:46.5510957Z #define __UINT32_MAX__ 0xffffffffU 2025-05-07T19:44:46.5511346Z #define __GXX_EXPERIMENTAL_CXX0X__ 1 2025-05-07T19:44:46.5511846Z #define __FLT128_MIN_EXP__ (-16381) 2025-05-07T19:44:46.5512145Z #define __WINT_MIN__ 0U 2025-05-07T19:44:46.5512447Z #define __FLT128_MIN_10_EXP__ (-4931) 2025-05-07T19:44:46.5512760Z #define __FLT32X_IS_IEC_60559__ 2 2025-05-07T19:44:46.5513081Z #define __INT_LEAST16_WIDTH__ 16 2025-05-07T19:44:46.5513374Z #define __SCHAR_MAX__ 0x7f 2025-05-07T19:44:46.5513685Z #define __FLT128_MANT_DIG__ 113 2025-05-07T19:44:46.5513983Z #define __WCHAR_MIN__ (-__WCHAR_MAX__ - 1) 2025-05-07T19:44:46.5514427Z #define __INT64_C(c) c ## L 2025-05-07T19:44:46.5514717Z #define __GCC_ATOMIC_POINTER_LOCK_FREE 2 2025-05-07T19:44:46.5515023Z #define __FLT32X_MANT_DIG__ 53 2025-05-07T19:44:46.5515320Z #define __GCC_ATOMIC_CHAR16_T_LOCK_FREE 2 2025-05-07T19:44:46.5515629Z #define __cpp_aligned_new 201606L 2025-05-07T19:44:46.5515928Z #define __USER_LABEL_PREFIX__ 2025-05-07T19:44:46.5516195Z #define __FLT32_MAX_10_EXP__ 38 2025-05-07T19:44:46.5516568Z #define __FLT64X_EPSILON__ 1.08420217248550443400745280086994171e-19F64x 2025-05-07T19:44:46.5516966Z #define __STDC_HOSTED__ 1 2025-05-07T19:44:46.5517242Z #define __DEC64_MIN_EXP__ (-382) 2025-05-07T19:44:46.5517522Z #define __cpp_decltype_auto 201304L 2025-05-07T19:44:46.5517819Z #define __DBL_DIG__ 15 2025-05-07T19:44:46.5518064Z #define __FLT32_DIG__ 6 2025-05-07T19:44:46.5518462Z #define __FLT_EPSILON__ 1.19209289550781250000000000000000000e-7F 2025-05-07T19:44:46.5518835Z #define __GXX_WEAK__ 1 2025-05-07T19:44:46.5519070Z #define __SHRT_WIDTH__ 16 2025-05-07T19:44:46.5519337Z #define __FLT32_IS_IEC_60559__ 2 2025-05-07T19:44:46.5519665Z #define __LDBL_MIN__ 3.36210314311209350626267781732175260e-4932L 2025-05-07T19:44:46.5520036Z #define __DBL_IS_IEC_60559__ 2 2025-05-07T19:44:46.5520298Z #define __DEC32_MAX__ 9.999999E96DF 2025-05-07T19:44:46.5520618Z #define __cpp_threadsafe_static_init 200806L 2025-05-07T19:44:46.5520966Z #define __cpp_enumerator_attributes 201411L 2025-05-07T19:44:46.5521385Z #define __FLT64X_DENORM_MIN__ 3.64519953188247460252840593361941982e-4951F64x 2025-05-07T19:44:46.5521816Z #define __FLT32X_HAS_INFINITY__ 1 2025-05-07T19:44:46.5522094Z #define __INT32_MAX__ 0x7fffffff 2025-05-07T19:44:46.5522367Z #define __unix__ 1 2025-05-07T19:44:46.5522592Z #define __INT_WIDTH__ 32 2025-05-07T19:44:46.5522853Z #define __SIZEOF_LONG__ 8 2025-05-07T19:44:46.5523100Z #define __STDC_IEC_559__ 1 2025-05-07T19:44:46.5523375Z #define __STDC_ISO_10646__ 201103L 2025-05-07T19:44:46.5523647Z #define __UINT16_C(c) c 2025-05-07T19:44:46.5523905Z #define __DECIMAL_DIG__ 21 2025-05-07T19:44:46.5524280Z #define __STDC_IEC_559_COMPLEX__ 1 2025-05-07T19:44:46.5524620Z #define __FLT64_EPSILON__ 2.22044604925031308084726333618164062e-16F64 2025-05-07T19:44:46.5524987Z #define __gnu_linux__ 1 2025-05-07T19:44:46.5525214Z #define __INT16_MAX__ 0x7fff 2025-05-07T19:44:46.5525475Z #define __FLT64_MIN_EXP__ (-1021) 2025-05-07T19:44:46.5525737Z #define __FLT64X_MIN_10_EXP__ (-4931) 2025-05-07T19:44:46.5526021Z #define __LDBL_HAS_QUIET_NAN__ 1 2025-05-07T19:44:46.5526275Z #define __FLT64_MANT_DIG__ 53 2025-05-07T19:44:46.5526538Z #define __FLT64X_MANT_DIG__ 64 2025-05-07T19:44:46.5526780Z #define __GNUC__ 11 2025-05-07T19:44:46.5527000Z #define __GXX_RTTI 1 2025-05-07T19:44:46.5527230Z #define __pie__ 2 2025-05-07T19:44:46.5527430Z #define __MMX__ 1 2025-05-07T19:44:46.5527660Z #define __FLT_HAS_DENORM__ 1 2025-05-07T19:44:46.5527911Z #define __SIZEOF_LONG_DOUBLE__ 16 2025-05-07T19:44:46.5528190Z #define __BIGGEST_ALIGNMENT__ 16 2025-05-07T19:44:46.5528446Z #define __STDC_UTF_16__ 1 2025-05-07T19:44:46.5528714Z #define __FLT64_MAX_10_EXP__ 308 2025-05-07T19:44:46.5529010Z #define __cpp_delegating_constructors 200604L 2025-05-07T19:44:46.5529355Z #define __FLT32_HAS_INFINITY__ 1 2025-05-07T19:44:46.5529701Z #define __DBL_MAX__ double(1.79769313486231570814527423731704357e+308L) 2025-05-07T19:44:46.5530098Z #define __cpp_raw_strings 200710L 2025-05-07T19:44:46.5530430Z #define __INT_FAST32_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:46.5530749Z #define __DBL_HAS_INFINITY__ 1 2025-05-07T19:44:46.5531042Z #define __SIZEOF_FLOAT__ 4 2025-05-07T19:44:46.5531311Z #define __HAVE_SPECULATION_SAFE_VALUE 1 2025-05-07T19:44:46.5531641Z #define __cpp_fold_expressions 201603L 2025-05-07T19:44:46.5531934Z #define __DEC32_MIN_EXP__ (-94) 2025-05-07T19:44:46.5532225Z #define __INTPTR_WIDTH__ 64 2025-05-07T19:44:46.5532488Z #define __FLT64X_HAS_INFINITY__ 1 2025-05-07T19:44:46.5532799Z #define __UINT_LEAST32_MAX__ 0xffffffffU 2025-05-07T19:44:46.5533093Z #define __FLT32X_HAS_DENORM__ 1 2025-05-07T19:44:46.5533489Z #define __INT_FAST16_TYPE__ long int 2025-05-07T19:44:46.5533797Z #define __MMX_WITH_SSE__ 1 2025-05-07T19:44:46.5534047Z #define __LDBL_HAS_DENORM__ 1 2025-05-07T19:44:46.5534339Z #define __cplusplus 201703L 2025-05-07T19:44:46.5534605Z #define __cpp_ref_qualifiers 200710L 2025-05-07T19:44:46.5534912Z #define __DEC32_MIN__ 1E-95DF 2025-05-07T19:44:46.5535166Z #define __DEPRECATED 1 2025-05-07T19:44:46.5535446Z #define __cpp_rvalue_references 200610L 2025-05-07T19:44:46.5535743Z #define __DBL_MAX_EXP__ 1024 2025-05-07T19:44:46.5536031Z #define __WCHAR_WIDTH__ 32 2025-05-07T19:44:46.5536502Z #define __FLT32_MAX__ 3.40282346638528859811704183484516925e+38F32 2025-05-07T19:44:46.5536885Z #define __DEC128_EPSILON__ 1E-33DL 2025-05-07T19:44:46.5537199Z #define __SSE2_MATH__ 1 2025-05-07T19:44:46.5537452Z #define __ATOMIC_HLE_RELEASE 131072 2025-05-07T19:44:46.5537874Z #define __PTRDIFF_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:46.5538171Z #define __amd64 1 2025-05-07T19:44:46.5538424Z #define __STDC_NO_THREADS__ 1 2025-05-07T19:44:46.5538693Z #define __ATOMIC_HLE_ACQUIRE 65536 2025-05-07T19:44:46.5538984Z #define __GNUG__ 11 2025-05-07T19:44:46.5539238Z #define __LONG_LONG_MAX__ 0x7fffffffffffffffLL 2025-05-07T19:44:46.5539581Z #define __SIZEOF_SIZE_T__ 8 2025-05-07T19:44:46.5539877Z #define __cpp_nsdmi 200809L 2025-05-07T19:44:46.5540143Z #define __FLT64X_MIN_EXP__ (-16381) 2025-05-07T19:44:46.5540443Z #define __SIZEOF_WINT_T__ 4 2025-05-07T19:44:46.5540699Z #define __LONG_LONG_WIDTH__ 64 2025-05-07T19:44:46.5541001Z #define __cpp_initializer_lists 200806L 2025-05-07T19:44:46.5541303Z #define __FLT32_MAX_EXP__ 128 2025-05-07T19:44:46.5541596Z #define __cpp_hex_float 201603L 2025-05-07T19:44:46.5541864Z #define __GXX_ABI_VERSION 1016 2025-05-07T19:44:46.5542157Z #define __FLT128_HAS_INFINITY__ 1 2025-05-07T19:44:46.5542486Z #define __FLT_MIN_EXP__ (-125) 2025-05-07T19:44:46.5542784Z #define __GCC_HAVE_DWARF2_CFI_ASM 1 2025-05-07T19:44:46.5543076Z #define __x86_64 1 2025-05-07T19:44:46.5543309Z #define __cpp_lambdas 200907L 2025-05-07T19:44:46.5543612Z #define __INT_FAST64_TYPE__ long int 2025-05-07T19:44:46.5543981Z #define __FLT64_DENORM_MIN__ 4.94065645841246544176568792868221372e-324F64 2025-05-07T19:44:46.5544400Z #define __cpp_template_auto 201606L 2025-05-07T19:44:46.5544759Z #define __DBL_MIN__ double(2.22507385850720138309023271733240406e-308L) 2025-05-07T19:44:46.5545235Z #define __FLT128_EPSILON__ 1.92592994438723585305597794258492732e-34F128 2025-05-07T19:44:46.5545707Z #define __FLT64X_NORM_MAX__ 1.18973149535723176502126385303097021e+4932F64x 2025-05-07T19:44:46.5546123Z #define __SIZEOF_POINTER__ 8 2025-05-07T19:44:46.5546402Z #define __LP64__ 1 2025-05-07T19:44:46.5546656Z #define __DBL_HAS_QUIET_NAN__ 1 2025-05-07T19:44:46.5547032Z #define __FLT32X_EPSILON__ 2.22044604925031308084726333618164062e-16F32x 2025-05-07T19:44:46.5547421Z #define __DECIMAL_BID_FORMAT__ 1 2025-05-07T19:44:46.5547725Z #define __FLT64_MIN_10_EXP__ (-307) 2025-05-07T19:44:46.5548012Z #define __FLT64X_DECIMAL_DIG__ 21 2025-05-07T19:44:46.5548325Z #define __DEC128_MIN__ 1E-6143DL 2025-05-07T19:44:46.5548620Z #define __REGISTER_PREFIX__ 2025-05-07T19:44:46.5548880Z #define __UINT16_MAX__ 0xffff 2025-05-07T19:44:46.5549170Z #define __LDBL_HAS_INFINITY__ 1 2025-05-07T19:44:46.5549499Z #define __FLT32_MIN__ 1.17549435082228750796873653722224568e-38F32 2025-05-07T19:44:46.5549891Z #define __UINT8_TYPE__ unsigned char 2025-05-07T19:44:46.5550176Z #define __FLT_DIG__ 6 2025-05-07T19:44:46.5550445Z #define __NO_INLINE__ 1 2025-05-07T19:44:46.5550693Z #define __DEC_EVAL_METHOD__ 2 2025-05-07T19:44:46.5551048Z #define __DEC128_MAX__ 9.999999999999999999999999999999999E6144DL 2025-05-07T19:44:46.5551514Z #define __FLT_MANT_DIG__ 24 2025-05-07T19:44:46.5551968Z #define __LDBL_DECIMAL_DIG__ 21 2025-05-07T19:44:46.5552281Z #define __VERSION__ "11.4.0" 2025-05-07T19:44:46.5552592Z #define __UINT64_C(c) c ## UL 2025-05-07T19:44:46.5552916Z #define __cpp_unicode_characters 201411L 2025-05-07T19:44:46.5553233Z #define _STDC_PREDEF_H 1 2025-05-07T19:44:46.5553659Z #define __INT_LEAST32_MAX__ 0x7fffffff 2025-05-07T19:44:46.5553971Z #define __GCC_ATOMIC_INT_LOCK_FREE 2 2025-05-07T19:44:46.5554305Z #define __FLT128_MAX_EXP__ 16384 2025-05-07T19:44:46.5554590Z #define __FLT32_MANT_DIG__ 24 2025-05-07T19:44:46.5554929Z #define __FLOAT_WORD_ORDER__ __ORDER_LITTLE_ENDIAN__ 2025-05-07T19:44:46.5555310Z #define __cpp_aggregate_bases 201603L 2025-05-07T19:44:46.5555614Z #define __FLT128_HAS_DENORM__ 1 2025-05-07T19:44:46.5555919Z #define __FLT32_DECIMAL_DIG__ 9 2025-05-07T19:44:46.5556187Z #define __FLT128_DIG__ 33 2025-05-07T19:44:46.5556464Z #define __INT32_C(c) c 2025-05-07T19:44:46.5556717Z #define __DEC64_EPSILON__ 1E-15DD 2025-05-07T19:44:46.5557036Z #define __ORDER_PDP_ENDIAN__ 3412 2025-05-07T19:44:46.5557329Z #define __DEC128_MIN_EXP__ (-6142) 2025-05-07T19:44:46.5557649Z #define __INT_FAST32_TYPE__ long int 2025-05-07T19:44:46.5558084Z #define __UINT_LEAST16_TYPE__ short unsigned int 2025-05-07T19:44:46.5558444Z #define unix 1 2025-05-07T19:44:46.5558704Z #define __DBL_HAS_DENORM__ 1 2025-05-07T19:44:46.5558982Z #define __cpp_rtti 199711L 2025-05-07T19:44:46.5559299Z #define __SIZE_TYPE__ long unsigned int 2025-05-07T19:44:46.5559633Z #define __UINT64_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:46.5559989Z #define __FLT_IS_IEC_60559__ 2 2025-05-07T19:44:46.5560315Z #define __GNUC_WIDE_EXECUTION_CHARSET_NAME "UTF-32LE" 2025-05-07T19:44:46.5560690Z #define __FLT64X_DIG__ 18 2025-05-07T19:44:46.5560954Z #define __INT8_TYPE__ signed char 2025-05-07T19:44:46.5561284Z #define __cpp_digit_separators 201309L 2025-05-07T19:44:46.5561588Z #define __ELF__ 1 2025-05-07T19:44:46.5561863Z #define __GCC_ASM_FLAG_OUTPUTS__ 1 2025-05-07T19:44:46.5562183Z #define __UINT32_TYPE__ unsigned int 2025-05-07T19:44:46.5562478Z #define __FLT_RADIX__ 2 2025-05-07T19:44:46.5562775Z #define __INT_LEAST16_TYPE__ short int 2025-05-07T19:44:46.5563163Z #define __LDBL_EPSILON__ 1.08420217248550443400745280086994171e-19L 2025-05-07T19:44:46.5563577Z #define __UINTMAX_C(c) c ## UL 2025-05-07T19:44:46.5563871Z #define __GLIBCXX_BITSIZE_INT_N_0 128 2025-05-07T19:44:46.5564299Z #define __k8 1 2025-05-07T19:44:46.5564598Z #define __FLT32X_MIN__ 2.22507385850720138309023271733240406e-308F32x 2025-05-07T19:44:46.5565016Z #define __SIG_ATOMIC_MAX__ 0x7fffffff 2025-05-07T19:44:46.5565339Z #define __GCC_ATOMIC_WCHAR_T_LOCK_FREE 2 2025-05-07T19:44:46.5565645Z #define __SIZEOF_PTRDIFF_T__ 8 2025-05-07T19:44:46.5565935Z #define __LDBL_DIG__ 18 2025-05-07T19:44:46.5566181Z #define __FLT64_IS_IEC_60559__ 2 2025-05-07T19:44:46.5566467Z #define __x86_64__ 1 2025-05-07T19:44:46.5566709Z #define __FLT32X_MIN_EXP__ (-1021) 2025-05-07T19:44:46.5567045Z #define __DEC32_SUBNORMAL_MIN__ 0.000001E-95DF 2025-05-07T19:44:46.5567389Z #define __INT_FAST16_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:46.5567734Z #define __FLT64_DIG__ 15 2025-05-07T19:44:46.5568023Z #define __UINT_FAST32_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:46.5568406Z #define __UINT_LEAST64_TYPE__ long unsigned int 2025-05-07T19:44:46.5568754Z #define __FLT_HAS_QUIET_NAN__ 1 2025-05-07T19:44:46.5569029Z #define __FLT_MAX_10_EXP__ 38 2025-05-07T19:44:46.5569340Z #define __LONG_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:46.5569645Z #define __FLT64X_HAS_DENORM__ 1 2025-05-07T19:44:46.5570042Z #define __DEC128_SUBNORMAL_MIN__ 0.000000000000000000000000000000001E-6143DL 2025-05-07T19:44:46.5570454Z #define __FLT_HAS_INFINITY__ 1 2025-05-07T19:44:46.5570940Z #define __GNUC_EXECUTION_CHARSET_NAME "UTF-8" 2025-05-07T19:44:46.5571277Z #define __cpp_unicode_literals 200710L 2025-05-07T19:44:46.5571633Z #define __UINT_FAST16_TYPE__ long unsigned int 2025-05-07T19:44:46.5571997Z #define __DEC64_MAX__ 9.999999999999999E384DD 2025-05-07T19:44:46.5572313Z #define __INT_FAST32_WIDTH__ 64 2025-05-07T19:44:46.5572628Z #define __CHAR16_TYPE__ short unsigned int 2025-05-07T19:44:46.5572951Z #define __PRAGMA_REDEFINE_EXTNAME 1 2025-05-07T19:44:46.5573274Z #define __SIZE_WIDTH__ 64 2025-05-07T19:44:46.5573525Z #define __SEG_FS 1 2025-05-07T19:44:46.5576430Z #define __INT_LEAST16_MAX__ 0x7fff 2025-05-07T19:44:46.5576724Z #define __DEC64_MANT_DIG__ 16 2025-05-07T19:44:46.5577046Z #define __INT64_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:46.5577350Z #define __SEG_GS 1 2025-05-07T19:44:46.5577715Z #define __FLT32_DENORM_MIN__ 1.40129846432481707092372958328991613e-45F32 2025-05-07T19:44:46.5578146Z #define __SIG_ATOMIC_WIDTH__ 32 2025-05-07T19:44:46.5578441Z #define __INT_LEAST64_TYPE__ long int 2025-05-07T19:44:46.5578774Z #define __INT16_TYPE__ short int 2025-05-07T19:44:46.5579078Z #define __INT_LEAST8_TYPE__ signed char 2025-05-07T19:44:46.5579436Z #define __cpp_structured_bindings 201606L 2025-05-07T19:44:46.5579750Z #define __SIZEOF_INT__ 4 2025-05-07T19:44:46.5580037Z #define __DEC32_MAX_EXP__ 97 2025-05-07T19:44:46.5580314Z #define __INT_FAST8_MAX__ 0x7f 2025-05-07T19:44:46.5580804Z #define __FLT128_MAX__ 1.18973149535723176508575932662800702e+4932F128 2025-05-07T19:44:46.5581247Z #define __INTPTR_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:46.5581598Z #define __cpp_sized_deallocation 201309L 2025-05-07T19:44:46.5581981Z #define __cpp_guaranteed_copy_elision 201606L 2025-05-07T19:44:46.5582310Z #define linux 1 2025-05-07T19:44:46.5582570Z #define __FLT64_HAS_QUIET_NAN__ 1 2025-05-07T19:44:46.5582988Z #define __FLT32_MIN_10_EXP__ (-37) 2025-05-07T19:44:46.5583308Z #define __EXCEPTIONS 1 2025-05-07T19:44:46.5583674Z #define __PTRDIFF_WIDTH__ 64 2025-05-07T19:44:46.5583968Z #define __LDBL_MANT_DIG__ 64 2025-05-07T19:44:46.5584236Z #define __cpp_range_based_for 201603L 2025-05-07T19:44:46.5584562Z #define __FLT64_HAS_INFINITY__ 1 2025-05-07T19:44:46.5584944Z #define __FLT64X_MAX__ 1.18973149535723176502126385303097021e+4932F64x 2025-05-07T19:44:46.5585344Z #define __STDCPP_DEFAULT_NEW_ALIGNMENT__ 16 2025-05-07T19:44:46.5585725Z #define __SIG_ATOMIC_MIN__ (-__SIG_ATOMIC_MAX__ - 1) 2025-05-07T19:44:46.5586064Z #define __code_model_small__ 1 2025-05-07T19:44:46.5586383Z #define __GCC_ATOMIC_LONG_LOCK_FREE 2 2025-05-07T19:44:46.5586694Z #define __cpp_nontype_template_args 201411L 2025-05-07T19:44:46.5587032Z #define __DEC32_MANT_DIG__ 7 2025-05-07T19:44:46.5587310Z #define __cpp_return_type_deduction 201304L 2025-05-07T19:44:46.5587636Z #define __k8__ 1 2025-05-07T19:44:46.5587902Z #define __INTPTR_TYPE__ long int 2025-05-07T19:44:46.5588184Z #define __UINT16_TYPE__ short unsigned int 2025-05-07T19:44:46.5588513Z #define __WCHAR_TYPE__ int 2025-05-07T19:44:46.5588767Z #define __pic__ 2 2025-05-07T19:44:46.5589035Z #define __UINTPTR_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:46.5589344Z #define __INT_FAST64_WIDTH__ 64 2025-05-07T19:44:46.5589643Z #define __cpp_decltype 200707L 2025-05-07T19:44:46.5589928Z #define __INT_FAST64_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:46.5590271Z #define __GCC_ATOMIC_TEST_AND_SET_TRUEVAL 1 2025-05-07T19:44:46.5590976Z #define __FLT_NORM_MAX__ 3.40282346638528859811704183484516925e+38F 2025-05-07T19:44:46.5591546Z #define __FLT64X_MAX_EXP__ 16384 2025-05-07T19:44:46.5591882Z #define __UINT_FAST64_TYPE__ long unsigned int 2025-05-07T19:44:46.5592233Z #define __cpp_inline_variables 201606L 2025-05-07T19:44:46.5592596Z #define __INT_MAX__ 0x7fffffff 2025-05-07T19:44:46.5592867Z #define __linux__ 1 2025-05-07T19:44:46.5593135Z #define __INT64_TYPE__ long int 2025-05-07T19:44:46.5593419Z #define __FLT_MAX_EXP__ 128 2025-05-07T19:44:46.5593723Z #define __ORDER_BIG_ENDIAN__ 4321 2025-05-07T19:44:46.5594014Z #define __DBL_MANT_DIG__ 53 2025-05-07T19:44:46.5594343Z #define __cpp_inheriting_constructors 201511L 2025-05-07T19:44:46.5594681Z #define __SIZEOF_FLOAT128__ 16 2025-05-07T19:44:46.5595021Z #define __INT_LEAST64_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:46.5595390Z #define __DEC64_MIN__ 1E-383DD 2025-05-07T19:44:46.5595681Z #define __WINT_TYPE__ unsigned int 2025-05-07T19:44:46.5596032Z #define __UINT_LEAST32_TYPE__ unsigned int 2025-05-07T19:44:46.5596365Z #define __SIZEOF_SHORT__ 2 2025-05-07T19:44:46.5596750Z #define __FLT32_NORM_MAX__ 3.40282346638528859811704183484516925e+38F32 2025-05-07T19:44:46.5597143Z #define __SSE__ 1 2025-05-07T19:44:46.5597575Z #define __LDBL_MIN_EXP__ (-16381) 2025-05-07T19:44:46.5597944Z #define __FLT64_MAX__ 1.79769313486231570814527423731704357e+308F64 2025-05-07T19:44:46.5598347Z #define __amd64__ 1 2025-05-07T19:44:46.5598620Z #define __WINT_WIDTH__ 32 2025-05-07T19:44:46.5598898Z #define __INT_LEAST64_WIDTH__ 64 2025-05-07T19:44:46.5599233Z #define __LDBL_MAX_EXP__ 16384 2025-05-07T19:44:46.5599520Z #define __FLT32X_MAX_10_EXP__ 308 2025-05-07T19:44:46.5599845Z #define __SIZEOF_INT128__ 16 2025-05-07T19:44:46.5600132Z #define __FLT64X_IS_IEC_60559__ 2 2025-05-07T19:44:46.5600448Z #define __LDBL_MAX_10_EXP__ 4932 2025-05-07T19:44:46.5600740Z #define __ATOMIC_RELAXED 0 2025-05-07T19:44:46.5601129Z #define __DBL_EPSILON__ double(2.22044604925031308084726333618164062e-16L) 2025-05-07T19:44:46.5601627Z #define __FLT128_MIN__ 3.36210314311209350626267781732175260e-4932F128 2025-05-07T19:44:46.5602135Z #define _LP64 1 2025-05-07T19:44:46.5602408Z #define __UINT8_C(c) c 2025-05-07T19:44:46.5602667Z #define __FLT64_MAX_EXP__ 1024 2025-05-07T19:44:46.5602989Z #define __INT_LEAST32_TYPE__ int 2025-05-07T19:44:46.5603281Z #define __SIZEOF_WCHAR_T__ 4 2025-05-07T19:44:46.5603597Z #define __GNUC_PATCHLEVEL__ 0 2025-05-07T19:44:46.5604085Z #define __FLT128_NORM_MAX__ 1.18973149535723176508575932662800702e+4932F128 2025-05-07T19:44:46.5604582Z #define __FLT64_NORM_MAX__ 1.79769313486231570814527423731704357e+308F64 2025-05-07T19:44:46.5604968Z #define __FLT128_HAS_QUIET_NAN__ 1 2025-05-07T19:44:46.5605294Z #define __INTMAX_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:46.5605630Z #define __INT_FAST8_TYPE__ signed char 2025-05-07T19:44:46.5605941Z #define __cpp_namespace_attributes 201411L 2025-05-07T19:44:46.5606345Z #define __FLT64X_MIN__ 3.36210314311209350626267781732175260e-4932F64x 2025-05-07T19:44:46.5606718Z #define __STDCPP_THREADS__ 1 2025-05-07T19:44:46.5607010Z #define __GNUC_STDC_INLINE__ 1 2025-05-07T19:44:46.5607282Z #define __FLT64_HAS_DENORM__ 1 2025-05-07T19:44:46.5607645Z #define __FLT32_EPSILON__ 1.19209289550781250000000000000000000e-7F32 2025-05-07T19:44:46.5608016Z #define __DBL_DECIMAL_DIG__ 17 2025-05-07T19:44:46.5608299Z #define __STDC_UTF_32__ 1 2025-05-07T19:44:46.5608575Z #define __INT_FAST8_WIDTH__ 8 2025-05-07T19:44:46.5608830Z #define __FXSR__ 1 2025-05-07T19:44:46.5609146Z #define __FLT32X_MAX__ 1.79769313486231570814527423731704357e+308F32x 2025-05-07T19:44:46.5609601Z #define __DBL_NORM_MAX__ double(1.79769313486231570814527423731704357e+308L) 2025-05-07T19:44:46.5610034Z #define __BYTE_ORDER__ __ORDER_LITTLE_ENDIAN__ 2025-05-07T19:44:46.5610344Z #define __INTMAX_WIDTH__ 64 2025-05-07T19:44:46.5610634Z #define __cpp_runtime_arrays 198712L 2025-05-07T19:44:46.5610929Z #define __UINT64_TYPE__ long unsigned int 2025-05-07T19:44:46.5611254Z #define __UINT32_C(c) c ## U 2025-05-07T19:44:46.5611549Z #define __cpp_alias_templates 200704L 2025-05-07T19:44:46.5611911Z #define __FLT_DENORM_MIN__ 1.40129846432481707092372958328991613e-45F 2025-05-07T19:44:46.5612479Z #define __FLT128_IS_IEC_60559__ 2 2025-05-07T19:44:46.5612830Z #define __INT8_MAX__ 0x7f 2025-05-07T19:44:46.5613119Z #define __LONG_WIDTH__ 64 2025-05-07T19:44:46.5613377Z #define __PIC__ 2 2025-05-07T19:44:46.5613672Z #define __UINT_FAST32_TYPE__ long unsigned int 2025-05-07T19:44:46.5614087Z #define __FLT32X_NORM_MAX__ 1.79769313486231570814527423731704357e+308F32x 2025-05-07T19:44:46.5614523Z #define __CHAR32_TYPE__ unsigned int 2025-05-07T19:44:46.5614871Z #define __FLT_MAX__ 3.40282346638528859811704183484516925e+38F 2025-05-07T19:44:46.5615266Z #define __cpp_constexpr 201603L 2025-05-07T19:44:46.5615570Z #define __SSE2__ 1 2025-05-07T19:44:46.5615820Z #define __cpp_deduction_guides 201703L 2025-05-07T19:44:46.5616152Z #define __INT32_TYPE__ int 2025-05-07T19:44:46.5616438Z #define __SIZEOF_DOUBLE__ 8 2025-05-07T19:44:46.5616740Z #define __cpp_exceptions 199711L 2025-05-07T19:44:46.5617035Z #define __FLT_MIN_10_EXP__ (-37) 2025-05-07T19:44:46.5617422Z #define __FLT64_MIN__ 2.22507385850720138309023271733240406e-308F64 2025-05-07T19:44:46.5617891Z #define __INT_LEAST32_WIDTH__ 32 2025-05-07T19:44:46.5618213Z #define __INTMAX_TYPE__ long int 2025-05-07T19:44:46.5618532Z #define __DEC128_MAX_EXP__ 6145 2025-05-07T19:44:46.5618825Z #define __FLT32X_HAS_QUIET_NAN__ 1 2025-05-07T19:44:46.5619148Z #define __ATOMIC_CONSUME 1 2025-05-07T19:44:46.5619416Z #define __GNUC_MINOR__ 4 2025-05-07T19:44:46.5619720Z #define __GLIBCXX_TYPE_INT_N_0 __int128 2025-05-07T19:44:46.5620039Z #define __INT_FAST16_WIDTH__ 64 2025-05-07T19:44:46.5620382Z #define __UINTMAX_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:46.5620706Z #define __PIE__ 2 2025-05-07T19:44:46.5621085Z #define __FLT32X_DENORM_MIN__ 4.94065645841246544176568792868221372e-324F32x 2025-05-07T19:44:46.5621529Z #define __cpp_template_template_args 201611L 2025-05-07T19:44:46.5621883Z #define __DBL_MAX_10_EXP__ 308 2025-05-07T19:44:46.5622343Z #define __LDBL_DENORM_MIN__ 3.64519953188247460252840593361941982e-4951L 2025-05-07T19:44:46.5622737Z #define __INT16_C(c) c 2025-05-07T19:44:46.5623012Z #define __STDC__ 1 2025-05-07T19:44:46.5623254Z #define __FLT32X_DIG__ 15 2025-05-07T19:44:46.5623557Z #define __PTRDIFF_TYPE__ long int 2025-05-07T19:44:46.5624017Z #define __ATOMIC_SEQ_CST 5 2025-05-07T19:44:46.5624329Z #define __FLT32X_MIN_10_EXP__ (-307) 2025-05-07T19:44:46.5624661Z #define __UINTPTR_TYPE__ long unsigned int 2025-05-07T19:44:46.5625060Z #define __DEC64_SUBNORMAL_MIN__ 0.000000000000001E-383DD 2025-05-07T19:44:46.5625446Z #define __DEC128_MANT_DIG__ 34 2025-05-07T19:44:46.5625729Z #define __LDBL_MIN_10_EXP__ (-4931) 2025-05-07T19:44:46.5626062Z #define __cpp_generic_lambdas 201304L 2025-05-07T19:44:46.5626359Z #define __SSE_MATH__ 1 2025-05-07T19:44:46.5626635Z #define __SIZEOF_LONG_LONG__ 8 2025-05-07T19:44:46.5626940Z #define __cpp_user_defined_literals 200809L 2025-05-07T19:44:46.5627291Z #define __FLT128_DECIMAL_DIG__ 36 2025-05-07T19:44:46.5627596Z #define __GCC_ATOMIC_LLONG_LOCK_FREE 2 2025-05-07T19:44:46.5627933Z #define __FLT32_HAS_QUIET_NAN__ 1 2025-05-07T19:44:46.5628220Z #define __FLT_DECIMAL_DIG__ 9 2025-05-07T19:44:46.5628566Z #define __UINT_FAST16_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:46.5629018Z #define __LDBL_NORM_MAX__ 1.18973149535723176502126385303097021e+4932L 2025-05-07T19:44:46.5629422Z #define __GCC_ATOMIC_SHORT_LOCK_FREE 2 2025-05-07T19:44:46.5629792Z #define __UINT_FAST8_TYPE__ unsigned char 2025-05-07T19:44:46.5630107Z #define _GNU_SOURCE 1 2025-05-07T19:44:46.5630397Z #define __cpp_init_captures 201304L 2025-05-07T19:44:46.5630707Z #define __ATOMIC_ACQ_REL 4 2025-05-07T19:44:46.5630999Z #define __ATOMIC_RELEASE 3 2025-05-07T19:44:46.5631262Z 2025-05-07T19:44:46.6176504Z 2025-05-07T19:44:46.6177187Z + conda run -n build_binary c++ --version 2025-05-07T19:44:46.6177482Z 2025-05-07T19:44:48.4199116Z c++ (conda-forge gcc 11.4.0-13) 11.4.0 2025-05-07T19:44:48.4199507Z Copyright (C) 2021 Free Software Foundation, Inc. 2025-05-07T19:44:48.4200015Z This is free software; see the source for copying conditions. There is NO 2025-05-07T19:44:48.4200589Z warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. 2025-05-07T19:44:48.4200945Z 2025-05-07T19:44:48.4200958Z 2025-05-07T19:44:48.4990812Z 2025-05-07T19:44:48.4991647Z [INFO] Printing the default version of the C standard used by the compiler ... 2025-05-07T19:44:48.4992255Z + conda run -n build_binary cc -dM -E - < /dev/null | grep __STDC_VERSION__ 2025-05-07T19:44:48.4992576Z 2025-05-07T19:44:50.3803804Z #define __STDC_VERSION__ 201710L 2025-05-07T19:44:50.3806862Z 2025-05-07T19:44:50.3807155Z [INFO] Printing the default version of the C++ standard used by the compiler ... 2025-05-07T19:44:50.3807741Z + conda run -n build_binary c++ -dM -E -x c++ - < /dev/null | grep __cplusplus 2025-05-07T19:44:50.3808069Z 2025-05-07T19:44:52.2149431Z #define __cplusplus 201703L 2025-05-07T19:44:52.2150019Z 2025-05-07T19:44:52.2150358Z [INSTALL] Successfully installed C/C++ compilers 2025-05-07T19:44:52.2216392Z ##[group]Run . $PRELUDE; install_build_tools $BUILD_ENV 2025-05-07T19:44:52.2216920Z . $PRELUDE; install_build_tools $BUILD_ENV 2025-05-07T19:44:52.2217740Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:44:52.2218109Z env: 2025-05-07T19:44:52.2218384Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:44:52.2218704Z BUILD_ENV: build_binary 2025-05-07T19:44:52.2218989Z BUILD_TARGET: default 2025-05-07T19:44:52.2219236Z BUILD_VARIANT: cuda 2025-05-07T19:44:52.2219518Z BUILD_CUDA_VERSION: 11.8.0 2025-05-07T19:44:52.2219781Z ##[endgroup] 2025-05-07T19:44:52.6782184Z ################################################################################ 2025-05-07T19:44:52.6783213Z # Install Build Tools 2025-05-07T19:44:52.6783834Z # 2025-05-07T19:44:52.6799295Z # [2025-05-07T19:44:52.679Z] + install_build_tools build_binary 2025-05-07T19:44:52.6800464Z ################################################################################ 2025-05-07T19:44:52.6801238Z 2025-05-07T19:44:52.6819245Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:44:52.7675813Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:44:52.7680389Z [INSTALL] Installing build tools ... 2025-05-07T19:44:52.7707354Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y auditwheel bazel cmake>=3.30 hypothesis jinja2 make ncurses ninja openblas patchelf rhash scikit-build wheel pyyaml 2025-05-07T19:44:53.4914356Z Channels: 2025-05-07T19:44:53.4915007Z - conda-forge 2025-05-07T19:44:53.4915657Z Platform: linux-64 2025-05-07T19:44:56.4822525Z Collecting package metadata (repodata.json): - \ | / done 2025-05-07T19:44:59.7212514Z Solving environment: \ | / done 2025-05-07T19:44:59.7727547Z 2025-05-07T19:44:59.7728116Z ## Package Plan ## 2025-05-07T19:44:59.7728568Z 2025-05-07T19:44:59.7729174Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:44:59.7730071Z 2025-05-07T19:44:59.7730384Z added / updated specs: 2025-05-07T19:44:59.7731105Z - auditwheel 2025-05-07T19:44:59.7731520Z - bazel 2025-05-07T19:44:59.7731768Z - cmake[version='>=3.30'] 2025-05-07T19:44:59.7732030Z - hypothesis 2025-05-07T19:44:59.7732266Z - jinja2 2025-05-07T19:44:59.7732472Z - make 2025-05-07T19:44:59.7732688Z - ncurses 2025-05-07T19:44:59.7732993Z - ninja 2025-05-07T19:44:59.7733205Z - openblas 2025-05-07T19:44:59.7733447Z - patchelf 2025-05-07T19:44:59.7733665Z - pyyaml 2025-05-07T19:44:59.7733894Z - rhash 2025-05-07T19:44:59.7734101Z - scikit-build 2025-05-07T19:44:59.7734344Z - wheel 2025-05-07T19:44:59.7734462Z 2025-05-07T19:44:59.7734466Z 2025-05-07T19:44:59.7734630Z The following packages will be downloaded: 2025-05-07T19:44:59.7734978Z 2025-05-07T19:44:59.7735098Z package | build 2025-05-07T19:44:59.7735433Z ---------------------------|----------------- 2025-05-07T19:44:59.7735811Z alsa-lib-1.2.14 | hb9d3cd8_0 553 KB conda-forge 2025-05-07T19:44:59.7736244Z attrs-25.3.0 | pyh71513ae_0 56 KB conda-forge 2025-05-07T19:44:59.7736684Z auditwheel-6.2.0 | pyha804496_1 40 KB conda-forge 2025-05-07T19:44:59.7737094Z bazel-7.5.0 | h96810dc_2 47.4 MB conda-forge 2025-05-07T19:44:59.7737500Z c-ares-1.34.5 | hb9d3cd8_0 202 KB conda-forge 2025-05-07T19:44:59.7737881Z cairo-1.18.4 | h3394656_0 955 KB conda-forge 2025-05-07T19:44:59.7738277Z click-8.1.8 | pyh707e725_0 83 KB conda-forge 2025-05-07T19:44:59.7738661Z cmake-4.0.2 | h74e3db0_0 19.4 MB conda-forge 2025-05-07T19:44:59.7739072Z distro-1.9.0 | pyhd8ed1ab_1 41 KB conda-forge 2025-05-07T19:44:59.7739840Z exceptiongroup-1.2.2 | pyhd8ed1ab_1 20 KB conda-forge 2025-05-07T19:44:59.7740271Z expat-2.7.0 | h5888daf_0 137 KB conda-forge 2025-05-07T19:44:59.7740852Z font-ttf-dejavu-sans-mono-2.37| hab24e00_0 388 KB conda-forge 2025-05-07T19:44:59.7741350Z font-ttf-inconsolata-3.000 | h77eed37_0 94 KB conda-forge 2025-05-07T19:44:59.7741864Z font-ttf-source-code-pro-2.038| h77eed37_0 684 KB conda-forge 2025-05-07T19:44:59.7742334Z font-ttf-ubuntu-0.83 | h77eed37_3 1.5 MB conda-forge 2025-05-07T19:44:59.7742778Z fontconfig-2.15.0 | h7e30c49_1 259 KB conda-forge 2025-05-07T19:44:59.7743240Z fonts-conda-ecosystem-1 | 0 4 KB conda-forge 2025-05-07T19:44:59.7743695Z fonts-conda-forge-1 | 0 4 KB conda-forge 2025-05-07T19:44:59.7744131Z freetype-2.13.3 | ha770c72_1 168 KB conda-forge 2025-05-07T19:44:59.7744527Z giflib-5.2.2 | hd590300_0 75 KB conda-forge 2025-05-07T19:44:59.7744941Z graphite2-1.3.13 | h59595ed_1003 95 KB conda-forge 2025-05-07T19:44:59.7745371Z harfbuzz-11.0.0 | h76408a6_0 1.6 MB conda-forge 2025-05-07T19:44:59.7745793Z hypothesis-6.131.14 | pyha770c72_0 348 KB conda-forge 2025-05-07T19:44:59.7746205Z icu-75.1 | he02047a_0 11.6 MB conda-forge 2025-05-07T19:44:59.7746565Z ijar-7.5.0 | h5888daf_0 114 KB conda-forge 2025-05-07T19:44:59.7746961Z jinja2-3.1.6 | pyhd8ed1ab_0 110 KB conda-forge 2025-05-07T19:44:59.7747375Z keyutils-1.6.1 | h166bdaf_0 115 KB conda-forge 2025-05-07T19:44:59.7747804Z krb5-1.21.3 | h659f571_0 1.3 MB conda-forge 2025-05-07T19:44:59.7748206Z lcms2-2.17 | h717163a_0 242 KB conda-forge 2025-05-07T19:44:59.7748597Z lerc-4.0.0 | h0aef613_1 258 KB conda-forge 2025-05-07T19:44:59.7749056Z libabseil-20250127.1 | cxx17_hbbce691_0 1.3 MB conda-forge 2025-05-07T19:44:59.7749504Z libcups-2.3.3 | h4637d8d_4 4.3 MB conda-forge 2025-05-07T19:44:59.7749935Z libcurl-8.13.0 | h332b0f4_0 428 KB conda-forge 2025-05-07T19:44:59.7750355Z libdeflate-1.23 | h86f0d12_0 71 KB conda-forge 2025-05-07T19:44:59.7750823Z libedit-3.1.20250104 | pl5321h7949ede_0 132 KB conda-forge 2025-05-07T19:44:59.7751380Z libev-4.33 | hd590300_2 110 KB conda-forge 2025-05-07T19:44:59.7751967Z libexpat-2.7.0 | h5888daf_0 73 KB conda-forge 2025-05-07T19:44:59.7752438Z libfreetype-2.13.3 | ha770c72_1 8 KB conda-forge 2025-05-07T19:44:59.7752911Z libfreetype6-2.13.3 | h48d6fc4_1 371 KB conda-forge 2025-05-07T19:44:59.7753409Z libgfortran-15.1.0 | h69a702a_2 34 KB conda-forge 2025-05-07T19:44:59.7753918Z libgfortran5-15.1.0 | hcea5267_2 1.5 MB conda-forge 2025-05-07T19:44:59.7754382Z libglib-2.84.0 | h2ff4ddf_0 3.8 MB conda-forge 2025-05-07T19:44:59.7754824Z libgrpc-1.71.0 | h8e591d7_1 7.6 MB conda-forge 2025-05-07T19:44:59.7755248Z libiconv-1.18 | h4ce23a2_1 696 KB conda-forge 2025-05-07T19:44:59.7755713Z libjpeg-turbo-3.1.0 | hb9d3cd8_0 614 KB conda-forge 2025-05-07T19:44:59.7756161Z liblzma-5.8.1 | hb9d3cd8_1 110 KB conda-forge 2025-05-07T19:44:59.7756623Z liblzma-devel-5.8.1 | hb9d3cd8_1 431 KB conda-forge 2025-05-07T19:44:59.7757101Z libnghttp2-1.64.0 | h161d5f1_0 632 KB conda-forge 2025-05-07T19:44:59.7757735Z libopenblas-0.3.29 |pthreads_h94d23a6_0 5.6 MB conda-forge 2025-05-07T19:44:59.7758309Z libpng-1.6.47 | h943b412_0 282 KB conda-forge 2025-05-07T19:44:59.7758785Z libprotobuf-5.29.3 | h501fc15_1 3.2 MB conda-forge 2025-05-07T19:44:59.7759225Z libre2-11-2024.07.02 | hba17884_3 205 KB conda-forge 2025-05-07T19:44:59.7759660Z libsqlite-3.49.2 | hee588c1_0 895 KB conda-forge 2025-05-07T19:44:59.7760060Z libssh2-1.11.1 | hcf80075_0 298 KB conda-forge 2025-05-07T19:44:59.7760469Z libtiff-4.7.0 | hd9ff511_4 419 KB conda-forge 2025-05-07T19:44:59.7760860Z libuuid-2.38.1 | h0b41bf4_0 33 KB conda-forge 2025-05-07T19:44:59.7761266Z libuv-1.50.0 | hb9d3cd8_0 870 KB conda-forge 2025-05-07T19:44:59.7761673Z libwebp-base-1.5.0 | h851e524_0 420 KB conda-forge 2025-05-07T19:44:59.7762091Z libxcb-1.17.0 | h8a09558_0 387 KB conda-forge 2025-05-07T19:44:59.7762498Z libzlib-1.3.1 | hb9d3cd8_2 60 KB conda-forge 2025-05-07T19:44:59.7762875Z make-4.4.1 | hb9d3cd8_2 501 KB conda-forge 2025-05-07T19:44:59.7763291Z markupsafe-3.0.2 | py313h8060acc_1 24 KB conda-forge 2025-05-07T19:44:59.7763695Z ncurses-6.5 | h2d0b736_3 871 KB conda-forge 2025-05-07T19:44:59.7764091Z ninja-1.12.1 | hff21bea_1 158 KB conda-forge 2025-05-07T19:44:59.7764500Z openblas-0.3.29 |pthreads_h6ec200e_0 5.8 MB conda-forge 2025-05-07T19:44:59.7764936Z openjdk-23.0.2 | h53dfc1b_2 181.4 MB conda-forge 2025-05-07T19:44:59.7765354Z packaging-25.0 | pyh29332c3_1 61 KB conda-forge 2025-05-07T19:44:59.7765766Z patchelf-0.18.0 | h3f2d84a_2 133 KB conda-forge 2025-05-07T19:44:59.7766172Z pcre2-10.44 | hc749103_2 934 KB conda-forge 2025-05-07T19:44:59.7766559Z pixman-0.46.0 | h29eaf8c_0 389 KB conda-forge 2025-05-07T19:44:59.7766986Z pthread-stubs-0.4 | hb9d3cd8_1002 8 KB conda-forge 2025-05-07T19:44:59.7767434Z pyelftools-0.32 | pyh707e725_1 146 KB conda-forge 2025-05-07T19:44:59.7767854Z python-3.13.2 |hf636f53_101_cp313 31.7 MB conda-forge 2025-05-07T19:44:59.7768280Z pyyaml-6.0.2 | py313h8060acc_2 201 KB conda-forge 2025-05-07T19:44:59.7768675Z re2-2024.07.02 | h9925aae_3 26 KB conda-forge 2025-05-07T19:44:59.7769075Z rhash-1.4.5 | hb9d3cd8_0 183 KB conda-forge 2025-05-07T19:44:59.7769488Z scikit-build-0.18.1 | pyhae55e72_2 114 KB conda-forge 2025-05-07T19:44:59.7769929Z singlejar-7.5.0 | h0e684df_1 122 KB conda-forge 2025-05-07T19:44:59.7770389Z sortedcontainers-2.4.0 | pyhd8ed1ab_1 28 KB conda-forge 2025-05-07T19:44:59.7770821Z sqlite-3.49.2 | h9eae976_0 840 KB conda-forge 2025-05-07T19:44:59.7771220Z tk-8.6.13 |noxft_h4845f30_101 3.2 MB conda-forge 2025-05-07T19:44:59.7771603Z tomli-2.2.1 | pyhd8ed1ab_1 19 KB conda-forge 2025-05-07T19:44:59.7772020Z wheel-0.45.1 | pyhd8ed1ab_1 61 KB conda-forge 2025-05-07T19:44:59.7772425Z xorg-libice-1.1.2 | hb9d3cd8_0 57 KB conda-forge 2025-05-07T19:44:59.7772859Z xorg-libsm-1.2.6 | he73a12e_0 27 KB conda-forge 2025-05-07T19:44:59.7773294Z xorg-libx11-1.8.12 | h4f16b4b_0 816 KB conda-forge 2025-05-07T19:44:59.7773805Z xorg-libxau-1.0.12 | hb9d3cd8_0 14 KB conda-forge 2025-05-07T19:44:59.7774266Z xorg-libxdmcp-1.1.5 | hb9d3cd8_0 19 KB conda-forge 2025-05-07T19:44:59.7774764Z xorg-libxext-1.3.6 | hb9d3cd8_0 49 KB conda-forge 2025-05-07T19:44:59.7775225Z xorg-libxfixes-6.0.1 | hb9d3cd8_0 19 KB conda-forge 2025-05-07T19:44:59.7775672Z xorg-libxi-1.8.2 | hb9d3cd8_0 46 KB conda-forge 2025-05-07T19:44:59.7776104Z xorg-libxrandr-1.5.4 | hb9d3cd8_0 29 KB conda-forge 2025-05-07T19:44:59.7776580Z xorg-libxrender-0.9.12 | hb9d3cd8_0 32 KB conda-forge 2025-05-07T19:44:59.7777020Z xorg-libxt-1.3.1 | hb9d3cd8_0 371 KB conda-forge 2025-05-07T19:44:59.7777465Z xorg-libxtst-1.2.5 | hb9d3cd8_3 32 KB conda-forge 2025-05-07T19:44:59.7777870Z xz-5.8.1 | hbcc6ac9_1 23 KB conda-forge 2025-05-07T19:44:59.7778288Z xz-gpl-tools-5.8.1 | hbcc6ac9_1 33 KB conda-forge 2025-05-07T19:44:59.7778726Z xz-tools-5.8.1 | hb9d3cd8_1 94 KB conda-forge 2025-05-07T19:44:59.7779114Z yaml-0.2.5 | h7f98852_2 87 KB conda-forge 2025-05-07T19:44:59.7779505Z zlib-1.3.1 | hb9d3cd8_2 90 KB conda-forge 2025-05-07T19:44:59.7779875Z zstd-1.5.7 | hb8e6e7a_2 554 KB conda-forge 2025-05-07T19:44:59.7780274Z ------------------------------------------------------------ 2025-05-07T19:44:59.7780607Z Total: 351.6 MB 2025-05-07T19:44:59.7780834Z 2025-05-07T19:44:59.7780956Z The following NEW packages will be INSTALLED: 2025-05-07T19:44:59.7781176Z 2025-05-07T19:44:59.7781406Z alsa-lib conda-forge/linux-64::alsa-lib-1.2.14-hb9d3cd8_0 2025-05-07T19:44:59.7781835Z attrs conda-forge/noarch::attrs-25.3.0-pyh71513ae_0 2025-05-07T19:44:59.7782298Z auditwheel conda-forge/noarch::auditwheel-6.2.0-pyha804496_1 2025-05-07T19:44:59.7782727Z bazel conda-forge/linux-64::bazel-7.5.0-h96810dc_2 2025-05-07T19:44:59.7783139Z c-ares conda-forge/linux-64::c-ares-1.34.5-hb9d3cd8_0 2025-05-07T19:44:59.7783546Z cairo conda-forge/linux-64::cairo-1.18.4-h3394656_0 2025-05-07T19:44:59.7783935Z click conda-forge/noarch::click-8.1.8-pyh707e725_0 2025-05-07T19:44:59.7784336Z cmake conda-forge/linux-64::cmake-4.0.2-h74e3db0_0 2025-05-07T19:44:59.7784729Z distro conda-forge/noarch::distro-1.9.0-pyhd8ed1ab_1 2025-05-07T19:44:59.7785214Z exceptiongroup conda-forge/noarch::exceptiongroup-1.2.2-pyhd8ed1ab_1 2025-05-07T19:44:59.7785796Z font-ttf-dejavu-s~ conda-forge/noarch::font-ttf-dejavu-sans-mono-2.37-hab24e00_0 2025-05-07T19:44:59.7786384Z font-ttf-inconsol~ conda-forge/noarch::font-ttf-inconsolata-3.000-h77eed37_0 2025-05-07T19:44:59.7786975Z font-ttf-source-c~ conda-forge/noarch::font-ttf-source-code-pro-2.038-h77eed37_0 2025-05-07T19:44:59.7787525Z font-ttf-ubuntu conda-forge/noarch::font-ttf-ubuntu-0.83-h77eed37_3 2025-05-07T19:44:59.7790368Z fontconfig conda-forge/linux-64::fontconfig-2.15.0-h7e30c49_1 2025-05-07T19:44:59.7791372Z fonts-conda-ecosy~ conda-forge/noarch::fonts-conda-ecosystem-1-0 2025-05-07T19:44:59.7791885Z fonts-conda-forge conda-forge/noarch::fonts-conda-forge-1-0 2025-05-07T19:44:59.7792384Z freetype conda-forge/linux-64::freetype-2.13.3-ha770c72_1 2025-05-07T19:44:59.7792834Z giflib conda-forge/linux-64::giflib-5.2.2-hd590300_0 2025-05-07T19:44:59.7809917Z graphite2 conda-forge/linux-64::graphite2-1.3.13-h59595ed_1003 2025-05-07T19:44:59.7810538Z harfbuzz conda-forge/linux-64::harfbuzz-11.0.0-h76408a6_0 2025-05-07T19:44:59.7811282Z hypothesis conda-forge/noarch::hypothesis-6.131.14-pyha770c72_0 2025-05-07T19:44:59.7811744Z icu conda-forge/linux-64::icu-75.1-he02047a_0 2025-05-07T19:44:59.7812120Z ijar conda-forge/linux-64::ijar-7.5.0-h5888daf_0 2025-05-07T19:44:59.7812635Z jinja2 conda-forge/noarch::jinja2-3.1.6-pyhd8ed1ab_0 2025-05-07T19:44:59.7813076Z keyutils conda-forge/linux-64::keyutils-1.6.1-h166bdaf_0 2025-05-07T19:44:59.7813479Z krb5 conda-forge/linux-64::krb5-1.21.3-h659f571_0 2025-05-07T19:44:59.7813882Z lcms2 conda-forge/linux-64::lcms2-2.17-h717163a_0 2025-05-07T19:44:59.7814261Z lerc conda-forge/linux-64::lerc-4.0.0-h0aef613_1 2025-05-07T19:44:59.7814723Z libabseil conda-forge/linux-64::libabseil-20250127.1-cxx17_hbbce691_0 2025-05-07T19:44:59.7815217Z libcups conda-forge/linux-64::libcups-2.3.3-h4637d8d_4 2025-05-07T19:44:59.7815637Z libcurl conda-forge/linux-64::libcurl-8.13.0-h332b0f4_0 2025-05-07T19:44:59.7816096Z libdeflate conda-forge/linux-64::libdeflate-1.23-h86f0d12_0 2025-05-07T19:44:59.7816568Z libedit conda-forge/linux-64::libedit-3.1.20250104-pl5321h7949ede_0 2025-05-07T19:44:59.7817043Z libev conda-forge/linux-64::libev-4.33-hd590300_2 2025-05-07T19:44:59.7817468Z libexpat conda-forge/linux-64::libexpat-2.7.0-h5888daf_0 2025-05-07T19:44:59.7817928Z libfreetype conda-forge/linux-64::libfreetype-2.13.3-ha770c72_1 2025-05-07T19:44:59.7818435Z libfreetype6 conda-forge/linux-64::libfreetype6-2.13.3-h48d6fc4_1 2025-05-07T19:44:59.7818922Z libgfortran conda-forge/linux-64::libgfortran-15.1.0-h69a702a_2 2025-05-07T19:44:59.7819427Z libgfortran5 conda-forge/linux-64::libgfortran5-15.1.0-hcea5267_2 2025-05-07T19:44:59.7819901Z libglib conda-forge/linux-64::libglib-2.84.0-h2ff4ddf_0 2025-05-07T19:44:59.7820503Z libgrpc conda-forge/linux-64::libgrpc-1.71.0-h8e591d7_1 2025-05-07T19:44:59.7820969Z libiconv conda-forge/linux-64::libiconv-1.18-h4ce23a2_1 2025-05-07T19:44:59.7821456Z libjpeg-turbo conda-forge/linux-64::libjpeg-turbo-3.1.0-hb9d3cd8_0 2025-05-07T19:44:59.7821956Z liblzma conda-forge/linux-64::liblzma-5.8.1-hb9d3cd8_1 2025-05-07T19:44:59.7822444Z liblzma-devel conda-forge/linux-64::liblzma-devel-5.8.1-hb9d3cd8_1 2025-05-07T19:44:59.7822964Z libnghttp2 conda-forge/linux-64::libnghttp2-1.64.0-h161d5f1_0 2025-05-07T19:44:59.7823688Z libopenblas conda-forge/linux-64::libopenblas-0.3.29-pthreads_h94d23a6_0 2025-05-07T19:44:59.7824206Z libpng conda-forge/linux-64::libpng-1.6.47-h943b412_0 2025-05-07T19:44:59.7824699Z libprotobuf conda-forge/linux-64::libprotobuf-5.29.3-h501fc15_1 2025-05-07T19:44:59.7825214Z libre2-11 conda-forge/linux-64::libre2-11-2024.07.02-hba17884_3 2025-05-07T19:44:59.7825696Z libsqlite conda-forge/linux-64::libsqlite-3.49.2-hee588c1_0 2025-05-07T19:44:59.7826179Z libssh2 conda-forge/linux-64::libssh2-1.11.1-hcf80075_0 2025-05-07T19:44:59.7826627Z libtiff conda-forge/linux-64::libtiff-4.7.0-hd9ff511_4 2025-05-07T19:44:59.7827084Z libuv conda-forge/linux-64::libuv-1.50.0-hb9d3cd8_0 2025-05-07T19:44:59.7827555Z libwebp-base conda-forge/linux-64::libwebp-base-1.5.0-h851e524_0 2025-05-07T19:44:59.7828051Z libxcb conda-forge/linux-64::libxcb-1.17.0-h8a09558_0 2025-05-07T19:44:59.7828515Z libzlib conda-forge/linux-64::libzlib-1.3.1-hb9d3cd8_2 2025-05-07T19:44:59.7828940Z make conda-forge/linux-64::make-4.4.1-hb9d3cd8_2 2025-05-07T19:44:59.7829422Z markupsafe conda-forge/linux-64::markupsafe-3.0.2-py313h8060acc_1 2025-05-07T19:44:59.7829906Z ninja conda-forge/linux-64::ninja-1.12.1-hff21bea_1 2025-05-07T19:44:59.7830407Z openblas conda-forge/linux-64::openblas-0.3.29-pthreads_h6ec200e_0 2025-05-07T19:44:59.7832248Z openjdk conda-forge/linux-64::openjdk-23.0.2-h53dfc1b_2 2025-05-07T19:44:59.7832750Z packaging conda-forge/noarch::packaging-25.0-pyh29332c3_1 2025-05-07T19:44:59.7833328Z patchelf conda-forge/linux-64::patchelf-0.18.0-h3f2d84a_2 2025-05-07T19:44:59.7833775Z pcre2 conda-forge/linux-64::pcre2-10.44-hc749103_2 2025-05-07T19:44:59.7834226Z pixman conda-forge/linux-64::pixman-0.46.0-h29eaf8c_0 2025-05-07T19:44:59.7834745Z pthread-stubs conda-forge/linux-64::pthread-stubs-0.4-hb9d3cd8_1002 2025-05-07T19:44:59.7835268Z pyelftools conda-forge/noarch::pyelftools-0.32-pyh707e725_1 2025-05-07T19:44:59.7835766Z pyyaml conda-forge/linux-64::pyyaml-6.0.2-py313h8060acc_2 2025-05-07T19:44:59.7836201Z re2 conda-forge/linux-64::re2-2024.07.02-h9925aae_3 2025-05-07T19:44:59.7836629Z rhash conda-forge/linux-64::rhash-1.4.5-hb9d3cd8_0 2025-05-07T19:44:59.7837127Z scikit-build conda-forge/noarch::scikit-build-0.18.1-pyhae55e72_2 2025-05-07T19:44:59.7837629Z singlejar conda-forge/linux-64::singlejar-7.5.0-h0e684df_1 2025-05-07T19:44:59.7838182Z sortedcontainers conda-forge/noarch::sortedcontainers-2.4.0-pyhd8ed1ab_1 2025-05-07T19:44:59.7838706Z tomli conda-forge/noarch::tomli-2.2.1-pyhd8ed1ab_1 2025-05-07T19:44:59.7839191Z xorg-libice conda-forge/linux-64::xorg-libice-1.1.2-hb9d3cd8_0 2025-05-07T19:44:59.7839697Z xorg-libsm conda-forge/linux-64::xorg-libsm-1.2.6-he73a12e_0 2025-05-07T19:44:59.7840184Z xorg-libx11 conda-forge/linux-64::xorg-libx11-1.8.12-h4f16b4b_0 2025-05-07T19:44:59.7840701Z xorg-libxau conda-forge/linux-64::xorg-libxau-1.0.12-hb9d3cd8_0 2025-05-07T19:44:59.7841227Z xorg-libxdmcp conda-forge/linux-64::xorg-libxdmcp-1.1.5-hb9d3cd8_0 2025-05-07T19:44:59.7841766Z xorg-libxext conda-forge/linux-64::xorg-libxext-1.3.6-hb9d3cd8_0 2025-05-07T19:44:59.7842320Z xorg-libxfixes conda-forge/linux-64::xorg-libxfixes-6.0.1-hb9d3cd8_0 2025-05-07T19:44:59.7842840Z xorg-libxi conda-forge/linux-64::xorg-libxi-1.8.2-hb9d3cd8_0 2025-05-07T19:44:59.7843375Z xorg-libxrandr conda-forge/linux-64::xorg-libxrandr-1.5.4-hb9d3cd8_0 2025-05-07T19:44:59.7844060Z xorg-libxrender conda-forge/linux-64::xorg-libxrender-0.9.12-hb9d3cd8_0 2025-05-07T19:44:59.7844605Z xorg-libxt conda-forge/linux-64::xorg-libxt-1.3.1-hb9d3cd8_0 2025-05-07T19:44:59.7845112Z xorg-libxtst conda-forge/linux-64::xorg-libxtst-1.2.5-hb9d3cd8_3 2025-05-07T19:44:59.7845612Z xz-gpl-tools conda-forge/linux-64::xz-gpl-tools-5.8.1-hbcc6ac9_1 2025-05-07T19:44:59.7846099Z xz-tools conda-forge/linux-64::xz-tools-5.8.1-hb9d3cd8_1 2025-05-07T19:44:59.7846519Z yaml conda-forge/linux-64::yaml-0.2.5-h7f98852_2 2025-05-07T19:44:59.7846931Z zstd conda-forge/linux-64::zstd-1.5.7-hb8e6e7a_2 2025-05-07T19:44:59.7847184Z 2025-05-07T19:44:59.7847300Z The following packages will be UPDATED: 2025-05-07T19:44:59.7847527Z 2025-05-07T19:44:59.7847803Z libuuid pkgs/main::libuuid-1.41.5-h5eee18b_0 --> conda-forge::libuuid-2.38.1-h0b41bf4_0 2025-05-07T19:44:59.7848458Z ncurses pkgs/main::ncurses-6.4-h6a678d5_0 --> conda-forge::ncurses-6.5-h2d0b736_3 2025-05-07T19:44:59.7849108Z python pkgs/main::python-3.13.2-hf623796_100~ --> conda-forge::python-3.13.2-hf636f53_101_cp313 2025-05-07T19:44:59.7849776Z sqlite pkgs/main::sqlite-3.45.3-h5eee18b_0 --> conda-forge::sqlite-3.49.2-h9eae976_0 2025-05-07T19:44:59.7850438Z wheel pkgs/main/linux-64::wheel-0.45.1-py31~ --> conda-forge/noarch::wheel-0.45.1-pyhd8ed1ab_1 2025-05-07T19:44:59.7851036Z xz pkgs/main::xz-5.6.4-h5eee18b_1 --> conda-forge::xz-5.8.1-hbcc6ac9_1 2025-05-07T19:44:59.7851591Z zlib pkgs/main::zlib-1.2.13-h5eee18b_1 --> conda-forge::zlib-1.3.1-hb9d3cd8_2 2025-05-07T19:44:59.7851928Z 2025-05-07T19:44:59.7852249Z The following packages will be SUPERSEDED by a higher-priority channel: 2025-05-07T19:44:59.7852575Z 2025-05-07T19:44:59.7852818Z expat pkgs/main::expat-2.7.1-h6a678d5_0 --> conda-forge::expat-2.7.0-h5888daf_0 2025-05-07T19:44:59.7853478Z tk pkgs/main::tk-8.6.14-h39e8969_0 --> conda-forge::tk-8.6.13-noxft_h4845f30_101 2025-05-07T19:44:59.7853815Z 2025-05-07T19:44:59.7853851Z 2025-05-07T19:44:59.7853855Z 2025-05-07T19:44:59.7854023Z Downloading and Extracting Packages: ...working... 2025-05-07T19:44:59.7854398Z openjdk-23.0.2 | 181.4 MB | | 0% 2025-05-07T19:44:59.7854634Z 2025-05-07T19:44:59.7854956Z bazel-7.5.0 | 47.4 MB | | 0%  2025-05-07T19:44:59.7855189Z 2025-05-07T19:44:59.7855193Z 2025-05-07T19:44:59.7855397Z python-3.13.2 | 31.7 MB | | 0%  2025-05-07T19:44:59.7855657Z 2025-05-07T19:44:59.7855661Z 2025-05-07T19:44:59.7855665Z 2025-05-07T19:44:59.7855873Z cmake-4.0.2 | 19.4 MB | | 0%  2025-05-07T19:44:59.7856114Z 2025-05-07T19:44:59.7856118Z 2025-05-07T19:44:59.7856140Z 2025-05-07T19:44:59.7856148Z 2025-05-07T19:44:59.7860828Z icu-75.1 | 11.6 MB | | 0%  2025-05-07T19:44:59.7861064Z 2025-05-07T19:44:59.7861068Z 2025-05-07T19:44:59.7861072Z 2025-05-07T19:44:59.7861075Z 2025-05-07T19:44:59.7861079Z 2025-05-07T19:44:59.7861329Z libgrpc-1.71.0 | 7.6 MB | | 0%  2025-05-07T19:44:59.7861590Z 2025-05-07T19:44:59.7861594Z 2025-05-07T19:44:59.7861597Z 2025-05-07T19:44:59.7861601Z 2025-05-07T19:44:59.7861604Z 2025-05-07T19:44:59.7861608Z 2025-05-07T19:44:59.7861884Z openblas-0.3.29 | 5.8 MB | | 0%  2025-05-07T19:44:59.7862151Z 2025-05-07T19:44:59.7862155Z 2025-05-07T19:44:59.7862158Z 2025-05-07T19:44:59.7862162Z 2025-05-07T19:44:59.7862166Z 2025-05-07T19:44:59.7862169Z 2025-05-07T19:44:59.7862173Z 2025-05-07T19:44:59.7862425Z libopenblas-0.3.29 | 5.6 MB | | 0%  2025-05-07T19:44:59.7862726Z 2025-05-07T19:44:59.7862730Z 2025-05-07T19:44:59.7862737Z 2025-05-07T19:44:59.7862741Z 2025-05-07T19:44:59.7862744Z 2025-05-07T19:44:59.7862747Z 2025-05-07T19:44:59.7862751Z 2025-05-07T19:44:59.7862754Z 2025-05-07T19:44:59.7863183Z libcups-2.3.3 | 4.3 MB | | 0%  2025-05-07T19:44:59.7863475Z 2025-05-07T19:44:59.7863479Z 2025-05-07T19:44:59.7863482Z 2025-05-07T19:44:59.7863487Z 2025-05-07T19:44:59.7863490Z 2025-05-07T19:44:59.7863494Z 2025-05-07T19:44:59.7863497Z 2025-05-07T19:44:59.7863501Z 2025-05-07T19:44:59.7863505Z 2025-05-07T19:44:59.7863792Z libglib-2.84.0 | 3.8 MB | | 0%  2025-05-07T19:44:59.7864087Z 2025-05-07T19:44:59.7864103Z 2025-05-07T19:44:59.7864106Z 2025-05-07T19:44:59.7864110Z 2025-05-07T19:44:59.7864113Z 2025-05-07T19:44:59.7864117Z 2025-05-07T19:44:59.7864120Z 2025-05-07T19:44:59.7864128Z 2025-05-07T19:44:59.7864132Z 2025-05-07T19:44:59.7864136Z 2025-05-07T19:44:59.7865394Z libprotobuf-5.29.3 | 3.2 MB | | 0%  2025-05-07T19:44:59.7865783Z 2025-05-07T19:44:59.7865788Z 2025-05-07T19:44:59.7865793Z 2025-05-07T19:44:59.7865797Z 2025-05-07T19:44:59.7865801Z 2025-05-07T19:44:59.7865804Z 2025-05-07T19:44:59.7865809Z 2025-05-07T19:44:59.7865867Z 2025-05-07T19:44:59.7865871Z 2025-05-07T19:44:59.7865875Z 2025-05-07T19:44:59.7865879Z 2025-05-07T19:44:59.7866133Z tk-8.6.13 | 3.2 MB | | 0%  2025-05-07T19:44:59.7866403Z 2025-05-07T19:44:59.7866406Z 2025-05-07T19:44:59.7866410Z 2025-05-07T19:44:59.7866413Z 2025-05-07T19:44:59.7866418Z 2025-05-07T19:44:59.7866442Z 2025-05-07T19:44:59.7866445Z 2025-05-07T19:44:59.7866449Z 2025-05-07T19:44:59.7866452Z 2025-05-07T19:44:59.7866456Z 2025-05-07T19:44:59.7866459Z 2025-05-07T19:44:59.7866462Z 2025-05-07T19:44:59.7867181Z harfbuzz-11.0.0 | 1.6 MB | | 0%  2025-05-07T19:44:59.7867488Z 2025-05-07T19:44:59.7867584Z 2025-05-07T19:44:59.7867625Z 2025-05-07T19:44:59.7867629Z 2025-05-07T19:44:59.7867633Z 2025-05-07T19:44:59.7867636Z 2025-05-07T19:44:59.7867640Z 2025-05-07T19:44:59.7867643Z 2025-05-07T19:44:59.7867647Z 2025-05-07T19:44:59.7867650Z 2025-05-07T19:44:59.7867653Z 2025-05-07T19:44:59.7867657Z 2025-05-07T19:44:59.7867661Z 2025-05-07T19:44:59.7867965Z font-ttf-ubuntu-0.83 | 1.5 MB | | 0%  2025-05-07T19:44:59.7868318Z 2025-05-07T19:44:59.7868322Z 2025-05-07T19:44:59.7868326Z 2025-05-07T19:44:59.7868329Z 2025-05-07T19:44:59.7868333Z 2025-05-07T19:44:59.7868337Z 2025-05-07T19:44:59.7868340Z 2025-05-07T19:44:59.7868344Z 2025-05-07T19:44:59.7868347Z 2025-05-07T19:44:59.7868351Z 2025-05-07T19:44:59.7868355Z 2025-05-07T19:44:59.7868358Z 2025-05-07T19:44:59.7868362Z 2025-05-07T19:44:59.7868365Z 2025-05-07T19:44:59.7869451Z libgfortran5-15.1.0 | 1.5 MB | | 0%  2025-05-07T19:44:59.7869824Z 2025-05-07T19:44:59.7869829Z 2025-05-07T19:44:59.7869832Z 2025-05-07T19:44:59.7869836Z 2025-05-07T19:44:59.7869840Z 2025-05-07T19:44:59.7869844Z 2025-05-07T19:44:59.7869847Z 2025-05-07T19:44:59.7869851Z 2025-05-07T19:44:59.7869874Z 2025-05-07T19:44:59.7869878Z 2025-05-07T19:44:59.7869882Z 2025-05-07T19:44:59.7869886Z 2025-05-07T19:44:59.7869890Z 2025-05-07T19:44:59.7869894Z 2025-05-07T19:44:59.7869919Z 2025-05-07T19:44:59.7870184Z krb5-1.21.3 | 1.3 MB | | 0%  2025-05-07T19:44:59.7870474Z 2025-05-07T19:44:59.7870496Z 2025-05-07T19:44:59.7870500Z 2025-05-07T19:44:59.7870504Z 2025-05-07T19:44:59.7870507Z 2025-05-07T19:44:59.7870511Z 2025-05-07T19:44:59.7870514Z 2025-05-07T19:44:59.7870518Z 2025-05-07T19:44:59.7870521Z 2025-05-07T19:44:59.7870525Z 2025-05-07T19:44:59.7870529Z 2025-05-07T19:44:59.7870538Z 2025-05-07T19:44:59.7870542Z 2025-05-07T19:44:59.7870545Z 2025-05-07T19:44:59.7870550Z 2025-05-07T19:44:59.7870558Z 2025-05-07T19:44:59.7871270Z libabseil-20250127.1 | 1.3 MB | | 0%  2025-05-07T19:44:59.7871647Z 2025-05-07T19:44:59.7871651Z 2025-05-07T19:44:59.7871672Z 2025-05-07T19:44:59.7871675Z 2025-05-07T19:44:59.7871679Z 2025-05-07T19:44:59.7871683Z 2025-05-07T19:44:59.7871686Z 2025-05-07T19:44:59.7871690Z 2025-05-07T19:44:59.7871694Z 2025-05-07T19:44:59.7871697Z 2025-05-07T19:44:59.7871701Z 2025-05-07T19:44:59.7871704Z 2025-05-07T19:44:59.7871708Z 2025-05-07T19:44:59.7871711Z 2025-05-07T19:44:59.7871715Z 2025-05-07T19:44:59.7871719Z 2025-05-07T19:44:59.7871742Z 2025-05-07T19:44:59.7872280Z cairo-1.18.4 | 955 KB | | 0%  2025-05-07T19:44:59.7872585Z 2025-05-07T19:44:59.7872603Z 2025-05-07T19:44:59.7872608Z 2025-05-07T19:44:59.7872618Z 2025-05-07T19:44:59.7872622Z 2025-05-07T19:44:59.7872644Z 2025-05-07T19:44:59.7872649Z 2025-05-07T19:44:59.7872658Z 2025-05-07T19:44:59.7872661Z 2025-05-07T19:44:59.7872665Z 2025-05-07T19:44:59.7872668Z 2025-05-07T19:44:59.7872672Z 2025-05-07T19:44:59.7872675Z 2025-05-07T19:44:59.7872678Z 2025-05-07T19:44:59.7872682Z 2025-05-07T19:44:59.7872685Z 2025-05-07T19:44:59.7872689Z 2025-05-07T19:44:59.7872692Z 2025-05-07T19:44:59.7873365Z pcre2-10.44 | 934 KB | | 0%  2025-05-07T19:44:59.7873692Z 2025-05-07T19:44:59.7873696Z 2025-05-07T19:44:59.7873699Z 2025-05-07T19:44:59.7873702Z 2025-05-07T19:44:59.7873720Z 2025-05-07T19:44:59.7873724Z 2025-05-07T19:44:59.7873727Z 2025-05-07T19:44:59.7873731Z 2025-05-07T19:44:59.7873734Z 2025-05-07T19:44:59.7873738Z 2025-05-07T19:44:59.7873741Z 2025-05-07T19:44:59.7873745Z 2025-05-07T19:44:59.7873748Z 2025-05-07T19:44:59.7873751Z 2025-05-07T19:44:59.7874562Z 2025-05-07T19:44:59.7874568Z 2025-05-07T19:44:59.7874571Z 2025-05-07T19:44:59.7874576Z 2025-05-07T19:44:59.7874599Z 2025-05-07T19:45:00.1793387Z ... (more hidden) ... 2025-05-07T19:45:00.1793719Z 2025-05-07T19:45:00.1793724Z 2025-05-07T19:45:00.1793728Z 2025-05-07T19:45:00.1904441Z cmake-4.0.2 | 19.4 MB | | 0%  2025-05-07T19:45:00.1905247Z 2025-05-07T19:45:00.1905261Z 2025-05-07T19:45:00.1905272Z 2025-05-07T19:45:00.1905283Z 2025-05-07T19:45:00.2289971Z icu-75.1 | 11.6 MB | | 0%  2025-05-07T19:45:00.2290271Z 2025-05-07T19:45:00.2290429Z 2025-05-07T19:45:00.2365249Z python-3.13.2 | 31.7 MB | | 0%  2025-05-07T19:45:00.2393028Z openjdk-23.0.2 | 181.4 MB | | 0% 2025-05-07T19:45:00.2393315Z 2025-05-07T19:45:00.2794115Z bazel-7.5.0 | 47.4 MB | | 0%  2025-05-07T19:45:00.2794954Z 2025-05-07T19:45:00.2794978Z 2025-05-07T19:45:00.2794983Z 2025-05-07T19:45:00.2903470Z cmake-4.0.2 | 19.4 MB | ####1 | 42%  2025-05-07T19:45:00.2903792Z 2025-05-07T19:45:00.2903807Z 2025-05-07T19:45:00.2903811Z 2025-05-07T19:45:00.2903815Z 2025-05-07T19:45:00.3313520Z icu-75.1 | 11.6 MB | ###9 | 40%  2025-05-07T19:45:00.3313801Z 2025-05-07T19:45:00.3313826Z 2025-05-07T19:45:00.3401529Z python-3.13.2 | 31.7 MB | 7 | 8%  2025-05-07T19:45:00.3444564Z openjdk-23.0.2 | 181.4 MB | 4 | 5% 2025-05-07T19:45:00.3444978Z 2025-05-07T19:45:00.3971936Z bazel-7.5.0 | 47.4 MB | ##4 | 24%  2025-05-07T19:45:00.3972212Z 2025-05-07T19:45:00.3972261Z 2025-05-07T19:45:00.3972265Z 2025-05-07T19:45:00.4313109Z 2025-05-07T19:45:00.4313570Z icu-75.1 | 11.6 MB | #######3 | 74%  2025-05-07T19:45:00.4313853Z 2025-05-07T19:45:00.4313858Z 2025-05-07T19:45:00.4398530Z python-3.13.2 | 31.7 MB | #6 | 16%  2025-05-07T19:45:00.4777815Z openjdk-23.0.2 | 181.4 MB | # | 10% 2025-05-07T19:45:00.4778626Z 2025-05-07T19:45:00.4778640Z 2025-05-07T19:45:00.4778652Z 2025-05-07T19:45:00.4779135Z cmake-4.0.2 | 19.4 MB | ########## | 100%  2025-05-07T19:45:00.4779396Z 2025-05-07T19:45:00.4779400Z 2025-05-07T19:45:00.4779404Z 2025-05-07T19:45:00.4826253Z cmake-4.0.2 | 19.4 MB | ########## | 100%  2025-05-07T19:45:00.4827029Z 2025-05-07T19:45:00.4827075Z 2025-05-07T19:45:00.4827087Z 2025-05-07T19:45:00.4827099Z 2025-05-07T19:45:00.4907619Z icu-75.1 | 11.6 MB | ########## | 100%  2025-05-07T19:45:00.4907898Z 2025-05-07T19:45:00.5198255Z bazel-7.5.0 | 47.4 MB | ###7 | 38%  2025-05-07T19:45:00.5198533Z 2025-05-07T19:45:00.5198538Z 2025-05-07T19:45:00.5198542Z 2025-05-07T19:45:00.5198546Z 2025-05-07T19:45:00.5198578Z 2025-05-07T19:45:00.5375448Z libgrpc-1.71.0 | 7.6 MB | | 0%  2025-05-07T19:45:00.5376293Z 2025-05-07T19:45:00.5376321Z 2025-05-07T19:45:00.5541945Z python-3.13.2 | 31.7 MB | ###7 | 37%  2025-05-07T19:45:00.5542780Z 2025-05-07T19:45:00.5542817Z 2025-05-07T19:45:00.5542828Z 2025-05-07T19:45:00.5542840Z 2025-05-07T19:45:00.5542851Z 2025-05-07T19:45:00.5542862Z 2025-05-07T19:45:00.5908248Z openblas-0.3.29 | 5.8 MB | | 0%  2025-05-07T19:45:00.5908577Z 2025-05-07T19:45:00.6063489Z bazel-7.5.0 | 47.4 MB | ####9 | 50%  2025-05-07T19:45:00.6200375Z openjdk-23.0.2 | 181.4 MB | #4 | 14% 2025-05-07T19:45:00.6200668Z 2025-05-07T19:45:00.6200673Z 2025-05-07T19:45:00.6200676Z 2025-05-07T19:45:00.6200680Z 2025-05-07T19:45:00.6200685Z 2025-05-07T19:45:00.6541889Z libgrpc-1.71.0 | 7.6 MB | ######9 | 70%  2025-05-07T19:45:00.6542729Z 2025-05-07T19:45:00.6542744Z 2025-05-07T19:45:00.6542756Z 2025-05-07T19:45:00.6543077Z 2025-05-07T19:45:00.6543090Z 2025-05-07T19:45:00.6543102Z 2025-05-07T19:45:00.6909421Z openblas-0.3.29 | 5.8 MB | ########3 | 84%  2025-05-07T19:45:00.6909955Z 2025-05-07T19:45:00.7095767Z bazel-7.5.0 | 47.4 MB | ######3 | 63%  2025-05-07T19:45:00.7396373Z openjdk-23.0.2 | 181.4 MB | #7 | 18% 2025-05-07T19:45:00.7396659Z 2025-05-07T19:45:00.7396664Z 2025-05-07T19:45:00.7437291Z python-3.13.2 | 31.7 MB | ####9 | 49%  2025-05-07T19:45:00.7437577Z 2025-05-07T19:45:00.7437582Z 2025-05-07T19:45:00.7437587Z 2025-05-07T19:45:00.7437590Z 2025-05-07T19:45:00.7437594Z 2025-05-07T19:45:00.7437598Z 2025-05-07T19:45:00.7602684Z openblas-0.3.29 | 5.8 MB | ########## | 100%  2025-05-07T19:45:00.7603000Z 2025-05-07T19:45:00.7603005Z 2025-05-07T19:45:00.7603009Z 2025-05-07T19:45:00.7603013Z 2025-05-07T19:45:00.7603016Z 2025-05-07T19:45:00.7909908Z libgrpc-1.71.0 | 7.6 MB | ########## | 100%  2025-05-07T19:45:00.7910204Z 2025-05-07T19:45:00.8098926Z bazel-7.5.0 | 47.4 MB | ########2 | 83%  2025-05-07T19:45:00.8099230Z 2025-05-07T19:45:00.8099235Z 2025-05-07T19:45:00.8099239Z 2025-05-07T19:45:00.8099242Z 2025-05-07T19:45:00.8099246Z 2025-05-07T19:45:00.8099250Z 2025-05-07T19:45:00.8099254Z 2025-05-07T19:45:00.8175748Z libopenblas-0.3.29 | 5.6 MB | | 0%  2025-05-07T19:45:00.8176112Z 2025-05-07T19:45:00.8176118Z 2025-05-07T19:45:00.8176123Z 2025-05-07T19:45:00.8176126Z 2025-05-07T19:45:00.8176130Z 2025-05-07T19:45:00.8176133Z 2025-05-07T19:45:00.8176138Z 2025-05-07T19:45:00.8176142Z 2025-05-07T19:45:00.8402453Z libcups-2.3.3 | 4.3 MB | | 0%  2025-05-07T19:45:00.8403313Z 2025-05-07T19:45:00.8778606Z 2025-05-07T19:45:00.8779475Z python-3.13.2 | 31.7 MB | #####8 | 59%  2025-05-07T19:45:00.8911878Z openjdk-23.0.2 | 181.4 MB | ##1 | 21% 2025-05-07T19:45:00.8912195Z 2025-05-07T19:45:00.9402272Z bazel-7.5.0 | 47.4 MB | #########8 | 98%  2025-05-07T19:45:00.9402557Z 2025-05-07T19:45:00.9402562Z 2025-05-07T19:45:00.9639589Z python-3.13.2 | 31.7 MB | #######8 | 78%  2025-05-07T19:45:00.9639882Z 2025-05-07T19:45:00.9639887Z 2025-05-07T19:45:00.9639890Z 2025-05-07T19:45:00.9639894Z 2025-05-07T19:45:00.9639898Z 2025-05-07T19:45:00.9639901Z 2025-05-07T19:45:00.9639905Z 2025-05-07T19:45:00.9639910Z 2025-05-07T19:45:00.9640500Z libcups-2.3.3 | 4.3 MB | ########## | 100%  2025-05-07T19:45:00.9640814Z 2025-05-07T19:45:00.9640819Z 2025-05-07T19:45:00.9640823Z 2025-05-07T19:45:00.9640827Z 2025-05-07T19:45:00.9640831Z 2025-05-07T19:45:00.9640834Z 2025-05-07T19:45:00.9640839Z 2025-05-07T19:45:00.9640853Z 2025-05-07T19:45:00.9778468Z libcups-2.3.3 | 4.3 MB | ########## | 100%  2025-05-07T19:45:00.9933559Z openjdk-23.0.2 | 181.4 MB | ##4 | 25% 2025-05-07T19:45:00.9934310Z 2025-05-07T19:45:00.9934326Z 2025-05-07T19:45:00.9934338Z 2025-05-07T19:45:00.9934363Z 2025-05-07T19:45:00.9934374Z 2025-05-07T19:45:00.9934385Z 2025-05-07T19:45:00.9934395Z 2025-05-07T19:45:00.9935305Z libopenblas-0.3.29 | 5.6 MB | ########## | 100%  2025-05-07T19:45:00.9936159Z 2025-05-07T19:45:00.9936171Z 2025-05-07T19:45:00.9936182Z 2025-05-07T19:45:00.9936192Z 2025-05-07T19:45:00.9936203Z 2025-05-07T19:45:00.9936214Z 2025-05-07T19:45:00.9936225Z 2025-05-07T19:45:01.0092909Z libopenblas-0.3.29 | 5.6 MB | ########## | 100%  2025-05-07T19:45:01.0093237Z 2025-05-07T19:45:01.0093241Z 2025-05-07T19:45:01.0093246Z 2025-05-07T19:45:01.0093249Z 2025-05-07T19:45:01.0093253Z 2025-05-07T19:45:01.0093257Z 2025-05-07T19:45:01.0093260Z 2025-05-07T19:45:01.0093264Z 2025-05-07T19:45:01.0093267Z 2025-05-07T19:45:01.0409639Z libglib-2.84.0 | 3.8 MB | | 0%  2025-05-07T19:45:01.0409963Z 2025-05-07T19:45:01.0409968Z 2025-05-07T19:45:01.0514691Z python-3.13.2 | 31.7 MB | #########1 | 91%  2025-05-07T19:45:01.0515904Z 2025-05-07T19:45:01.0515919Z 2025-05-07T19:45:01.0515932Z 2025-05-07T19:45:01.0515943Z 2025-05-07T19:45:01.0515954Z 2025-05-07T19:45:01.0515964Z 2025-05-07T19:45:01.0515975Z 2025-05-07T19:45:01.0515985Z 2025-05-07T19:45:01.0515996Z 2025-05-07T19:45:01.0516006Z 2025-05-07T19:45:01.0851475Z libprotobuf-5.29.3 | 3.2 MB | | 0%  2025-05-07T19:45:01.1419504Z openjdk-23.0.2 | 181.4 MB | ##7 | 28% 2025-05-07T19:45:01.1419783Z 2025-05-07T19:45:01.1419965Z 2025-05-07T19:45:01.1419974Z 2025-05-07T19:45:01.1419979Z 2025-05-07T19:45:01.1420011Z 2025-05-07T19:45:01.1420016Z 2025-05-07T19:45:01.1420022Z 2025-05-07T19:45:01.1420027Z 2025-05-07T19:45:01.1420031Z 2025-05-07T19:45:01.1420516Z libglib-2.84.0 | 3.8 MB | ########## | 100%  2025-05-07T19:45:01.1420852Z 2025-05-07T19:45:01.1420870Z 2025-05-07T19:45:01.1420873Z 2025-05-07T19:45:01.1420877Z 2025-05-07T19:45:01.1420903Z 2025-05-07T19:45:01.1420907Z 2025-05-07T19:45:01.1420910Z 2025-05-07T19:45:01.1420913Z 2025-05-07T19:45:01.1420917Z 2025-05-07T19:45:01.1623224Z libglib-2.84.0 | 3.8 MB | ########## | 100%  2025-05-07T19:45:01.1623556Z 2025-05-07T19:45:01.1623561Z 2025-05-07T19:45:01.1623564Z 2025-05-07T19:45:01.1623568Z 2025-05-07T19:45:01.1623572Z 2025-05-07T19:45:01.1623575Z 2025-05-07T19:45:01.1623579Z 2025-05-07T19:45:01.1623583Z 2025-05-07T19:45:01.1623586Z 2025-05-07T19:45:01.1623590Z 2025-05-07T19:45:01.1623881Z libprotobuf-5.29.3 | 3.2 MB | ########## | 100%  2025-05-07T19:45:01.1624197Z 2025-05-07T19:45:01.1624201Z 2025-05-07T19:45:01.1624204Z 2025-05-07T19:45:01.1624208Z 2025-05-07T19:45:01.1624219Z 2025-05-07T19:45:01.1624223Z 2025-05-07T19:45:01.1624227Z 2025-05-07T19:45:01.1624244Z 2025-05-07T19:45:01.1624247Z 2025-05-07T19:45:01.1624251Z 2025-05-07T19:45:01.1851721Z libprotobuf-5.29.3 | 3.2 MB | ########## | 100%  2025-05-07T19:45:01.1867530Z openjdk-23.0.2 | 181.4 MB | ###1 | 32% 2025-05-07T19:45:01.1867907Z 2025-05-07T19:45:01.1868100Z 2025-05-07T19:45:01.1868107Z 2025-05-07T19:45:01.1868112Z 2025-05-07T19:45:01.1868116Z 2025-05-07T19:45:01.1868121Z 2025-05-07T19:45:01.1868126Z 2025-05-07T19:45:01.1868130Z 2025-05-07T19:45:01.1868134Z 2025-05-07T19:45:01.1868139Z 2025-05-07T19:45:01.1868143Z 2025-05-07T19:45:01.2328068Z tk-8.6.13 | 3.2 MB | | 0%  2025-05-07T19:45:01.2328949Z 2025-05-07T19:45:01.2328964Z 2025-05-07T19:45:01.2328975Z 2025-05-07T19:45:01.2328986Z 2025-05-07T19:45:01.2328997Z 2025-05-07T19:45:01.2329008Z 2025-05-07T19:45:01.2329019Z 2025-05-07T19:45:01.2329029Z 2025-05-07T19:45:01.2329040Z 2025-05-07T19:45:01.2329050Z 2025-05-07T19:45:01.2329088Z 2025-05-07T19:45:01.2329099Z 2025-05-07T19:45:01.2838771Z harfbuzz-11.0.0 | 1.6 MB | | 1%  2025-05-07T19:45:01.2839754Z 2025-05-07T19:45:01.2839768Z 2025-05-07T19:45:01.2839779Z 2025-05-07T19:45:01.2839789Z 2025-05-07T19:45:01.2839800Z 2025-05-07T19:45:01.2839810Z 2025-05-07T19:45:01.2839820Z 2025-05-07T19:45:01.2839830Z 2025-05-07T19:45:01.2839841Z 2025-05-07T19:45:01.2839852Z 2025-05-07T19:45:01.2839862Z 2025-05-07T19:45:01.2879698Z tk-8.6.13 | 3.2 MB | ########## | 100%  2025-05-07T19:45:01.2880533Z 2025-05-07T19:45:01.2880582Z 2025-05-07T19:45:01.2880594Z 2025-05-07T19:45:01.2880605Z 2025-05-07T19:45:01.2880615Z 2025-05-07T19:45:01.2880626Z 2025-05-07T19:45:01.2880637Z 2025-05-07T19:45:01.2880669Z 2025-05-07T19:45:01.2880680Z 2025-05-07T19:45:01.2880690Z 2025-05-07T19:45:01.2880701Z 2025-05-07T19:45:01.2880711Z 2025-05-07T19:45:01.3060824Z harfbuzz-11.0.0 | 1.6 MB | ########## | 100%  2025-05-07T19:45:01.3097037Z openjdk-23.0.2 | 181.4 MB | ###5 | 35% 2025-05-07T19:45:01.3097511Z 2025-05-07T19:45:01.3097563Z 2025-05-07T19:45:01.3097567Z 2025-05-07T19:45:01.3097624Z 2025-05-07T19:45:01.3189113Z icu-75.1 | 11.6 MB | ########## | 100%  2025-05-07T19:45:01.3189391Z 2025-05-07T19:45:01.3189546Z 2025-05-07T19:45:01.3189550Z 2025-05-07T19:45:01.3189553Z 2025-05-07T19:45:01.3189557Z 2025-05-07T19:45:01.3189571Z 2025-05-07T19:45:01.3189575Z 2025-05-07T19:45:01.3189655Z 2025-05-07T19:45:01.3189663Z 2025-05-07T19:45:01.3189679Z 2025-05-07T19:45:01.3189683Z 2025-05-07T19:45:01.3189687Z 2025-05-07T19:45:01.3189737Z 2025-05-07T19:45:01.3671084Z font-ttf-ubuntu-0.83 | 1.5 MB | 1 | 1%  2025-05-07T19:45:01.3672302Z 2025-05-07T19:45:01.3672318Z 2025-05-07T19:45:01.3672329Z 2025-05-07T19:45:01.3672340Z 2025-05-07T19:45:01.3672386Z 2025-05-07T19:45:01.3672397Z 2025-05-07T19:45:01.3672408Z 2025-05-07T19:45:01.3672419Z 2025-05-07T19:45:01.3672429Z 2025-05-07T19:45:01.3672456Z 2025-05-07T19:45:01.3672467Z 2025-05-07T19:45:01.3672478Z 2025-05-07T19:45:01.3672511Z 2025-05-07T19:45:01.3934545Z font-ttf-ubuntu-0.83 | 1.5 MB | ########## | 100%  2025-05-07T19:45:01.3934902Z 2025-05-07T19:45:01.3934906Z 2025-05-07T19:45:01.3934910Z 2025-05-07T19:45:01.3934914Z 2025-05-07T19:45:01.3934918Z 2025-05-07T19:45:01.3934921Z 2025-05-07T19:45:01.3934925Z 2025-05-07T19:45:01.3934941Z 2025-05-07T19:45:01.3934945Z 2025-05-07T19:45:01.3934948Z 2025-05-07T19:45:01.3934952Z 2025-05-07T19:45:01.3934955Z 2025-05-07T19:45:01.3934959Z 2025-05-07T19:45:01.3934962Z 2025-05-07T19:45:01.4062925Z libgfortran5-15.1.0 | 1.5 MB | 1 | 1%  2025-05-07T19:45:01.4268505Z openjdk-23.0.2 | 181.4 MB | ###9 | 39% 2025-05-07T19:45:01.4268942Z 2025-05-07T19:45:01.4268977Z 2025-05-07T19:45:01.4268983Z 2025-05-07T19:45:01.4268987Z 2025-05-07T19:45:01.4269007Z 2025-05-07T19:45:01.4269028Z 2025-05-07T19:45:01.4269032Z 2025-05-07T19:45:01.4269050Z 2025-05-07T19:45:01.4269053Z 2025-05-07T19:45:01.4269057Z 2025-05-07T19:45:01.4269060Z 2025-05-07T19:45:01.4269064Z 2025-05-07T19:45:01.4269068Z 2025-05-07T19:45:01.4269071Z 2025-05-07T19:45:01.4269075Z 2025-05-07T19:45:01.4519795Z krb5-1.21.3 | 1.3 MB | 1 | 1%  2025-05-07T19:45:01.4520123Z 2025-05-07T19:45:01.4520128Z 2025-05-07T19:45:01.4520132Z 2025-05-07T19:45:01.4520135Z 2025-05-07T19:45:01.4520139Z 2025-05-07T19:45:01.4520142Z 2025-05-07T19:45:01.4520146Z 2025-05-07T19:45:01.4520150Z 2025-05-07T19:45:01.4520153Z 2025-05-07T19:45:01.4520157Z 2025-05-07T19:45:01.4520160Z 2025-05-07T19:45:01.4520176Z 2025-05-07T19:45:01.4520179Z 2025-05-07T19:45:01.4520183Z 2025-05-07T19:45:01.4715057Z libgfortran5-15.1.0 | 1.5 MB | ########## | 100%  2025-05-07T19:45:01.4715412Z 2025-05-07T19:45:01.4715422Z 2025-05-07T19:45:01.4715426Z 2025-05-07T19:45:01.4715430Z 2025-05-07T19:45:01.4715445Z 2025-05-07T19:45:01.4715449Z 2025-05-07T19:45:01.4715452Z 2025-05-07T19:45:01.4715456Z 2025-05-07T19:45:01.4715459Z 2025-05-07T19:45:01.4715463Z 2025-05-07T19:45:01.4715466Z 2025-05-07T19:45:01.4715470Z 2025-05-07T19:45:01.4715473Z 2025-05-07T19:45:01.4715477Z 2025-05-07T19:45:01.4715480Z 2025-05-07T19:45:01.4944028Z krb5-1.21.3 | 1.3 MB | ########## | 100%  2025-05-07T19:45:01.4944524Z 2025-05-07T19:45:01.4944529Z 2025-05-07T19:45:01.4944533Z 2025-05-07T19:45:01.4944536Z 2025-05-07T19:45:01.4944540Z 2025-05-07T19:45:01.4944543Z 2025-05-07T19:45:01.5107592Z openblas-0.3.29 | 5.8 MB | ########## | 100%  2025-05-07T19:45:01.5108473Z 2025-05-07T19:45:01.5108487Z 2025-05-07T19:45:01.5108901Z 2025-05-07T19:45:01.5108915Z 2025-05-07T19:45:01.5108926Z 2025-05-07T19:45:01.5108937Z 2025-05-07T19:45:01.5108947Z 2025-05-07T19:45:01.5109149Z 2025-05-07T19:45:01.5109160Z 2025-05-07T19:45:01.5109170Z 2025-05-07T19:45:01.5109181Z 2025-05-07T19:45:01.5109192Z 2025-05-07T19:45:01.5109202Z 2025-05-07T19:45:01.5109213Z 2025-05-07T19:45:01.5109223Z 2025-05-07T19:45:01.5109234Z 2025-05-07T19:45:01.5109244Z 2025-05-07T19:45:01.5277161Z cairo-1.18.4 | 955 KB | 1 | 2%  2025-05-07T19:45:01.5277499Z 2025-05-07T19:45:01.5277503Z 2025-05-07T19:45:01.5277507Z 2025-05-07T19:45:01.5277510Z 2025-05-07T19:45:01.5277514Z 2025-05-07T19:45:01.5277518Z 2025-05-07T19:45:01.5277522Z 2025-05-07T19:45:01.5277525Z 2025-05-07T19:45:01.5277529Z 2025-05-07T19:45:01.5277532Z 2025-05-07T19:45:01.5277536Z 2025-05-07T19:45:01.5277539Z 2025-05-07T19:45:01.5277555Z 2025-05-07T19:45:01.5277559Z 2025-05-07T19:45:01.5277562Z 2025-05-07T19:45:01.5277577Z 2025-05-07T19:45:01.5379019Z libabseil-20250127.1 | 1.3 MB | 1 | 1%  2025-05-07T19:45:01.5379745Z 2025-05-07T19:45:01.5379750Z 2025-05-07T19:45:01.5379753Z 2025-05-07T19:45:01.5379771Z 2025-05-07T19:45:01.5379774Z 2025-05-07T19:45:01.5379778Z 2025-05-07T19:45:01.5379781Z 2025-05-07T19:45:01.5379785Z 2025-05-07T19:45:01.5379788Z 2025-05-07T19:45:01.5379791Z 2025-05-07T19:45:01.5379795Z 2025-05-07T19:45:01.5379798Z 2025-05-07T19:45:01.5379802Z 2025-05-07T19:45:01.5379805Z 2025-05-07T19:45:01.5379809Z 2025-05-07T19:45:01.5379812Z 2025-05-07T19:45:01.5379815Z 2025-05-07T19:45:01.5712969Z cairo-1.18.4 | 955 KB | ########## | 100%  2025-05-07T19:45:01.5713307Z 2025-05-07T19:45:01.5713313Z 2025-05-07T19:45:01.5713317Z 2025-05-07T19:45:01.5713320Z 2025-05-07T19:45:01.5713324Z 2025-05-07T19:45:01.5713328Z 2025-05-07T19:45:01.5713331Z 2025-05-07T19:45:01.5713347Z 2025-05-07T19:45:01.5713351Z 2025-05-07T19:45:01.5713354Z 2025-05-07T19:45:01.5713358Z 2025-05-07T19:45:01.5713361Z 2025-05-07T19:45:01.5713371Z 2025-05-07T19:45:01.5713374Z 2025-05-07T19:45:01.5713378Z 2025-05-07T19:45:01.5713382Z 2025-05-07T19:45:01.5743343Z libabseil-20250127.1 | 1.3 MB | ########## | 100%  2025-05-07T19:45:01.5743700Z 2025-05-07T19:45:01.5743704Z 2025-05-07T19:45:01.6026051Z python-3.13.2 | 31.7 MB | ########## | 100%  2025-05-07T19:45:01.6026333Z 2025-05-07T19:45:01.6026491Z 2025-05-07T19:45:01.6026500Z 2025-05-07T19:45:01.6026504Z 2025-05-07T19:45:01.6026508Z 2025-05-07T19:45:01.6026512Z 2025-05-07T19:45:01.6026516Z 2025-05-07T19:45:01.6026560Z 2025-05-07T19:45:01.6026563Z 2025-05-07T19:45:01.6026567Z 2025-05-07T19:45:01.6026571Z 2025-05-07T19:45:01.6026574Z 2025-05-07T19:45:01.6026578Z 2025-05-07T19:45:01.6026581Z 2025-05-07T19:45:01.6026585Z 2025-05-07T19:45:01.6026588Z 2025-05-07T19:45:01.6026604Z 2025-05-07T19:45:01.6026608Z 2025-05-07T19:45:01.6180090Z pcre2-10.44 | 934 KB | 1 | 2%  2025-05-07T19:45:01.6290894Z openjdk-23.0.2 | 181.4 MB | ####2 | 43% 2025-05-07T19:45:01.6291652Z 2025-05-07T19:45:01.6336242Z bazel-7.5.0 | 47.4 MB | ########## | 100%  2025-05-07T19:45:01.6337038Z 2025-05-07T19:45:01.6337052Z 2025-05-07T19:45:01.6337063Z 2025-05-07T19:45:01.6337074Z 2025-05-07T19:45:01.6337084Z 2025-05-07T19:45:01.6337095Z 2025-05-07T19:45:01.6337106Z 2025-05-07T19:45:01.6337117Z 2025-05-07T19:45:01.6337127Z 2025-05-07T19:45:01.6337138Z 2025-05-07T19:45:01.6337148Z 2025-05-07T19:45:01.6337159Z 2025-05-07T19:45:01.6337169Z 2025-05-07T19:45:01.6337180Z 2025-05-07T19:45:01.6337190Z 2025-05-07T19:45:01.6337218Z 2025-05-07T19:45:01.6337229Z 2025-05-07T19:45:01.6337239Z 2025-05-07T19:45:01.6337249Z 2025-05-07T19:45:01.6342660Z ... (more hidden) ... 2025-05-07T19:45:01.6342955Z 2025-05-07T19:45:01.6342959Z 2025-05-07T19:45:01.6342975Z 2025-05-07T19:45:01.6343065Z 2025-05-07T19:45:01.6343084Z 2025-05-07T19:45:01.6343087Z 2025-05-07T19:45:01.6343091Z 2025-05-07T19:45:01.6343094Z 2025-05-07T19:45:01.6343098Z 2025-05-07T19:45:01.6343102Z 2025-05-07T19:45:01.6343105Z 2025-05-07T19:45:01.6343109Z 2025-05-07T19:45:01.6343112Z 2025-05-07T19:45:01.6343116Z 2025-05-07T19:45:01.6343119Z 2025-05-07T19:45:01.6343122Z 2025-05-07T19:45:01.6343126Z 2025-05-07T19:45:01.6343596Z 2025-05-07T19:45:01.6553478Z pcre2-10.44 | 934 KB | ########## | 100%  2025-05-07T19:45:01.6553823Z 2025-05-07T19:45:01.6553828Z 2025-05-07T19:45:01.6553832Z 2025-05-07T19:45:01.6553835Z 2025-05-07T19:45:01.6553839Z 2025-05-07T19:45:01.6553842Z 2025-05-07T19:45:01.6553846Z 2025-05-07T19:45:01.6553849Z 2025-05-07T19:45:01.6554122Z libcups-2.3.3 | 4.3 MB | ########## | 100%  2025-05-07T19:45:01.6554399Z 2025-05-07T19:45:01.6554402Z 2025-05-07T19:45:01.6554406Z 2025-05-07T19:45:01.6554417Z 2025-05-07T19:45:01.6554421Z 2025-05-07T19:45:01.6554425Z 2025-05-07T19:45:01.6554428Z 2025-05-07T19:45:01.6554432Z 2025-05-07T19:45:01.6554435Z 2025-05-07T19:45:01.6554439Z 2025-05-07T19:45:01.6554443Z 2025-05-07T19:45:01.6554446Z 2025-05-07T19:45:01.6554450Z 2025-05-07T19:45:01.6554467Z 2025-05-07T19:45:01.6554470Z 2025-05-07T19:45:01.6554473Z 2025-05-07T19:45:01.6554477Z 2025-05-07T19:45:01.6554480Z 2025-05-07T19:45:01.6554528Z 2025-05-07T19:45:01.8561714Z ... (more hidden) ... 2025-05-07T19:45:01.8562648Z 2025-05-07T19:45:01.8562663Z 2025-05-07T19:45:01.8562674Z 2025-05-07T19:45:01.8562685Z 2025-05-07T19:45:01.8562696Z 2025-05-07T19:45:01.8589499Z libgrpc-1.71.0 | 7.6 MB | ########## | 100%  2025-05-07T19:45:01.8590396Z 2025-05-07T19:45:01.8590409Z 2025-05-07T19:45:01.8590453Z 2025-05-07T19:45:01.8590857Z 2025-05-07T19:45:01.8590874Z 2025-05-07T19:45:01.8590884Z 2025-05-07T19:45:01.8590911Z 2025-05-07T19:45:02.0317847Z libopenblas-0.3.29 | 5.6 MB | ########## | 100%  2025-05-07T19:45:02.1317735Z openjdk-23.0.2 | 181.4 MB | ####5 | 46% 2025-05-07T19:45:02.1550475Z openjdk-23.0.2 | 181.4 MB | ####8 | 49% 2025-05-07T19:45:02.1550746Z 2025-05-07T19:45:02.1551046Z 2025-05-07T19:45:02.1551053Z 2025-05-07T19:45:02.1551058Z 2025-05-07T19:45:02.1551062Z 2025-05-07T19:45:02.1551066Z 2025-05-07T19:45:02.1551069Z 2025-05-07T19:45:02.1551073Z 2025-05-07T19:45:02.1551077Z 2025-05-07T19:45:02.2530230Z libglib-2.84.0 | 3.8 MB | ########## | 100%  2025-05-07T19:45:02.2530529Z 2025-05-07T19:45:02.2530637Z 2025-05-07T19:45:02.2530842Z 2025-05-07T19:45:02.2530849Z 2025-05-07T19:45:02.2530854Z 2025-05-07T19:45:02.2530858Z 2025-05-07T19:45:02.2530862Z 2025-05-07T19:45:02.2530884Z 2025-05-07T19:45:02.2530889Z 2025-05-07T19:45:02.2530892Z 2025-05-07T19:45:02.3218189Z libprotobuf-5.29.3 | 3.2 MB | ########## | 100%  2025-05-07T19:45:02.3797177Z openjdk-23.0.2 | 181.4 MB | #####1 | 51% 2025-05-07T19:45:02.3797483Z 2025-05-07T19:45:02.3797488Z 2025-05-07T19:45:02.3797493Z 2025-05-07T19:45:02.3797499Z 2025-05-07T19:45:02.3797503Z 2025-05-07T19:45:02.3797508Z 2025-05-07T19:45:02.3797513Z 2025-05-07T19:45:02.3797518Z 2025-05-07T19:45:02.3797535Z 2025-05-07T19:45:02.3797540Z 2025-05-07T19:45:02.3797545Z 2025-05-07T19:45:02.3797549Z 2025-05-07T19:45:02.3799574Z harfbuzz-11.0.0 | 1.6 MB | ########## | 100%  2025-05-07T19:45:02.3799897Z 2025-05-07T19:45:02.3799902Z 2025-05-07T19:45:02.3799906Z 2025-05-07T19:45:02.3799909Z 2025-05-07T19:45:02.3799912Z 2025-05-07T19:45:02.3799917Z 2025-05-07T19:45:02.3799922Z 2025-05-07T19:45:02.3799925Z 2025-05-07T19:45:02.3799929Z 2025-05-07T19:45:02.3800651Z 2025-05-07T19:45:02.3800656Z 2025-05-07T19:45:02.3800670Z 2025-05-07T19:45:02.4253062Z harfbuzz-11.0.0 | 1.6 MB | ########## | 100%  2025-05-07T19:45:02.4253602Z 2025-05-07T19:45:02.4253607Z 2025-05-07T19:45:02.4253611Z 2025-05-07T19:45:02.4253615Z 2025-05-07T19:45:02.4253619Z 2025-05-07T19:45:02.4253622Z 2025-05-07T19:45:02.4253626Z 2025-05-07T19:45:02.4253630Z 2025-05-07T19:45:02.4253634Z 2025-05-07T19:45:02.4253637Z 2025-05-07T19:45:02.4253655Z 2025-05-07T19:45:02.4253659Z 2025-05-07T19:45:02.4253663Z 2025-05-07T19:45:02.4253967Z font-ttf-ubuntu-0.83 | 1.5 MB | ########## | 100%  2025-05-07T19:45:02.4254284Z 2025-05-07T19:45:02.4254287Z 2025-05-07T19:45:02.4254291Z 2025-05-07T19:45:02.4254294Z 2025-05-07T19:45:02.4254298Z 2025-05-07T19:45:02.4254301Z 2025-05-07T19:45:02.4254305Z 2025-05-07T19:45:02.4254321Z 2025-05-07T19:45:02.4254325Z 2025-05-07T19:45:02.4254328Z 2025-05-07T19:45:02.4254338Z 2025-05-07T19:45:02.4254343Z 2025-05-07T19:45:02.4254346Z 2025-05-07T19:45:02.5074466Z font-ttf-ubuntu-0.83 | 1.5 MB | ########## | 100%  2025-05-07T19:45:02.5074824Z 2025-05-07T19:45:02.5074857Z 2025-05-07T19:45:02.5074861Z 2025-05-07T19:45:02.5074865Z 2025-05-07T19:45:02.5074868Z 2025-05-07T19:45:02.5074872Z 2025-05-07T19:45:02.5074875Z 2025-05-07T19:45:02.5074879Z 2025-05-07T19:45:02.5074882Z 2025-05-07T19:45:02.5074886Z 2025-05-07T19:45:02.5074889Z 2025-05-07T19:45:02.5074893Z 2025-05-07T19:45:02.5074897Z 2025-05-07T19:45:02.5074900Z 2025-05-07T19:45:02.5075812Z libgfortran5-15.1.0 | 1.5 MB | ########## | 100%  2025-05-07T19:45:02.5076146Z 2025-05-07T19:45:02.5076150Z 2025-05-07T19:45:02.5076154Z 2025-05-07T19:45:02.5076169Z 2025-05-07T19:45:02.5076173Z 2025-05-07T19:45:02.5076177Z 2025-05-07T19:45:02.5076180Z 2025-05-07T19:45:02.5076184Z 2025-05-07T19:45:02.5076187Z 2025-05-07T19:45:02.5076199Z 2025-05-07T19:45:02.5076202Z 2025-05-07T19:45:02.5076206Z 2025-05-07T19:45:02.5076209Z 2025-05-07T19:45:02.5076217Z 2025-05-07T19:45:02.5535267Z libgfortran5-15.1.0 | 1.5 MB | ########## | 100%  2025-05-07T19:45:02.5535639Z 2025-05-07T19:45:02.5535644Z 2025-05-07T19:45:02.5535647Z 2025-05-07T19:45:02.5535651Z 2025-05-07T19:45:02.5535654Z 2025-05-07T19:45:02.5535658Z 2025-05-07T19:45:02.5535661Z 2025-05-07T19:45:02.5535665Z 2025-05-07T19:45:02.5535668Z 2025-05-07T19:45:02.5535672Z 2025-05-07T19:45:02.5535675Z 2025-05-07T19:45:02.5535928Z tk-8.6.13 | 3.2 MB | ########## | 100%  2025-05-07T19:45:02.5536201Z 2025-05-07T19:45:02.5536205Z 2025-05-07T19:45:02.5536209Z 2025-05-07T19:45:02.5536212Z 2025-05-07T19:45:02.5536216Z 2025-05-07T19:45:02.5536220Z 2025-05-07T19:45:02.5536223Z 2025-05-07T19:45:02.5536226Z 2025-05-07T19:45:02.5536230Z 2025-05-07T19:45:02.5536247Z 2025-05-07T19:45:02.5536262Z 2025-05-07T19:45:02.5624114Z tk-8.6.13 | 3.2 MB | ########## | 100%  2025-05-07T19:45:02.6553889Z openjdk-23.0.2 | 181.4 MB | #####3 | 53% 2025-05-07T19:45:02.6554638Z 2025-05-07T19:45:02.6554696Z 2025-05-07T19:45:02.6554709Z 2025-05-07T19:45:02.6554720Z 2025-05-07T19:45:02.6554731Z 2025-05-07T19:45:02.6554741Z 2025-05-07T19:45:02.6554751Z 2025-05-07T19:45:02.6554762Z 2025-05-07T19:45:02.6554773Z 2025-05-07T19:45:02.6554783Z 2025-05-07T19:45:02.6554794Z 2025-05-07T19:45:02.6554804Z 2025-05-07T19:45:02.6554830Z 2025-05-07T19:45:02.6554841Z 2025-05-07T19:45:02.6554851Z 2025-05-07T19:45:02.6554862Z 2025-05-07T19:45:02.6554872Z 2025-05-07T19:45:02.6555860Z cairo-1.18.4 | 955 KB | ########## | 100%  2025-05-07T19:45:02.6556755Z 2025-05-07T19:45:02.6556766Z 2025-05-07T19:45:02.6556777Z 2025-05-07T19:45:02.6556787Z 2025-05-07T19:45:02.6556797Z 2025-05-07T19:45:02.6557213Z 2025-05-07T19:45:02.6557229Z 2025-05-07T19:45:02.6557239Z 2025-05-07T19:45:02.6557250Z 2025-05-07T19:45:02.6557446Z 2025-05-07T19:45:02.6557457Z 2025-05-07T19:45:02.6557467Z 2025-05-07T19:45:02.6557478Z 2025-05-07T19:45:02.6557488Z 2025-05-07T19:45:02.6557498Z 2025-05-07T19:45:02.6557508Z 2025-05-07T19:45:02.6557518Z 2025-05-07T19:45:02.6625326Z cairo-1.18.4 | 955 KB | ########## | 100%  2025-05-07T19:45:02.6838987Z openjdk-23.0.2 | 181.4 MB | #####5 | 56% 2025-05-07T19:45:02.6839289Z 2025-05-07T19:45:02.6839294Z 2025-05-07T19:45:02.6839298Z 2025-05-07T19:45:02.6839301Z 2025-05-07T19:45:02.6839304Z 2025-05-07T19:45:02.6839323Z 2025-05-07T19:45:02.6839327Z 2025-05-07T19:45:02.6839330Z 2025-05-07T19:45:02.6839334Z 2025-05-07T19:45:02.6839337Z 2025-05-07T19:45:02.6839341Z 2025-05-07T19:45:02.6839344Z 2025-05-07T19:45:02.6839348Z 2025-05-07T19:45:02.6839351Z 2025-05-07T19:45:02.6839355Z 2025-05-07T19:45:02.6840592Z krb5-1.21.3 | 1.3 MB | ########## | 100%  2025-05-07T19:45:02.6840903Z 2025-05-07T19:45:02.6840919Z 2025-05-07T19:45:02.6840922Z 2025-05-07T19:45:02.6840926Z 2025-05-07T19:45:02.6840929Z 2025-05-07T19:45:02.6840933Z 2025-05-07T19:45:02.6840936Z 2025-05-07T19:45:02.6840939Z 2025-05-07T19:45:02.6840943Z 2025-05-07T19:45:02.6840946Z 2025-05-07T19:45:02.6840950Z 2025-05-07T19:45:02.6840953Z 2025-05-07T19:45:02.6840956Z 2025-05-07T19:45:02.6840960Z 2025-05-07T19:45:02.6840963Z 2025-05-07T19:45:02.7626958Z krb5-1.21.3 | 1.3 MB | ########## | 100%  2025-05-07T19:45:02.8627760Z openjdk-23.0.2 | 181.4 MB | #####8 | 58% 2025-05-07T19:45:02.9630181Z openjdk-23.0.2 | 181.4 MB | ######1 | 61% 2025-05-07T19:45:03.0630593Z openjdk-23.0.2 | 181.4 MB | ######4 | 65% 2025-05-07T19:45:03.1633533Z openjdk-23.0.2 | 181.4 MB | ######8 | 69% 2025-05-07T19:45:03.2634009Z openjdk-23.0.2 | 181.4 MB | #######2 | 73% 2025-05-07T19:45:03.3121696Z openjdk-23.0.2 | 181.4 MB | #######6 | 77% 2025-05-07T19:45:03.3121990Z 2025-05-07T19:45:03.3121995Z 2025-05-07T19:45:03.3121999Z 2025-05-07T19:45:03.3122003Z 2025-05-07T19:45:03.3122006Z 2025-05-07T19:45:03.3122010Z 2025-05-07T19:45:03.3122013Z 2025-05-07T19:45:03.3122017Z 2025-05-07T19:45:03.3122020Z 2025-05-07T19:45:03.3122024Z 2025-05-07T19:45:03.3122027Z 2025-05-07T19:45:03.3122031Z 2025-05-07T19:45:03.3122034Z 2025-05-07T19:45:03.3122038Z 2025-05-07T19:45:03.3122053Z 2025-05-07T19:45:03.3122057Z 2025-05-07T19:45:03.3123799Z libabseil-20250127.1 | 1.3 MB | ########## | 100%  2025-05-07T19:45:03.3124141Z 2025-05-07T19:45:03.3124145Z 2025-05-07T19:45:03.3124148Z 2025-05-07T19:45:03.3124163Z 2025-05-07T19:45:03.3124166Z 2025-05-07T19:45:03.3124183Z 2025-05-07T19:45:03.3124186Z 2025-05-07T19:45:03.3124190Z 2025-05-07T19:45:03.3124202Z 2025-05-07T19:45:03.3124205Z 2025-05-07T19:45:03.3124209Z 2025-05-07T19:45:03.3124212Z 2025-05-07T19:45:03.3124219Z 2025-05-07T19:45:03.3124223Z 2025-05-07T19:45:03.3124226Z 2025-05-07T19:45:03.3124230Z 2025-05-07T19:45:03.3635526Z libabseil-20250127.1 | 1.3 MB | ########## | 100%  2025-05-07T19:45:03.4802540Z openjdk-23.0.2 | 181.4 MB | ######## | 81% 2025-05-07T19:45:03.6208224Z openjdk-23.0.2 | 181.4 MB | ########4 | 85% 2025-05-07T19:45:03.7215120Z openjdk-23.0.2 | 181.4 MB | ########8 | 88% 2025-05-07T19:45:03.8297358Z openjdk-23.0.2 | 181.4 MB | #########1 | 91% 2025-05-07T19:45:03.9298580Z openjdk-23.0.2 | 181.4 MB | #########4 | 95% 2025-05-07T19:45:03.9861056Z openjdk-23.0.2 | 181.4 MB | #########9 | 100% 2025-05-07T19:45:03.9861451Z 2025-05-07T19:45:03.9861602Z 2025-05-07T19:45:03.9861610Z 2025-05-07T19:45:04.1677005Z cmake-4.0.2 | 19.4 MB | ########## | 100%  2025-05-07T19:45:04.1677304Z 2025-05-07T19:45:04.1677309Z 2025-05-07T19:45:04.1677314Z 2025-05-07T19:45:04.1677463Z 2025-05-07T19:45:04.1677466Z 2025-05-07T19:45:04.1677470Z 2025-05-07T19:45:04.1677473Z 2025-05-07T19:45:04.1677477Z 2025-05-07T19:45:04.1677481Z 2025-05-07T19:45:04.1677484Z 2025-05-07T19:45:04.1677488Z 2025-05-07T19:45:04.1677492Z 2025-05-07T19:45:04.1677495Z 2025-05-07T19:45:04.1677513Z 2025-05-07T19:45:04.1677516Z 2025-05-07T19:45:04.1677520Z 2025-05-07T19:45:04.1677524Z 2025-05-07T19:45:04.1677527Z 2025-05-07T19:45:04.1679509Z pcre2-10.44 | 934 KB | ########## | 100%  2025-05-07T19:45:04.1679821Z 2025-05-07T19:45:04.1679824Z 2025-05-07T19:45:04.1679855Z 2025-05-07T19:45:04.1679859Z 2025-05-07T19:45:04.1679862Z 2025-05-07T19:45:04.1679866Z 2025-05-07T19:45:04.1679869Z 2025-05-07T19:45:04.1679873Z 2025-05-07T19:45:04.1679876Z 2025-05-07T19:45:04.1679880Z 2025-05-07T19:45:04.1679889Z 2025-05-07T19:45:04.1679892Z 2025-05-07T19:45:04.1679896Z 2025-05-07T19:45:04.1679899Z 2025-05-07T19:45:04.1679903Z 2025-05-07T19:45:04.1679910Z 2025-05-07T19:45:04.1679914Z 2025-05-07T19:45:04.1680008Z 2025-05-07T19:45:04.2076972Z pcre2-10.44 | 934 KB | ########## | 100%  2025-05-07T19:45:04.2077316Z 2025-05-07T19:45:04.2077321Z 2025-05-07T19:45:04.2077325Z 2025-05-07T19:45:04.2077329Z 2025-05-07T19:45:04.2077332Z 2025-05-07T19:45:04.2077336Z 2025-05-07T19:45:04.2077339Z 2025-05-07T19:45:04.2077343Z 2025-05-07T19:45:04.2077346Z 2025-05-07T19:45:04.2077350Z 2025-05-07T19:45:04.2077353Z 2025-05-07T19:45:04.2077357Z 2025-05-07T19:45:04.2077360Z 2025-05-07T19:45:04.2077364Z 2025-05-07T19:45:04.2077379Z 2025-05-07T19:45:04.2077383Z 2025-05-07T19:45:04.2077386Z 2025-05-07T19:45:04.2077390Z 2025-05-07T19:45:04.2077393Z 2025-05-07T19:45:04.2078437Z ... (more hidden) ... 2025-05-07T19:45:04.2078723Z 2025-05-07T19:45:04.2078750Z 2025-05-07T19:45:04.2078754Z 2025-05-07T19:45:04.2078757Z 2025-05-07T19:45:04.2078767Z 2025-05-07T19:45:04.2078770Z 2025-05-07T19:45:04.2078774Z 2025-05-07T19:45:04.2078777Z 2025-05-07T19:45:04.2078781Z 2025-05-07T19:45:04.2078784Z 2025-05-07T19:45:04.2078788Z 2025-05-07T19:45:04.2078791Z 2025-05-07T19:45:04.2078795Z 2025-05-07T19:45:04.2078798Z 2025-05-07T19:45:04.2078802Z 2025-05-07T19:45:04.2078805Z 2025-05-07T19:45:04.2078808Z 2025-05-07T19:45:04.2078812Z 2025-05-07T19:45:04.2078815Z 2025-05-07T19:45:04.6355317Z ... (more hidden) ... 2025-05-07T19:45:04.6355648Z 2025-05-07T19:45:04.8970882Z bazel-7.5.0 | 47.4 MB | ########## | 100%  2025-05-07T19:45:04.8971160Z 2025-05-07T19:45:04.8971305Z 2025-05-07T19:45:05.7568469Z python-3.13.2 | 31.7 MB | ########## | 100%  2025-05-07T19:45:06.9607307Z openjdk-23.0.2 | 181.4 MB | ########## | 100% 2025-05-07T19:45:06.9614813Z openjdk-23.0.2 | 181.4 MB | ########## | 100% 2025-05-07T19:45:06.9615591Z 2025-05-07T19:45:06.9615705Z 2025-05-07T19:45:06.9615718Z 2025-05-07T19:45:06.9615730Z 2025-05-07T19:45:06.9615740Z 2025-05-07T19:45:06.9615751Z 2025-05-07T19:45:06.9615761Z 2025-05-07T19:45:06.9615771Z 2025-05-07T19:45:06.9615782Z 2025-05-07T19:45:06.9615792Z 2025-05-07T19:45:06.9615826Z 2025-05-07T19:45:06.9615837Z 2025-05-07T19:45:06.9615847Z 2025-05-07T19:45:06.9615858Z 2025-05-07T19:45:06.9615868Z 2025-05-07T19:45:06.9615878Z 2025-05-07T19:45:06.9615889Z 2025-05-07T19:45:06.9615899Z 2025-05-07T19:45:06.9615909Z 2025-05-07T19:45:06.9616147Z 2025-05-07T19:45:06.9617234Z  2025-05-07T19:45:06.9618166Z 2025-05-07T19:45:06.9618780Z 2025-05-07T19:45:06.9619271Z  2025-05-07T19:45:06.9620422Z 2025-05-07T19:45:06.9620439Z 2025-05-07T19:45:06.9620978Z  2025-05-07T19:45:06.9621830Z 2025-05-07T19:45:06.9621843Z 2025-05-07T19:45:06.9621853Z 2025-05-07T19:45:06.9622344Z  2025-05-07T19:45:06.9622984Z 2025-05-07T19:45:06.9622995Z 2025-05-07T19:45:06.9623006Z 2025-05-07T19:45:06.9623016Z 2025-05-07T19:45:06.9623522Z  2025-05-07T19:45:06.9624091Z 2025-05-07T19:45:06.9624094Z 2025-05-07T19:45:06.9624098Z 2025-05-07T19:45:06.9624101Z 2025-05-07T19:45:06.9624105Z 2025-05-07T19:45:06.9624306Z  2025-05-07T19:45:06.9624529Z 2025-05-07T19:45:06.9624532Z 2025-05-07T19:45:06.9624536Z 2025-05-07T19:45:06.9624540Z 2025-05-07T19:45:06.9624543Z 2025-05-07T19:45:06.9624546Z 2025-05-07T19:45:06.9624756Z  2025-05-07T19:45:06.9624983Z 2025-05-07T19:45:06.9624986Z 2025-05-07T19:45:06.9624990Z 2025-05-07T19:45:06.9624999Z 2025-05-07T19:45:06.9625002Z 2025-05-07T19:45:06.9625006Z 2025-05-07T19:45:06.9625010Z 2025-05-07T19:45:06.9625357Z  2025-05-07T19:45:06.9625609Z 2025-05-07T19:45:06.9625613Z 2025-05-07T19:45:06.9625616Z 2025-05-07T19:45:06.9625620Z 2025-05-07T19:45:06.9625623Z 2025-05-07T19:45:06.9625627Z 2025-05-07T19:45:06.9625631Z 2025-05-07T19:45:06.9625635Z 2025-05-07T19:45:06.9625829Z  2025-05-07T19:45:06.9626083Z 2025-05-07T19:45:06.9626086Z 2025-05-07T19:45:06.9626089Z 2025-05-07T19:45:06.9626093Z 2025-05-07T19:45:06.9626096Z 2025-05-07T19:45:06.9626100Z 2025-05-07T19:45:06.9626104Z 2025-05-07T19:45:06.9626107Z 2025-05-07T19:45:06.9626111Z 2025-05-07T19:45:06.9626315Z  2025-05-07T19:45:06.9626569Z 2025-05-07T19:45:06.9626572Z 2025-05-07T19:45:06.9626580Z 2025-05-07T19:45:06.9626583Z 2025-05-07T19:45:06.9626587Z 2025-05-07T19:45:06.9626591Z 2025-05-07T19:45:06.9626595Z 2025-05-07T19:45:06.9626598Z 2025-05-07T19:45:06.9626602Z 2025-05-07T19:45:06.9626605Z 2025-05-07T19:45:06.9626814Z  2025-05-07T19:45:06.9627074Z 2025-05-07T19:45:06.9627077Z 2025-05-07T19:45:06.9627081Z 2025-05-07T19:45:06.9627085Z 2025-05-07T19:45:06.9627088Z 2025-05-07T19:45:06.9627092Z 2025-05-07T19:45:06.9627095Z 2025-05-07T19:45:06.9627098Z 2025-05-07T19:45:06.9627102Z 2025-05-07T19:45:06.9627106Z 2025-05-07T19:45:06.9627109Z 2025-05-07T19:45:06.9627313Z  2025-05-07T19:45:06.9627567Z 2025-05-07T19:45:06.9627571Z 2025-05-07T19:45:06.9627574Z 2025-05-07T19:45:06.9627582Z 2025-05-07T19:45:06.9627585Z 2025-05-07T19:45:06.9627589Z 2025-05-07T19:45:06.9627592Z 2025-05-07T19:45:06.9627596Z 2025-05-07T19:45:06.9627603Z 2025-05-07T19:45:06.9627606Z 2025-05-07T19:45:06.9627609Z 2025-05-07T19:45:06.9627613Z 2025-05-07T19:45:06.9627819Z  2025-05-07T19:45:06.9628074Z 2025-05-07T19:45:06.9628079Z 2025-05-07T19:45:06.9628082Z 2025-05-07T19:45:06.9628086Z 2025-05-07T19:45:06.9628089Z 2025-05-07T19:45:06.9628093Z 2025-05-07T19:45:06.9628096Z 2025-05-07T19:45:06.9628100Z 2025-05-07T19:45:06.9628103Z 2025-05-07T19:45:06.9628107Z 2025-05-07T19:45:06.9628110Z 2025-05-07T19:45:06.9628114Z 2025-05-07T19:45:06.9628117Z 2025-05-07T19:45:06.9628342Z  2025-05-07T19:45:06.9628586Z 2025-05-07T19:45:06.9628590Z 2025-05-07T19:45:06.9628594Z 2025-05-07T19:45:06.9628597Z 2025-05-07T19:45:06.9628675Z 2025-05-07T19:45:06.9628679Z 2025-05-07T19:45:06.9628683Z 2025-05-07T19:45:06.9628686Z 2025-05-07T19:45:06.9628690Z 2025-05-07T19:45:06.9628792Z 2025-05-07T19:45:06.9628796Z 2025-05-07T19:45:06.9628799Z 2025-05-07T19:45:06.9628803Z 2025-05-07T19:45:06.9628824Z 2025-05-07T19:45:06.9629046Z  2025-05-07T19:45:06.9629293Z 2025-05-07T19:45:06.9629297Z 2025-05-07T19:45:06.9629300Z 2025-05-07T19:45:06.9629304Z 2025-05-07T19:45:06.9629307Z 2025-05-07T19:45:06.9629311Z 2025-05-07T19:45:06.9629314Z 2025-05-07T19:45:06.9629318Z 2025-05-07T19:45:06.9629321Z 2025-05-07T19:45:06.9629354Z 2025-05-07T19:45:06.9629357Z 2025-05-07T19:45:06.9629361Z 2025-05-07T19:45:06.9629364Z 2025-05-07T19:45:06.9629368Z 2025-05-07T19:45:06.9629371Z 2025-05-07T19:45:06.9629596Z  2025-05-07T19:45:06.9629847Z 2025-05-07T19:45:06.9629855Z 2025-05-07T19:45:06.9629859Z 2025-05-07T19:45:06.9629880Z 2025-05-07T19:45:06.9629883Z 2025-05-07T19:45:06.9629887Z 2025-05-07T19:45:06.9629894Z 2025-05-07T19:45:06.9629898Z 2025-05-07T19:45:06.9629902Z 2025-05-07T19:45:06.9629905Z 2025-05-07T19:45:06.9629909Z 2025-05-07T19:45:06.9629912Z 2025-05-07T19:45:06.9629916Z 2025-05-07T19:45:06.9629919Z 2025-05-07T19:45:06.9629922Z 2025-05-07T19:45:06.9629926Z 2025-05-07T19:45:06.9630153Z  2025-05-07T19:45:06.9630426Z 2025-05-07T19:45:06.9630429Z 2025-05-07T19:45:06.9630433Z 2025-05-07T19:45:06.9630437Z 2025-05-07T19:45:06.9630440Z 2025-05-07T19:45:06.9630444Z 2025-05-07T19:45:06.9630447Z 2025-05-07T19:45:06.9630451Z 2025-05-07T19:45:06.9630454Z 2025-05-07T19:45:06.9630458Z 2025-05-07T19:45:06.9630461Z 2025-05-07T19:45:06.9630465Z 2025-05-07T19:45:06.9630469Z 2025-05-07T19:45:06.9630472Z 2025-05-07T19:45:06.9630476Z 2025-05-07T19:45:06.9630482Z 2025-05-07T19:45:06.9630486Z 2025-05-07T19:45:06.9630738Z  2025-05-07T19:45:06.9630996Z 2025-05-07T19:45:06.9630999Z 2025-05-07T19:45:06.9631003Z 2025-05-07T19:45:06.9631006Z 2025-05-07T19:45:06.9631010Z 2025-05-07T19:45:06.9631014Z 2025-05-07T19:45:06.9631018Z 2025-05-07T19:45:06.9631021Z 2025-05-07T19:45:06.9631024Z 2025-05-07T19:45:06.9631028Z 2025-05-07T19:45:06.9631032Z 2025-05-07T19:45:06.9631035Z 2025-05-07T19:45:06.9631039Z 2025-05-07T19:45:06.9631060Z 2025-05-07T19:45:06.9631064Z 2025-05-07T19:45:06.9631067Z 2025-05-07T19:45:06.9631071Z 2025-05-07T19:45:06.9631074Z 2025-05-07T19:45:06.9631455Z  2025-05-07T19:45:06.9631714Z 2025-05-07T19:45:06.9631718Z 2025-05-07T19:45:06.9631841Z  2025-05-07T19:45:06.9631951Z 2025-05-07T19:45:06.9631955Z 2025-05-07T19:45:06.9632062Z  2025-05-07T19:45:06.9632201Z 2025-05-07T19:45:06.9632205Z 2025-05-07T19:45:06.9632208Z 2025-05-07T19:45:06.9632315Z  2025-05-07T19:45:06.9632523Z 2025-05-07T19:45:06.9632526Z 2025-05-07T19:45:06.9632530Z 2025-05-07T19:45:06.9632535Z 2025-05-07T19:45:06.9632662Z  2025-05-07T19:45:06.9632786Z 2025-05-07T19:45:06.9632789Z 2025-05-07T19:45:06.9632793Z 2025-05-07T19:45:06.9632796Z 2025-05-07T19:45:06.9632801Z 2025-05-07T19:45:06.9632908Z  2025-05-07T19:45:06.9633053Z 2025-05-07T19:45:06.9633057Z 2025-05-07T19:45:06.9633060Z 2025-05-07T19:45:06.9633064Z 2025-05-07T19:45:06.9633067Z 2025-05-07T19:45:06.9633071Z 2025-05-07T19:45:06.9633182Z  2025-05-07T19:45:06.9633317Z 2025-05-07T19:45:06.9633321Z 2025-05-07T19:45:06.9633341Z 2025-05-07T19:45:06.9633344Z 2025-05-07T19:45:06.9633348Z 2025-05-07T19:45:06.9633351Z 2025-05-07T19:45:06.9633355Z 2025-05-07T19:45:06.9633470Z  2025-05-07T19:45:06.9633697Z 2025-05-07T19:45:06.9633701Z 2025-05-07T19:45:06.9633705Z 2025-05-07T19:45:06.9633708Z 2025-05-07T19:45:06.9633712Z 2025-05-07T19:45:06.9633777Z 2025-05-07T19:45:06.9633800Z 2025-05-07T19:45:06.9633803Z 2025-05-07T19:45:06.9633930Z  2025-05-07T19:45:06.9634093Z 2025-05-07T19:45:06.9634096Z 2025-05-07T19:45:06.9634100Z 2025-05-07T19:45:06.9634104Z 2025-05-07T19:45:06.9634107Z 2025-05-07T19:45:06.9634111Z 2025-05-07T19:45:06.9634115Z 2025-05-07T19:45:06.9634118Z 2025-05-07T19:45:06.9634122Z 2025-05-07T19:45:06.9634271Z  2025-05-07T19:45:06.9634439Z 2025-05-07T19:45:06.9634443Z 2025-05-07T19:45:06.9634446Z 2025-05-07T19:45:06.9634450Z 2025-05-07T19:45:06.9634453Z 2025-05-07T19:45:06.9634457Z 2025-05-07T19:45:06.9634461Z 2025-05-07T19:45:06.9634465Z 2025-05-07T19:45:06.9634468Z 2025-05-07T19:45:06.9634472Z 2025-05-07T19:45:06.9634632Z  2025-05-07T19:45:06.9634804Z 2025-05-07T19:45:06.9634808Z 2025-05-07T19:45:06.9634815Z 2025-05-07T19:45:06.9634819Z 2025-05-07T19:45:06.9634823Z 2025-05-07T19:45:06.9634826Z 2025-05-07T19:45:06.9634833Z 2025-05-07T19:45:06.9634837Z 2025-05-07T19:45:06.9634840Z 2025-05-07T19:45:06.9634843Z 2025-05-07T19:45:06.9634847Z 2025-05-07T19:45:06.9635006Z  2025-05-07T19:45:06.9635188Z 2025-05-07T19:45:06.9635191Z 2025-05-07T19:45:06.9635195Z 2025-05-07T19:45:06.9635198Z 2025-05-07T19:45:06.9635202Z 2025-05-07T19:45:06.9635206Z 2025-05-07T19:45:06.9635209Z 2025-05-07T19:45:06.9635213Z 2025-05-07T19:45:06.9635216Z 2025-05-07T19:45:06.9635220Z 2025-05-07T19:45:06.9635223Z 2025-05-07T19:45:06.9635227Z 2025-05-07T19:45:06.9635386Z  2025-05-07T19:45:06.9635576Z 2025-05-07T19:45:06.9635580Z 2025-05-07T19:45:06.9635584Z 2025-05-07T19:45:06.9635587Z 2025-05-07T19:45:06.9635591Z 2025-05-07T19:45:06.9635594Z 2025-05-07T19:45:06.9635598Z 2025-05-07T19:45:06.9635601Z 2025-05-07T19:45:06.9635605Z 2025-05-07T19:45:06.9635612Z 2025-05-07T19:45:06.9635616Z 2025-05-07T19:45:06.9635635Z 2025-05-07T19:45:06.9635639Z 2025-05-07T19:45:06.9635785Z  2025-05-07T19:45:06.9635982Z 2025-05-07T19:45:06.9635985Z 2025-05-07T19:45:06.9635989Z 2025-05-07T19:45:06.9635992Z 2025-05-07T19:45:06.9635996Z 2025-05-07T19:45:06.9635999Z 2025-05-07T19:45:06.9636003Z 2025-05-07T19:45:06.9636006Z 2025-05-07T19:45:06.9636029Z 2025-05-07T19:45:06.9636033Z 2025-05-07T19:45:06.9636036Z 2025-05-07T19:45:06.9636040Z 2025-05-07T19:45:06.9636043Z 2025-05-07T19:45:06.9636047Z 2025-05-07T19:45:06.9636204Z  2025-05-07T19:45:06.9636405Z 2025-05-07T19:45:06.9636409Z 2025-05-07T19:45:06.9636412Z 2025-05-07T19:45:06.9636416Z 2025-05-07T19:45:06.9636420Z 2025-05-07T19:45:06.9636441Z 2025-05-07T19:45:06.9636445Z 2025-05-07T19:45:06.9636448Z 2025-05-07T19:45:06.9636451Z 2025-05-07T19:45:06.9636455Z 2025-05-07T19:45:06.9636458Z 2025-05-07T19:45:06.9636465Z 2025-05-07T19:45:06.9636469Z 2025-05-07T19:45:06.9636472Z 2025-05-07T19:45:06.9636476Z 2025-05-07T19:45:06.9636628Z  2025-05-07T19:45:06.9636857Z 2025-05-07T19:45:06.9636861Z 2025-05-07T19:45:06.9636864Z 2025-05-07T19:45:06.9636868Z 2025-05-07T19:45:06.9636871Z 2025-05-07T19:45:06.9636874Z 2025-05-07T19:45:06.9636878Z 2025-05-07T19:45:06.9636881Z 2025-05-07T19:45:06.9636885Z 2025-05-07T19:45:06.9636889Z 2025-05-07T19:45:06.9636893Z 2025-05-07T19:45:06.9636897Z 2025-05-07T19:45:06.9636900Z 2025-05-07T19:45:06.9636903Z 2025-05-07T19:45:06.9636907Z 2025-05-07T19:45:06.9636910Z 2025-05-07T19:45:06.9637075Z  2025-05-07T19:45:06.9637343Z 2025-05-07T19:45:06.9637347Z 2025-05-07T19:45:06.9637350Z 2025-05-07T19:45:06.9637354Z 2025-05-07T19:45:06.9637357Z 2025-05-07T19:45:06.9637361Z 2025-05-07T19:45:06.9637364Z 2025-05-07T19:45:06.9637368Z 2025-05-07T19:45:06.9637371Z 2025-05-07T19:45:06.9637434Z 2025-05-07T19:45:06.9637438Z 2025-05-07T19:45:06.9637442Z 2025-05-07T19:45:06.9637445Z 2025-05-07T19:45:06.9637448Z 2025-05-07T19:45:06.9637520Z 2025-05-07T19:45:06.9637524Z 2025-05-07T19:45:06.9637701Z 2025-05-07T19:45:06.9637886Z  2025-05-07T19:45:06.9638107Z 2025-05-07T19:45:06.9638111Z 2025-05-07T19:45:06.9638114Z 2025-05-07T19:45:06.9638118Z 2025-05-07T19:45:06.9638122Z 2025-05-07T19:45:06.9638125Z 2025-05-07T19:45:06.9638129Z 2025-05-07T19:45:06.9638132Z 2025-05-07T19:45:06.9638136Z 2025-05-07T19:45:06.9638140Z 2025-05-07T19:45:06.9638162Z 2025-05-07T19:45:06.9638166Z 2025-05-07T19:45:06.9638169Z 2025-05-07T19:45:06.9638172Z 2025-05-07T19:45:06.9638176Z 2025-05-07T19:45:06.9638179Z 2025-05-07T19:45:06.9638183Z 2025-05-07T19:45:06.9638187Z 2025-05-07T19:45:06.9638359Z  2025-05-07T19:45:06.9638580Z 2025-05-07T19:45:06.9638584Z 2025-05-07T19:45:06.9638707Z  2025-05-07T19:45:06.9638823Z 2025-05-07T19:45:06.9638826Z 2025-05-07T19:45:06.9638926Z  2025-05-07T19:45:06.9639060Z 2025-05-07T19:45:06.9639067Z 2025-05-07T19:45:06.9639071Z 2025-05-07T19:45:06.9639173Z  2025-05-07T19:45:06.9639285Z 2025-05-07T19:45:06.9639288Z 2025-05-07T19:45:06.9639292Z 2025-05-07T19:45:06.9639296Z 2025-05-07T19:45:06.9639422Z  2025-05-07T19:45:06.9639544Z 2025-05-07T19:45:06.9639547Z 2025-05-07T19:45:06.9639551Z 2025-05-07T19:45:06.9639555Z 2025-05-07T19:45:06.9639558Z 2025-05-07T19:45:06.9639666Z  2025-05-07T19:45:06.9639812Z 2025-05-07T19:45:06.9639816Z 2025-05-07T19:45:06.9639820Z 2025-05-07T19:45:06.9639823Z 2025-05-07T19:45:06.9639827Z 2025-05-07T19:45:06.9639830Z 2025-05-07T19:45:06.9639941Z  2025-05-07T19:45:06.9640095Z 2025-05-07T19:45:06.9640098Z 2025-05-07T19:45:06.9640102Z 2025-05-07T19:45:06.9640105Z 2025-05-07T19:45:06.9640108Z 2025-05-07T19:45:06.9640113Z 2025-05-07T19:45:06.9640116Z 2025-05-07T19:45:06.9640232Z  2025-05-07T19:45:06.9640375Z 2025-05-07T19:45:06.9640378Z 2025-05-07T19:45:06.9640382Z 2025-05-07T19:45:06.9640389Z 2025-05-07T19:45:06.9640410Z 2025-05-07T19:45:06.9640413Z 2025-05-07T19:45:06.9640417Z 2025-05-07T19:45:06.9640420Z 2025-05-07T19:45:06.9640538Z  2025-05-07T19:45:06.9640691Z 2025-05-07T19:45:06.9640694Z 2025-05-07T19:45:06.9640698Z 2025-05-07T19:45:06.9640701Z 2025-05-07T19:45:06.9640705Z 2025-05-07T19:45:06.9640708Z 2025-05-07T19:45:06.9640712Z 2025-05-07T19:45:06.9640732Z 2025-05-07T19:45:06.9640736Z 2025-05-07T19:45:06.9640857Z  2025-05-07T19:45:06.9641018Z 2025-05-07T19:45:06.9641022Z 2025-05-07T19:45:06.9641025Z 2025-05-07T19:45:06.9641029Z 2025-05-07T19:45:06.9641032Z 2025-05-07T19:45:06.9641036Z 2025-05-07T19:45:06.9641039Z 2025-05-07T19:45:06.9641043Z 2025-05-07T19:45:06.9641046Z 2025-05-07T19:45:06.9641066Z 2025-05-07T19:45:06.9641197Z  2025-05-07T19:45:06.9641365Z 2025-05-07T19:45:06.9641369Z 2025-05-07T19:45:06.9641372Z 2025-05-07T19:45:06.9641376Z 2025-05-07T19:45:06.9641382Z 2025-05-07T19:45:06.9641386Z 2025-05-07T19:45:06.9641389Z 2025-05-07T19:45:06.9641392Z 2025-05-07T19:45:06.9641396Z 2025-05-07T19:45:06.9641399Z 2025-05-07T19:45:06.9641419Z 2025-05-07T19:45:06.9641548Z  2025-05-07T19:45:06.9641726Z 2025-05-07T19:45:06.9641729Z 2025-05-07T19:45:06.9641733Z 2025-05-07T19:45:06.9641736Z 2025-05-07T19:45:06.9641740Z 2025-05-07T19:45:06.9641743Z 2025-05-07T19:45:06.9641746Z 2025-05-07T19:45:06.9641750Z 2025-05-07T19:45:06.9641753Z 2025-05-07T19:45:06.9641757Z 2025-05-07T19:45:06.9641776Z 2025-05-07T19:45:06.9641780Z 2025-05-07T19:45:06.9641913Z  2025-05-07T19:45:06.9642098Z 2025-05-07T19:45:06.9642102Z 2025-05-07T19:45:06.9642106Z 2025-05-07T19:45:06.9642109Z 2025-05-07T19:45:06.9642113Z 2025-05-07T19:45:06.9642116Z 2025-05-07T19:45:06.9642181Z 2025-05-07T19:45:06.9642185Z 2025-05-07T19:45:06.9642188Z 2025-05-07T19:45:06.9642211Z 2025-05-07T19:45:06.9642215Z 2025-05-07T19:45:06.9642277Z 2025-05-07T19:45:06.9642281Z 2025-05-07T19:45:06.9642418Z  2025-05-07T19:45:06.9642613Z 2025-05-07T19:45:06.9642617Z 2025-05-07T19:45:06.9642620Z 2025-05-07T19:45:06.9642624Z 2025-05-07T19:45:06.9642627Z 2025-05-07T19:45:06.9642630Z 2025-05-07T19:45:06.9642651Z 2025-05-07T19:45:06.9642654Z 2025-05-07T19:45:06.9642658Z 2025-05-07T19:45:06.9642661Z 2025-05-07T19:45:06.9642665Z 2025-05-07T19:45:06.9642668Z 2025-05-07T19:45:06.9642672Z 2025-05-07T19:45:06.9642676Z 2025-05-07T19:45:06.9642819Z  2025-05-07T19:45:06.9643022Z 2025-05-07T19:45:06.9643025Z 2025-05-07T19:45:06.9643029Z 2025-05-07T19:45:06.9643049Z 2025-05-07T19:45:06.9643053Z 2025-05-07T19:45:06.9643056Z 2025-05-07T19:45:06.9643060Z 2025-05-07T19:45:06.9643063Z 2025-05-07T19:45:06.9643067Z 2025-05-07T19:45:06.9643074Z 2025-05-07T19:45:06.9643077Z 2025-05-07T19:45:06.9643081Z 2025-05-07T19:45:06.9643084Z 2025-05-07T19:45:06.9643091Z 2025-05-07T19:45:06.9643094Z 2025-05-07T19:45:06.9643243Z  2025-05-07T19:45:06.9643465Z 2025-05-07T19:45:06.9643469Z 2025-05-07T19:45:06.9643472Z 2025-05-07T19:45:06.9643476Z 2025-05-07T19:45:06.9643479Z 2025-05-07T19:45:06.9643483Z 2025-05-07T19:45:06.9643486Z 2025-05-07T19:45:06.9643490Z 2025-05-07T19:45:06.9643493Z 2025-05-07T19:45:06.9643497Z 2025-05-07T19:45:06.9643500Z 2025-05-07T19:45:06.9643504Z 2025-05-07T19:45:06.9643507Z 2025-05-07T19:45:06.9643511Z 2025-05-07T19:45:06.9643515Z 2025-05-07T19:45:06.9643518Z 2025-05-07T19:45:06.9643672Z  2025-05-07T19:45:06.9643901Z 2025-05-07T19:45:06.9643905Z 2025-05-07T19:45:06.9643908Z 2025-05-07T19:45:06.9643912Z 2025-05-07T19:45:06.9643916Z 2025-05-07T19:45:06.9643919Z 2025-05-07T19:45:06.9643923Z 2025-05-07T19:45:06.9643929Z 2025-05-07T19:45:06.9643933Z 2025-05-07T19:45:06.9643937Z 2025-05-07T19:45:06.9643941Z 2025-05-07T19:45:06.9643947Z 2025-05-07T19:45:06.9643951Z 2025-05-07T19:45:06.9643954Z 2025-05-07T19:45:06.9643958Z 2025-05-07T19:45:06.9643961Z 2025-05-07T19:45:06.9643965Z 2025-05-07T19:45:06.9644141Z  2025-05-07T19:45:06.9644356Z 2025-05-07T19:45:06.9644360Z 2025-05-07T19:45:06.9644364Z 2025-05-07T19:45:06.9644367Z 2025-05-07T19:45:06.9644371Z 2025-05-07T19:45:06.9644374Z 2025-05-07T19:45:06.9644378Z 2025-05-07T19:45:06.9644381Z 2025-05-07T19:45:06.9644385Z 2025-05-07T19:45:06.9644388Z 2025-05-07T19:45:06.9644403Z 2025-05-07T19:45:06.9644406Z 2025-05-07T19:45:06.9644410Z 2025-05-07T19:45:06.9644413Z 2025-05-07T19:45:06.9644417Z 2025-05-07T19:45:06.9644420Z 2025-05-07T19:45:06.9644424Z 2025-05-07T19:45:06.9644427Z 2025-05-07T19:45:06.9644612Z  2025-05-07T19:45:06.9644856Z 2025-05-07T19:45:06.9644859Z 2025-05-07T19:45:06.9644960Z  2025-05-07T19:45:06.9645068Z 2025-05-07T19:45:06.9645071Z 2025-05-07T19:45:06.9645196Z  2025-05-07T19:45:06.9645306Z 2025-05-07T19:45:06.9645310Z 2025-05-07T19:45:06.9645314Z 2025-05-07T19:45:06.9645422Z  2025-05-07T19:45:06.9645557Z 2025-05-07T19:45:06.9645560Z 2025-05-07T19:45:06.9645563Z 2025-05-07T19:45:06.9645567Z 2025-05-07T19:45:06.9645676Z  2025-05-07T19:45:06.9645799Z 2025-05-07T19:45:06.9645802Z 2025-05-07T19:45:06.9645806Z 2025-05-07T19:45:06.9645810Z 2025-05-07T19:45:06.9645833Z 2025-05-07T19:45:06.9645942Z  2025-05-07T19:45:06.9646070Z 2025-05-07T19:45:06.9646074Z 2025-05-07T19:45:06.9646078Z 2025-05-07T19:45:06.9646081Z 2025-05-07T19:45:06.9646084Z 2025-05-07T19:45:06.9646088Z 2025-05-07T19:45:06.9646226Z  2025-05-07T19:45:06.9646371Z 2025-05-07T19:45:06.9646375Z 2025-05-07T19:45:06.9646378Z 2025-05-07T19:45:06.9646382Z 2025-05-07T19:45:06.9646460Z 2025-05-07T19:45:06.9646465Z 2025-05-07T19:45:06.9646468Z 2025-05-07T19:45:06.9646593Z  2025-05-07T19:45:06.9646936Z 2025-05-07T19:45:06.9646939Z 2025-05-07T19:45:06.9646943Z 2025-05-07T19:45:06.9646946Z 2025-05-07T19:45:06.9646950Z 2025-05-07T19:45:06.9646953Z 2025-05-07T19:45:06.9646957Z 2025-05-07T19:45:06.9646960Z 2025-05-07T19:45:06.9647084Z  2025-05-07T19:45:06.9647270Z 2025-05-07T19:45:06.9647274Z 2025-05-07T19:45:06.9647277Z 2025-05-07T19:45:06.9647281Z 2025-05-07T19:45:06.9647284Z 2025-05-07T19:45:06.9647289Z 2025-05-07T19:45:06.9647293Z 2025-05-07T19:45:06.9647296Z 2025-05-07T19:45:06.9647299Z 2025-05-07T19:45:06.9647425Z  2025-05-07T19:45:06.9647604Z 2025-05-07T19:45:06.9647608Z 2025-05-07T19:45:06.9647611Z 2025-05-07T19:45:06.9647615Z 2025-05-07T19:45:06.9647618Z 2025-05-07T19:45:06.9647622Z 2025-05-07T19:45:06.9647625Z 2025-05-07T19:45:06.9647629Z 2025-05-07T19:45:06.9647632Z 2025-05-07T19:45:06.9647639Z 2025-05-07T19:45:06.9647765Z  2025-05-07T19:45:06.9647931Z 2025-05-07T19:45:06.9647954Z 2025-05-07T19:45:06.9647961Z 2025-05-07T19:45:06.9647964Z 2025-05-07T19:45:06.9647968Z 2025-05-07T19:45:06.9647971Z 2025-05-07T19:45:06.9647974Z 2025-05-07T19:45:06.9647978Z 2025-05-07T19:45:06.9647982Z 2025-05-07T19:45:06.9647986Z 2025-05-07T19:45:06.9647990Z 2025-05-07T19:45:06.9648121Z  2025-05-07T19:45:06.9648299Z 2025-05-07T19:45:06.9648329Z 2025-05-07T19:45:06.9648332Z 2025-05-07T19:45:06.9648335Z 2025-05-07T19:45:06.9648339Z 2025-05-07T19:45:06.9648342Z 2025-05-07T19:45:06.9648346Z 2025-05-07T19:45:06.9648349Z 2025-05-07T19:45:06.9648352Z 2025-05-07T19:45:06.9648356Z 2025-05-07T19:45:06.9648359Z 2025-05-07T19:45:06.9648363Z 2025-05-07T19:45:06.9648503Z  2025-05-07T19:45:06.9648723Z 2025-05-07T19:45:06.9648726Z 2025-05-07T19:45:06.9648729Z 2025-05-07T19:45:06.9648733Z 2025-05-07T19:45:06.9648740Z 2025-05-07T19:45:06.9648743Z 2025-05-07T19:45:06.9648747Z 2025-05-07T19:45:06.9648751Z 2025-05-07T19:45:06.9648754Z 2025-05-07T19:45:06.9648761Z 2025-05-07T19:45:06.9648765Z 2025-05-07T19:45:06.9648768Z 2025-05-07T19:45:06.9648772Z 2025-05-07T19:45:06.9648907Z  2025-05-07T19:45:06.9649123Z 2025-05-07T19:45:06.9649126Z 2025-05-07T19:45:06.9649130Z 2025-05-07T19:45:06.9649133Z 2025-05-07T19:45:06.9649137Z 2025-05-07T19:45:06.9649140Z 2025-05-07T19:45:06.9649143Z 2025-05-07T19:45:06.9649147Z 2025-05-07T19:45:06.9649150Z 2025-05-07T19:45:06.9649154Z 2025-05-07T19:45:06.9649157Z 2025-05-07T19:45:06.9649161Z 2025-05-07T19:45:06.9649164Z 2025-05-07T19:45:06.9649167Z 2025-05-07T19:45:06.9649311Z  2025-05-07T19:45:06.9649532Z 2025-05-07T19:45:06.9649536Z 2025-05-07T19:45:06.9649539Z 2025-05-07T19:45:06.9649542Z 2025-05-07T19:45:06.9649546Z 2025-05-07T19:45:06.9649549Z 2025-05-07T19:45:06.9649555Z 2025-05-07T19:45:06.9649559Z 2025-05-07T19:45:06.9649562Z 2025-05-07T19:45:06.9649566Z 2025-05-07T19:45:06.9649569Z 2025-05-07T19:45:06.9649575Z 2025-05-07T19:45:06.9649578Z 2025-05-07T19:45:06.9649582Z 2025-05-07T19:45:06.9649585Z 2025-05-07T19:45:06.9649752Z  2025-05-07T19:45:06.9649957Z 2025-05-07T19:45:06.9649961Z 2025-05-07T19:45:06.9649964Z 2025-05-07T19:45:06.9649968Z 2025-05-07T19:45:06.9649971Z 2025-05-07T19:45:06.9649974Z 2025-05-07T19:45:06.9649978Z 2025-05-07T19:45:06.9649981Z 2025-05-07T19:45:06.9649984Z 2025-05-07T19:45:06.9649988Z 2025-05-07T19:45:06.9649992Z 2025-05-07T19:45:06.9649996Z 2025-05-07T19:45:06.9649999Z 2025-05-07T19:45:06.9650020Z 2025-05-07T19:45:06.9650023Z 2025-05-07T19:45:06.9650027Z 2025-05-07T19:45:06.9650183Z  2025-05-07T19:45:06.9650391Z 2025-05-07T19:45:06.9650394Z 2025-05-07T19:45:06.9650398Z 2025-05-07T19:45:06.9650401Z 2025-05-07T19:45:06.9650465Z 2025-05-07T19:45:06.9650470Z 2025-05-07T19:45:06.9650473Z 2025-05-07T19:45:06.9650477Z 2025-05-07T19:45:06.9650503Z 2025-05-07T19:45:06.9650563Z 2025-05-07T19:45:06.9650567Z 2025-05-07T19:45:06.9650571Z 2025-05-07T19:45:06.9650574Z 2025-05-07T19:45:06.9650578Z 2025-05-07T19:45:06.9650581Z 2025-05-07T19:45:06.9650585Z 2025-05-07T19:45:06.9650588Z 2025-05-07T19:45:06.9650743Z  2025-05-07T19:45:06.9650957Z 2025-05-07T19:45:06.9650961Z 2025-05-07T19:45:06.9650984Z 2025-05-07T19:45:06.9650988Z 2025-05-07T19:45:06.9650991Z 2025-05-07T19:45:06.9650994Z 2025-05-07T19:45:06.9650997Z 2025-05-07T19:45:06.9651001Z 2025-05-07T19:45:06.9651004Z 2025-05-07T19:45:06.9651008Z 2025-05-07T19:45:06.9651011Z 2025-05-07T19:45:06.9651015Z 2025-05-07T19:45:06.9651018Z 2025-05-07T19:45:06.9651022Z 2025-05-07T19:45:06.9651025Z 2025-05-07T19:45:06.9651028Z 2025-05-07T19:45:06.9651032Z 2025-05-07T19:45:06.9651035Z 2025-05-07T19:45:06.9651204Z  2025-05-07T19:45:06.9651439Z 2025-05-07T19:45:06.9651442Z 2025-05-07T19:45:06.9651542Z  2025-05-07T19:45:06.9651649Z 2025-05-07T19:45:06.9651653Z 2025-05-07T19:45:06.9651781Z  2025-05-07T19:45:06.9651891Z 2025-05-07T19:45:06.9651895Z 2025-05-07T19:45:06.9651898Z 2025-05-07T19:45:06.9652004Z  2025-05-07T19:45:06.9652133Z 2025-05-07T19:45:06.9652137Z 2025-05-07T19:45:06.9652140Z 2025-05-07T19:45:06.9652143Z 2025-05-07T19:45:06.9652245Z  2025-05-07T19:45:06.9652361Z 2025-05-07T19:45:06.9652365Z 2025-05-07T19:45:06.9652369Z 2025-05-07T19:45:06.9652372Z 2025-05-07T19:45:06.9652393Z 2025-05-07T19:45:06.9652496Z  2025-05-07T19:45:06.9652618Z 2025-05-07T19:45:06.9652621Z 2025-05-07T19:45:06.9652625Z 2025-05-07T19:45:06.9652628Z 2025-05-07T19:45:06.9652632Z 2025-05-07T19:45:06.9652635Z 2025-05-07T19:45:06.9652757Z  2025-05-07T19:45:06.9652882Z 2025-05-07T19:45:06.9652885Z 2025-05-07T19:45:06.9652892Z 2025-05-07T19:45:06.9652896Z 2025-05-07T19:45:06.9652899Z 2025-05-07T19:45:06.9652903Z 2025-05-07T19:45:06.9652906Z 2025-05-07T19:45:06.9653022Z  2025-05-07T19:45:06.9653174Z 2025-05-07T19:45:06.9653177Z 2025-05-07T19:45:06.9653181Z 2025-05-07T19:45:06.9653184Z 2025-05-07T19:45:06.9653188Z 2025-05-07T19:45:06.9653191Z 2025-05-07T19:45:06.9653195Z 2025-05-07T19:45:06.9653198Z 2025-05-07T19:45:06.9653316Z  2025-05-07T19:45:06.9653478Z 2025-05-07T19:45:06.9653481Z 2025-05-07T19:45:06.9653485Z 2025-05-07T19:45:06.9653488Z 2025-05-07T19:45:06.9653492Z 2025-05-07T19:45:06.9653496Z 2025-05-07T19:45:06.9653499Z 2025-05-07T19:45:06.9653503Z 2025-05-07T19:45:06.9653506Z 2025-05-07T19:45:06.9653625Z  2025-05-07T19:45:06.9653799Z 2025-05-07T19:45:06.9653802Z 2025-05-07T19:45:06.9653805Z 2025-05-07T19:45:06.9653808Z 2025-05-07T19:45:06.9653812Z 2025-05-07T19:45:06.9653815Z 2025-05-07T19:45:06.9653818Z 2025-05-07T19:45:06.9653825Z 2025-05-07T19:45:06.9653829Z 2025-05-07T19:45:06.9653833Z 2025-05-07T19:45:06.9653954Z  2025-05-07T19:45:06.9654122Z 2025-05-07T19:45:06.9654141Z 2025-05-07T19:45:06.9654145Z 2025-05-07T19:45:06.9654148Z 2025-05-07T19:45:06.9654151Z 2025-05-07T19:45:06.9654155Z 2025-05-07T19:45:06.9654158Z 2025-05-07T19:45:06.9654162Z 2025-05-07T19:45:06.9654165Z 2025-05-07T19:45:06.9654168Z 2025-05-07T19:45:06.9654172Z 2025-05-07T19:45:06.9654297Z  2025-05-07T19:45:06.9654471Z 2025-05-07T19:45:06.9654491Z 2025-05-07T19:45:06.9654495Z 2025-05-07T19:45:06.9654498Z 2025-05-07T19:45:06.9654502Z 2025-05-07T19:45:06.9654505Z 2025-05-07T19:45:06.9654509Z 2025-05-07T19:45:06.9654512Z 2025-05-07T19:45:06.9654515Z 2025-05-07T19:45:06.9654518Z 2025-05-07T19:45:06.9654522Z 2025-05-07T19:45:06.9654525Z 2025-05-07T19:45:06.9654654Z  2025-05-07T19:45:06.9654853Z 2025-05-07T19:45:06.9654856Z 2025-05-07T19:45:06.9654916Z 2025-05-07T19:45:06.9654920Z 2025-05-07T19:45:06.9654923Z 2025-05-07T19:45:06.9654927Z 2025-05-07T19:45:06.9655028Z 2025-05-07T19:45:06.9655031Z 2025-05-07T19:45:06.9655035Z 2025-05-07T19:45:06.9655038Z 2025-05-07T19:45:06.9655042Z 2025-05-07T19:45:06.9655045Z 2025-05-07T19:45:06.9655048Z 2025-05-07T19:45:06.9655186Z  2025-05-07T19:45:06.9655392Z 2025-05-07T19:45:06.9655396Z 2025-05-07T19:45:06.9655400Z 2025-05-07T19:45:06.9655403Z 2025-05-07T19:45:06.9655407Z 2025-05-07T19:45:06.9655410Z 2025-05-07T19:45:06.9655414Z 2025-05-07T19:45:06.9655417Z 2025-05-07T19:45:06.9655420Z 2025-05-07T19:45:06.9655424Z 2025-05-07T19:45:06.9655427Z 2025-05-07T19:45:06.9655431Z 2025-05-07T19:45:06.9655434Z 2025-05-07T19:45:06.9655437Z 2025-05-07T19:45:06.9655575Z  2025-05-07T19:45:06.9655786Z 2025-05-07T19:45:06.9655790Z 2025-05-07T19:45:06.9655794Z 2025-05-07T19:45:06.9655797Z 2025-05-07T19:45:06.9655803Z 2025-05-07T19:45:06.9655807Z 2025-05-07T19:45:06.9655810Z 2025-05-07T19:45:06.9655814Z 2025-05-07T19:45:06.9655817Z 2025-05-07T19:45:06.9655823Z 2025-05-07T19:45:06.9655827Z 2025-05-07T19:45:06.9655830Z 2025-05-07T19:45:06.9655833Z 2025-05-07T19:45:06.9655837Z 2025-05-07T19:45:06.9655840Z 2025-05-07T19:45:06.9656001Z  2025-05-07T19:45:06.9656200Z 2025-05-07T19:45:06.9656241Z 2025-05-07T19:45:06.9656245Z 2025-05-07T19:45:06.9656249Z 2025-05-07T19:45:06.9656252Z 2025-05-07T19:45:06.9656255Z 2025-05-07T19:45:06.9656276Z 2025-05-07T19:45:06.9656280Z 2025-05-07T19:45:06.9656283Z 2025-05-07T19:45:06.9656287Z 2025-05-07T19:45:06.9656290Z 2025-05-07T19:45:06.9656293Z 2025-05-07T19:45:06.9656296Z 2025-05-07T19:45:06.9656300Z 2025-05-07T19:45:06.9656303Z 2025-05-07T19:45:06.9656306Z 2025-05-07T19:45:06.9656460Z  2025-05-07T19:45:06.9656671Z 2025-05-07T19:45:06.9656692Z 2025-05-07T19:45:06.9656698Z 2025-05-07T19:45:06.9656701Z 2025-05-07T19:45:06.9656705Z 2025-05-07T19:45:06.9656708Z 2025-05-07T19:45:06.9656711Z 2025-05-07T19:45:06.9656717Z 2025-05-07T19:45:06.9656721Z 2025-05-07T19:45:06.9656724Z 2025-05-07T19:45:06.9656727Z 2025-05-07T19:45:06.9656731Z 2025-05-07T19:45:06.9656734Z 2025-05-07T19:45:06.9656737Z 2025-05-07T19:45:06.9656740Z 2025-05-07T19:45:06.9656744Z 2025-05-07T19:45:06.9656747Z 2025-05-07T19:45:06.9656905Z  2025-05-07T19:45:06.9657138Z 2025-05-07T19:45:06.9657142Z 2025-05-07T19:45:06.9657145Z 2025-05-07T19:45:06.9657148Z 2025-05-07T19:45:06.9657152Z 2025-05-07T19:45:06.9657155Z 2025-05-07T19:45:06.9657159Z 2025-05-07T19:45:06.9657162Z 2025-05-07T19:45:06.9657166Z 2025-05-07T19:45:06.9657169Z 2025-05-07T19:45:06.9657173Z 2025-05-07T19:45:06.9657176Z 2025-05-07T19:45:06.9657179Z 2025-05-07T19:45:06.9657183Z 2025-05-07T19:45:06.9657186Z 2025-05-07T19:45:06.9657190Z 2025-05-07T19:45:06.9657193Z 2025-05-07T19:45:06.9657200Z 2025-05-07T19:45:06.9657384Z  2025-05-07T19:45:06.9657610Z 2025-05-07T19:45:06.9657614Z 2025-05-07T19:45:06.9657717Z  2025-05-07T19:45:06.9657853Z 2025-05-07T19:45:06.9657856Z 2025-05-07T19:45:06.9657962Z  2025-05-07T19:45:06.9658069Z 2025-05-07T19:45:06.9658072Z 2025-05-07T19:45:06.9658076Z 2025-05-07T19:45:06.9658204Z  2025-05-07T19:45:06.9658324Z 2025-05-07T19:45:06.9658327Z 2025-05-07T19:45:06.9658331Z 2025-05-07T19:45:06.9658334Z 2025-05-07T19:45:06.9658446Z  2025-05-07T19:45:06.9658596Z 2025-05-07T19:45:06.9658600Z 2025-05-07T19:45:06.9658603Z 2025-05-07T19:45:06.9658606Z 2025-05-07T19:45:06.9658610Z 2025-05-07T19:45:06.9658727Z  done 2025-05-07T19:45:07.2806368Z Preparing transaction: \ | / done 2025-05-07T19:45:11.1184636Z Verifying transaction: \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / done 2025-05-07T19:45:13.9386194Z Executing transaction: \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - done 2025-05-07T19:45:14.3459527Z [INSTALL] Adding symlink librhash.so.0, which is needed by CMake ... 2025-05-07T19:45:16.2155270Z + ln -s /github/home/miniconda/envs/build_binary/lib/librhash.so /github/home/miniconda/envs/build_binary/lib/librhash.so.0 2025-05-07T19:45:16.2155873Z 2025-05-07T19:45:16.2170795Z 2025-05-07T19:45:16.2202790Z [EXEC] [ATTEMPT 0/3] + conda run -n build_binary pip install build 2025-05-07T19:45:18.5175010Z WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager, possibly rendering your system unusable. It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv. Use the --root-user-action option if you know what you are doing and want to suppress this warning. 2025-05-07T19:45:18.5179702Z Collecting build 2025-05-07T19:45:18.5180735Z Downloading build-1.2.2.post1-py3-none-any.whl.metadata (6.5 kB) 2025-05-07T19:45:18.5183116Z Requirement already satisfied: packaging>=19.1 in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from build) (25.0) 2025-05-07T19:45:18.5184595Z Collecting pyproject_hooks (from build) 2025-05-07T19:45:18.5185024Z Downloading pyproject_hooks-1.2.0-py3-none-any.whl.metadata (1.3 kB) 2025-05-07T19:45:18.5185495Z Downloading build-1.2.2.post1-py3-none-any.whl (22 kB) 2025-05-07T19:45:18.5185924Z Downloading pyproject_hooks-1.2.0-py3-none-any.whl (10 kB) 2025-05-07T19:45:18.5186331Z Installing collected packages: pyproject_hooks, build 2025-05-07T19:45:18.5186592Z 2025-05-07T19:45:18.5186776Z Successfully installed build-1.2.2.post1 pyproject_hooks-1.2.0 2025-05-07T19:45:18.5187054Z 2025-05-07T19:45:18.5187077Z 2025-05-07T19:45:20.3737525Z /github/home/miniconda/envs/build_binary/bin/make 2025-05-07T19:45:20.3738369Z 2025-05-07T19:45:20.4454774Z [CHECK] Binary make found in PATH 2025-05-07T19:45:22.2400737Z /github/home/miniconda/envs/build_binary/bin/cmake 2025-05-07T19:45:22.2401066Z 2025-05-07T19:45:22.3170641Z [CHECK] Binary cmake found in PATH 2025-05-07T19:45:24.0847606Z /github/home/miniconda/envs/build_binary/bin/ninja 2025-05-07T19:45:24.0847924Z 2025-05-07T19:45:24.1414981Z [CHECK] Binary ninja found in PATH 2025-05-07T19:45:26.0472714Z [CHECK] Python (sub-)package 'click' found ... 2025-05-07T19:45:28.0355234Z [CHECK] Python (sub-)package 'hypothesis' found ... 2025-05-07T19:45:29.9374054Z [CHECK] Python (sub-)package 'jinja2' found ... 2025-05-07T19:45:31.9624522Z [CHECK] Python (sub-)package 'skbuild' found ... 2025-05-07T19:45:33.8820158Z [CHECK] Python (sub-)package 'wheel' found ... 2025-05-07T19:45:33.8825769Z [INSTALL] Successfully installed all the build tools 2025-05-07T19:45:33.8916118Z ##[group]Run . $PRELUDE; install_cuda $BUILD_ENV 11.8.0 2025-05-07T19:45:33.8916569Z . $PRELUDE; install_cuda $BUILD_ENV 11.8.0 2025-05-07T19:45:33.8917175Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:45:33.8917509Z env: 2025-05-07T19:45:33.8917735Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:45:33.8918067Z BUILD_ENV: build_binary 2025-05-07T19:45:33.8918312Z BUILD_TARGET: default 2025-05-07T19:45:33.8918563Z BUILD_VARIANT: cuda 2025-05-07T19:45:33.8918819Z BUILD_CUDA_VERSION: 11.8.0 2025-05-07T19:45:33.8919065Z ##[endgroup] 2025-05-07T19:45:34.3689088Z ################################################################################ 2025-05-07T19:45:34.3690090Z # Install CUDA 2025-05-07T19:45:34.3691127Z # 2025-05-07T19:45:34.3704076Z # [2025-05-07T19:45:34.369Z] + install_cuda build_binary 11.8.0 2025-05-07T19:45:34.3705267Z ################################################################################ 2025-05-07T19:45:34.3706028Z 2025-05-07T19:45:34.3730046Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:45:34.4585101Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:45:34.4586139Z [SETUP] Cleaning up Conda packages ... 2025-05-07T19:45:34.4591004Z + conda clean --packages --tarball -y 2025-05-07T19:45:34.4592194Z 2025-05-07T19:45:34.9274346Z Will remove 130 (465.2 MB) tarball(s). 2025-05-07T19:45:34.9274697Z Will remove 14 (1.7 MB) package(s). 2025-05-07T19:45:34.9825990Z 2025-05-07T19:45:34.9829997Z + conda clean --all -y 2025-05-07T19:45:34.9830449Z 2025-05-07T19:45:35.5954839Z There are no unused tarball(s) to remove. 2025-05-07T19:45:35.5955211Z Will remove 1 index cache(s). 2025-05-07T19:45:35.5955539Z There are no unused package(s) to remove. 2025-05-07T19:45:35.5955861Z There are no tempfile(s) to remove. 2025-05-07T19:45:35.5956175Z There are no logfile(s) to remove. 2025-05-07T19:45:35.6533476Z 2025-05-07T19:45:35.6543279Z [INSTALL] Installing CUDA 11.8.0 ... 2025-05-07T19:45:35.6570660Z [EXEC] [ATTEMPT 0/3] + conda install --force-reinstall -n build_binary -c nvidia/label/cuda-11.8.0 -y cuda 2025-05-07T19:45:36.6730740Z Channels: 2025-05-07T19:45:36.6731512Z - nvidia/label/cuda-11.8.0 2025-05-07T19:45:36.6732274Z - defaults 2025-05-07T19:45:36.6732867Z Platform: linux-64 2025-05-07T19:45:37.8067617Z Collecting package metadata (repodata.json): - \ | / - \ done 2025-05-07T19:45:38.0226100Z Solving environment: / - done 2025-05-07T19:45:38.1338121Z 2025-05-07T19:45:38.1338746Z ## Package Plan ## 2025-05-07T19:45:38.1339204Z 2025-05-07T19:45:38.1339806Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:45:38.1340120Z 2025-05-07T19:45:38.1340225Z added / updated specs: 2025-05-07T19:45:38.1340495Z - cuda 2025-05-07T19:45:38.1340731Z 2025-05-07T19:45:38.1340735Z 2025-05-07T19:45:38.1340856Z The following packages will be downloaded: 2025-05-07T19:45:38.1341099Z 2025-05-07T19:45:38.1341242Z package | build 2025-05-07T19:45:38.1341582Z ---------------------------|----------------- 2025-05-07T19:45:38.1341959Z cuda-11.8.0 | 0 1 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:38.1342427Z cuda-cccl-11.8.89 | 0 1.2 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:38.1342955Z cuda-command-line-tools-11.8.0| 0 1 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:38.1343500Z cuda-compiler-11.8.0 | 0 1 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:38.1343990Z cuda-cudart-11.8.89 | 0 197 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:38.1344481Z cuda-cudart-dev-11.8.89 | 0 1.1 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:38.1345179Z cuda-cuobjdump-11.8.86 | 0 229 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:38.1346068Z cuda-cupti-11.8.87 | 0 25.3 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:38.1346600Z cuda-cuxxfilt-11.8.86 | 0 291 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:38.1347202Z cuda-demo-suite-11.8.86 | 0 5.0 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:38.1347843Z cuda-documentation-11.8.86 | 0 89 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:38.1348625Z cuda-driver-dev-11.8.89 | 0 16 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:38.1349127Z cuda-gdb-11.8.86 | 0 4.8 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:38.1349613Z cuda-libraries-11.8.0 | 0 1 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:38.1350140Z cuda-libraries-dev-11.8.0 | 0 2 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:38.1350654Z cuda-memcheck-11.8.86 | 0 168 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:38.1351294Z cuda-nsight-11.8.86 | 0 113.6 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:38.1351839Z cuda-nsight-compute-11.8.0 | 0 1 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:38.1352339Z cuda-nvcc-11.8.89 | 0 50.8 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:38.1353020Z cuda-nvdisasm-11.8.86 | 0 48.7 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:38.1353506Z cuda-nvml-dev-11.8.86 | 0 83 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:38.1354007Z cuda-nvprof-11.8.87 | 0 4.4 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:38.1354489Z cuda-nvprune-11.8.86 | 0 65 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:38.1354986Z cuda-nvrtc-11.8.89 | 0 19.1 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:38.1355495Z cuda-nvrtc-dev-11.8.89 | 0 17.0 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:38.1355972Z cuda-nvtx-11.8.86 | 0 57 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:38.1356447Z cuda-nvvp-11.8.87 | 0 114.4 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:38.1356942Z cuda-profiler-api-11.8.86 | 0 18 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:38.1357476Z cuda-runtime-11.8.0 | 0 1 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:38.1358122Z cuda-sanitizer-api-11.8.86 | 0 16.6 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:38.1358592Z cuda-toolkit-11.8.0 | 0 1 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:38.1359044Z cuda-tools-11.8.0 | 0 1 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:38.1359500Z cuda-visual-tools-11.8.0 | 0 1 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:38.1359977Z gds-tools-1.4.0.31 | 0 2 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:38.1360428Z libcublas-11.11.3.6 | 0 364.0 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:38.1360882Z libcublas-dev-11.11.3.6 | 0 394.1 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:38.1361348Z libcufft-10.9.0.58 | 0 142.8 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:38.1361797Z libcufft-dev-10.9.0.58 | 0 275.8 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:38.1362264Z libcufile-1.4.0.31 | 0 548 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:38.1362713Z libcufile-dev-1.4.0.31 | 0 1.6 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:38.1363182Z libcurand-10.3.0.86 | 0 53.2 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:38.1363755Z libcurand-dev-10.3.0.86 | 0 53.7 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:38.1364221Z libcusolver-11.4.1.48 | 0 96.5 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:38.1364707Z libcusolver-dev-11.4.1.48 | 0 66.3 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:38.1365179Z libcusparse-11.7.5.86 | 0 176.3 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:38.1365681Z libcusparse-dev-11.7.5.86 | 0 359.7 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:38.1366155Z libnpp-11.8.0.86 | 0 147.8 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:38.1366590Z libnpp-dev-11.8.0.86 | 0 144.5 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:38.1367057Z libnvjpeg-11.9.0.86 | 0 2.4 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:38.1367508Z libnvjpeg-dev-11.9.0.86 | 0 2.1 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:38.1368012Z nsight-compute-2022.3.0.22 | 0 610.0 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:38.1368457Z ------------------------------------------------------------ 2025-05-07T19:45:38.1368786Z Total: 3.24 GB 2025-05-07T19:45:38.1368991Z 2025-05-07T19:45:38.1369202Z The following NEW packages will be INSTALLED: 2025-05-07T19:45:38.1369421Z 2025-05-07T19:45:38.1369604Z cuda nvidia/label/cuda-11.8.0/linux-64::cuda-11.8.0-0 2025-05-07T19:45:38.1370067Z cuda-cccl nvidia/label/cuda-11.8.0/linux-64::cuda-cccl-11.8.89-0 2025-05-07T19:45:38.1370647Z cuda-command-line~ nvidia/label/cuda-11.8.0/linux-64::cuda-command-line-tools-11.8.0-0 2025-05-07T19:45:38.1371242Z cuda-compiler nvidia/label/cuda-11.8.0/linux-64::cuda-compiler-11.8.0-0 2025-05-07T19:45:38.1371786Z cuda-cudart nvidia/label/cuda-11.8.0/linux-64::cuda-cudart-11.8.89-0 2025-05-07T19:45:38.1372331Z cuda-cudart-dev nvidia/label/cuda-11.8.0/linux-64::cuda-cudart-dev-11.8.89-0 2025-05-07T19:45:38.1372923Z cuda-cuobjdump nvidia/label/cuda-11.8.0/linux-64::cuda-cuobjdump-11.8.86-0 2025-05-07T19:45:38.1373474Z cuda-cupti nvidia/label/cuda-11.8.0/linux-64::cuda-cupti-11.8.87-0 2025-05-07T19:45:38.1373998Z cuda-cuxxfilt nvidia/label/cuda-11.8.0/linux-64::cuda-cuxxfilt-11.8.86-0 2025-05-07T19:45:38.1374573Z cuda-demo-suite nvidia/label/cuda-11.8.0/linux-64::cuda-demo-suite-11.8.86-0 2025-05-07T19:45:38.1375157Z cuda-documentation nvidia/label/cuda-11.8.0/linux-64::cuda-documentation-11.8.86-0 2025-05-07T19:45:38.1375758Z cuda-driver-dev nvidia/label/cuda-11.8.0/linux-64::cuda-driver-dev-11.8.89-0 2025-05-07T19:45:38.1376279Z cuda-gdb nvidia/label/cuda-11.8.0/linux-64::cuda-gdb-11.8.86-0 2025-05-07T19:45:38.1376775Z cuda-libraries nvidia/label/cuda-11.8.0/linux-64::cuda-libraries-11.8.0-0 2025-05-07T19:45:38.1377365Z cuda-libraries-dev nvidia/label/cuda-11.8.0/linux-64::cuda-libraries-dev-11.8.0-0 2025-05-07T19:45:38.1377936Z cuda-memcheck nvidia/label/cuda-11.8.0/linux-64::cuda-memcheck-11.8.86-0 2025-05-07T19:45:38.1378470Z cuda-nsight nvidia/label/cuda-11.8.0/linux-64::cuda-nsight-11.8.86-0 2025-05-07T19:45:38.1379046Z cuda-nsight-compu~ nvidia/label/cuda-11.8.0/linux-64::cuda-nsight-compute-11.8.0-0 2025-05-07T19:45:38.1379590Z cuda-nvcc nvidia/label/cuda-11.8.0/linux-64::cuda-nvcc-11.8.89-0 2025-05-07T19:45:38.1380104Z cuda-nvdisasm nvidia/label/cuda-11.8.0/linux-64::cuda-nvdisasm-11.8.86-0 2025-05-07T19:45:38.1380632Z cuda-nvml-dev nvidia/label/cuda-11.8.0/linux-64::cuda-nvml-dev-11.8.86-0 2025-05-07T19:45:38.1381160Z cuda-nvprof nvidia/label/cuda-11.8.0/linux-64::cuda-nvprof-11.8.87-0 2025-05-07T19:45:38.1381692Z cuda-nvprune nvidia/label/cuda-11.8.0/linux-64::cuda-nvprune-11.8.86-0 2025-05-07T19:45:38.1382195Z cuda-nvrtc nvidia/label/cuda-11.8.0/linux-64::cuda-nvrtc-11.8.89-0 2025-05-07T19:45:38.1384125Z cuda-nvrtc-dev nvidia/label/cuda-11.8.0/linux-64::cuda-nvrtc-dev-11.8.89-0 2025-05-07T19:45:38.1384646Z cuda-nvtx nvidia/label/cuda-11.8.0/linux-64::cuda-nvtx-11.8.86-0 2025-05-07T19:45:38.1385130Z cuda-nvvp nvidia/label/cuda-11.8.0/linux-64::cuda-nvvp-11.8.87-0 2025-05-07T19:45:38.1385686Z cuda-profiler-api nvidia/label/cuda-11.8.0/linux-64::cuda-profiler-api-11.8.86-0 2025-05-07T19:45:38.1386244Z cuda-runtime nvidia/label/cuda-11.8.0/linux-64::cuda-runtime-11.8.0-0 2025-05-07T19:45:38.1386823Z cuda-sanitizer-api nvidia/label/cuda-11.8.0/linux-64::cuda-sanitizer-api-11.8.86-0 2025-05-07T19:45:38.1387378Z cuda-toolkit nvidia/label/cuda-11.8.0/linux-64::cuda-toolkit-11.8.0-0 2025-05-07T19:45:38.1387884Z cuda-tools nvidia/label/cuda-11.8.0/linux-64::cuda-tools-11.8.0-0 2025-05-07T19:45:38.1388432Z cuda-visual-tools nvidia/label/cuda-11.8.0/linux-64::cuda-visual-tools-11.8.0-0 2025-05-07T19:45:38.1388976Z gds-tools nvidia/label/cuda-11.8.0/linux-64::gds-tools-1.4.0.31-0 2025-05-07T19:45:38.1389477Z libcublas nvidia/label/cuda-11.8.0/linux-64::libcublas-11.11.3.6-0 2025-05-07T19:45:38.1390000Z libcublas-dev nvidia/label/cuda-11.8.0/linux-64::libcublas-dev-11.11.3.6-0 2025-05-07T19:45:38.1390931Z libcufft nvidia/label/cuda-11.8.0/linux-64::libcufft-10.9.0.58-0 2025-05-07T19:45:38.1391840Z libcufft-dev nvidia/label/cuda-11.8.0/linux-64::libcufft-dev-10.9.0.58-0 2025-05-07T19:45:38.1392462Z libcufile nvidia/label/cuda-11.8.0/linux-64::libcufile-1.4.0.31-0 2025-05-07T19:45:38.1393045Z libcufile-dev nvidia/label/cuda-11.8.0/linux-64::libcufile-dev-1.4.0.31-0 2025-05-07T19:45:38.1393613Z libcurand nvidia/label/cuda-11.8.0/linux-64::libcurand-10.3.0.86-0 2025-05-07T19:45:38.1394190Z libcurand-dev nvidia/label/cuda-11.8.0/linux-64::libcurand-dev-10.3.0.86-0 2025-05-07T19:45:38.1394788Z libcusolver nvidia/label/cuda-11.8.0/linux-64::libcusolver-11.4.1.48-0 2025-05-07T19:45:38.1395388Z libcusolver-dev nvidia/label/cuda-11.8.0/linux-64::libcusolver-dev-11.4.1.48-0 2025-05-07T19:45:38.1395999Z libcusparse nvidia/label/cuda-11.8.0/linux-64::libcusparse-11.7.5.86-0 2025-05-07T19:45:38.1396611Z libcusparse-dev nvidia/label/cuda-11.8.0/linux-64::libcusparse-dev-11.7.5.86-0 2025-05-07T19:45:38.1397168Z libnpp nvidia/label/cuda-11.8.0/linux-64::libnpp-11.8.0.86-0 2025-05-07T19:45:38.1397696Z libnpp-dev nvidia/label/cuda-11.8.0/linux-64::libnpp-dev-11.8.0.86-0 2025-05-07T19:45:38.1398460Z libnvjpeg nvidia/label/cuda-11.8.0/linux-64::libnvjpeg-11.9.0.86-0 2025-05-07T19:45:38.1399000Z libnvjpeg-dev nvidia/label/cuda-11.8.0/linux-64::libnvjpeg-dev-11.9.0.86-0 2025-05-07T19:45:38.1399586Z nsight-compute nvidia/label/cuda-11.8.0/linux-64::nsight-compute-2022.3.0.22-0 2025-05-07T19:45:38.1399931Z 2025-05-07T19:45:38.1441243Z 2025-05-07T19:45:38.1441824Z 2025-05-07T19:45:38.1442436Z Downloading and Extracting Packages: ...working... 2025-05-07T19:45:38.1444138Z nsight-compute-2022. | 610.0 MB | | 0% 2025-05-07T19:45:38.1444456Z 2025-05-07T19:45:38.1453937Z libcublas-dev-11.11. | 394.1 MB | | 0%  2025-05-07T19:45:38.1454686Z 2025-05-07T19:45:38.1454690Z 2025-05-07T19:45:38.1470640Z libcublas-11.11.3.6 | 364.0 MB | | 0%  2025-05-07T19:45:38.1471756Z 2025-05-07T19:45:38.1471797Z 2025-05-07T19:45:38.1471809Z 2025-05-07T19:45:38.1480149Z libcusparse-dev-11.7 | 359.7 MB | | 0%  2025-05-07T19:45:38.1481002Z 2025-05-07T19:45:38.1481006Z 2025-05-07T19:45:38.1481010Z 2025-05-07T19:45:38.1481093Z 2025-05-07T19:45:38.1491502Z libcufft-dev-10.9.0. | 275.8 MB | | 0%  2025-05-07T19:45:38.1492350Z 2025-05-07T19:45:38.1492361Z 2025-05-07T19:45:38.1492373Z 2025-05-07T19:45:38.1492383Z 2025-05-07T19:45:38.1492393Z 2025-05-07T19:45:38.1493155Z libcusparse-11.7.5.8 | 176.3 MB | | 0%  2025-05-07T19:45:38.1494420Z 2025-05-07T19:45:38.1494434Z 2025-05-07T19:45:38.1494445Z 2025-05-07T19:45:38.1494455Z 2025-05-07T19:45:38.1494466Z 2025-05-07T19:45:38.1494477Z 2025-05-07T19:45:38.1495244Z libnpp-11.8.0.86 | 147.8 MB | | 0%  2025-05-07T19:45:38.1496037Z 2025-05-07T19:45:38.1496066Z 2025-05-07T19:45:38.1496076Z 2025-05-07T19:45:38.1496087Z 2025-05-07T19:45:38.1496097Z 2025-05-07T19:45:38.1496107Z 2025-05-07T19:45:38.1496118Z 2025-05-07T19:45:38.1496848Z libnpp-dev-11.8.0.86 | 144.5 MB | | 0%  2025-05-07T19:45:38.1497712Z 2025-05-07T19:45:38.1497722Z 2025-05-07T19:45:38.1497733Z 2025-05-07T19:45:38.1497744Z 2025-05-07T19:45:38.1497754Z 2025-05-07T19:45:38.1497764Z 2025-05-07T19:45:38.1497774Z 2025-05-07T19:45:38.1497785Z 2025-05-07T19:45:38.1498510Z libcufft-10.9.0.58 | 142.8 MB | | 0%  2025-05-07T19:45:38.1499368Z 2025-05-07T19:45:38.1499378Z 2025-05-07T19:45:38.1499403Z 2025-05-07T19:45:38.1499414Z 2025-05-07T19:45:38.1499425Z 2025-05-07T19:45:38.1499435Z 2025-05-07T19:45:38.1499445Z 2025-05-07T19:45:38.1499456Z 2025-05-07T19:45:38.1499491Z 2025-05-07T19:45:38.1500234Z cuda-nvvp-11.8.87 | 114.4 MB | | 0%  2025-05-07T19:45:38.1501341Z 2025-05-07T19:45:38.1501352Z 2025-05-07T19:45:38.1501362Z 2025-05-07T19:45:38.1501372Z 2025-05-07T19:45:38.1501383Z 2025-05-07T19:45:38.1501393Z 2025-05-07T19:45:38.1501404Z 2025-05-07T19:45:38.1501414Z 2025-05-07T19:45:38.1501424Z 2025-05-07T19:45:38.1501434Z 2025-05-07T19:45:38.1502221Z cuda-nsight-11.8.86 | 113.6 MB | | 0%  2025-05-07T19:45:38.1503304Z 2025-05-07T19:45:38.1503308Z 2025-05-07T19:45:38.1503311Z 2025-05-07T19:45:38.1503315Z 2025-05-07T19:45:38.1503318Z 2025-05-07T19:45:38.1503322Z 2025-05-07T19:45:38.1503325Z 2025-05-07T19:45:38.1503329Z 2025-05-07T19:45:38.1503332Z 2025-05-07T19:45:38.1503335Z 2025-05-07T19:45:38.1503343Z 2025-05-07T19:45:38.1504818Z libcusolver-11.4.1.4 | 96.5 MB | | 0%  2025-05-07T19:45:38.1505143Z 2025-05-07T19:45:38.1505147Z 2025-05-07T19:45:38.1505150Z 2025-05-07T19:45:38.1505154Z 2025-05-07T19:45:38.1505158Z 2025-05-07T19:45:38.1505161Z 2025-05-07T19:45:38.1505169Z 2025-05-07T19:45:38.1505182Z 2025-05-07T19:45:38.1505186Z 2025-05-07T19:45:38.1505190Z 2025-05-07T19:45:38.1505193Z 2025-05-07T19:45:38.1505196Z 2025-05-07T19:45:38.1505925Z libcusolver-dev-11.4 | 66.3 MB | | 0%  2025-05-07T19:45:38.1506243Z 2025-05-07T19:45:38.1506260Z 2025-05-07T19:45:38.1506264Z 2025-05-07T19:45:38.1506267Z 2025-05-07T19:45:38.1506271Z 2025-05-07T19:45:38.1506275Z 2025-05-07T19:45:38.1506278Z 2025-05-07T19:45:38.1506281Z 2025-05-07T19:45:38.1506285Z 2025-05-07T19:45:38.1506288Z 2025-05-07T19:45:38.1506292Z 2025-05-07T19:45:38.1506295Z 2025-05-07T19:45:38.1506315Z 2025-05-07T19:45:38.1506963Z libcurand-dev-10.3.0 | 53.7 MB | | 0%  2025-05-07T19:45:38.1507279Z 2025-05-07T19:45:38.1507283Z 2025-05-07T19:45:38.1507302Z 2025-05-07T19:45:38.1507306Z 2025-05-07T19:45:38.1507309Z 2025-05-07T19:45:38.1507313Z 2025-05-07T19:45:38.1507333Z 2025-05-07T19:45:38.1507341Z 2025-05-07T19:45:38.1507344Z 2025-05-07T19:45:38.1507348Z 2025-05-07T19:45:38.1507351Z 2025-05-07T19:45:38.1507355Z 2025-05-07T19:45:38.1507358Z 2025-05-07T19:45:38.1507362Z 2025-05-07T19:45:38.1507853Z libcurand-10.3.0.86 | 53.2 MB | | 0%  2025-05-07T19:45:38.1508163Z 2025-05-07T19:45:38.1508198Z 2025-05-07T19:45:38.1508201Z 2025-05-07T19:45:38.1508205Z 2025-05-07T19:45:38.1508209Z 2025-05-07T19:45:38.1508212Z 2025-05-07T19:45:38.1508215Z 2025-05-07T19:45:38.1508219Z 2025-05-07T19:45:38.1508222Z 2025-05-07T19:45:38.1508225Z 2025-05-07T19:45:38.1508229Z 2025-05-07T19:45:38.1508232Z 2025-05-07T19:45:38.1508307Z 2025-05-07T19:45:38.1508312Z 2025-05-07T19:45:38.1508316Z 2025-05-07T19:45:38.1508848Z cuda-nvcc-11.8.89 | 50.8 MB | | 0%  2025-05-07T19:45:38.1509172Z 2025-05-07T19:45:38.1509191Z 2025-05-07T19:45:38.1509195Z 2025-05-07T19:45:38.1509202Z 2025-05-07T19:45:38.1509206Z 2025-05-07T19:45:38.1509210Z 2025-05-07T19:45:38.1509213Z 2025-05-07T19:45:38.1509217Z 2025-05-07T19:45:38.1509221Z 2025-05-07T19:45:38.1509224Z 2025-05-07T19:45:38.1509228Z 2025-05-07T19:45:38.1509232Z 2025-05-07T19:45:38.1509235Z 2025-05-07T19:45:38.1509239Z 2025-05-07T19:45:38.1509242Z 2025-05-07T19:45:38.1509245Z 2025-05-07T19:45:38.1509834Z cuda-nvdisasm-11.8.8 | 48.7 MB | | 0%  2025-05-07T19:45:38.1510161Z 2025-05-07T19:45:38.1510165Z 2025-05-07T19:45:38.1510169Z 2025-05-07T19:45:38.1510172Z 2025-05-07T19:45:38.1510190Z 2025-05-07T19:45:38.1510193Z 2025-05-07T19:45:38.1510216Z 2025-05-07T19:45:38.1510224Z 2025-05-07T19:45:38.1510228Z 2025-05-07T19:45:38.1510231Z 2025-05-07T19:45:38.1510235Z 2025-05-07T19:45:38.1510238Z 2025-05-07T19:45:38.1510242Z 2025-05-07T19:45:38.1510245Z 2025-05-07T19:45:38.1510249Z 2025-05-07T19:45:38.1510252Z 2025-05-07T19:45:38.1510255Z 2025-05-07T19:45:38.1510898Z cuda-cupti-11.8.87 | 25.3 MB | | 0%  2025-05-07T19:45:38.1511324Z 2025-05-07T19:45:38.1511344Z 2025-05-07T19:45:38.1511347Z 2025-05-07T19:45:38.1511351Z 2025-05-07T19:45:38.1511354Z 2025-05-07T19:45:38.1511358Z 2025-05-07T19:45:38.1511361Z 2025-05-07T19:45:38.1511365Z 2025-05-07T19:45:38.1511368Z 2025-05-07T19:45:38.1511372Z 2025-05-07T19:45:38.1511376Z 2025-05-07T19:45:38.1511379Z 2025-05-07T19:45:38.1511384Z 2025-05-07T19:45:38.1511387Z 2025-05-07T19:45:38.1511390Z 2025-05-07T19:45:38.1511395Z 2025-05-07T19:45:38.1511398Z 2025-05-07T19:45:38.1511402Z 2025-05-07T19:45:38.1513075Z cuda-nvrtc-11.8.89 | 19.1 MB | | 0%  2025-05-07T19:45:38.1513454Z 2025-05-07T19:45:38.1513460Z 2025-05-07T19:45:38.1513464Z 2025-05-07T19:45:38.1513486Z 2025-05-07T19:45:38.1513490Z 2025-05-07T19:45:38.1513495Z 2025-05-07T19:45:38.1513499Z 2025-05-07T19:45:38.1513517Z 2025-05-07T19:45:38.1513521Z 2025-05-07T19:45:38.1513524Z 2025-05-07T19:45:38.1513529Z 2025-05-07T19:45:38.1513533Z 2025-05-07T19:45:38.1513536Z 2025-05-07T19:45:38.1513540Z 2025-05-07T19:45:38.1513544Z 2025-05-07T19:45:38.1513548Z 2025-05-07T19:45:38.1513552Z 2025-05-07T19:45:38.1513556Z 2025-05-07T19:45:38.1513560Z 2025-05-07T19:45:42.5026799Z ... (more hidden) ... 2025-05-07T19:45:42.5027187Z 2025-05-07T19:45:42.5027193Z 2025-05-07T19:45:42.5027197Z 2025-05-07T19:45:42.5027200Z 2025-05-07T19:45:42.5027511Z libcufft-dev-10.9.0. | 275.8 MB | ########## | 100%  2025-05-07T19:45:42.5027797Z 2025-05-07T19:45:42.5027828Z 2025-05-07T19:45:42.5027847Z 2025-05-07T19:45:42.5027851Z 2025-05-07T19:45:42.7849817Z libcufft-dev-10.9.0. | 275.8 MB | ########## | 100%  2025-05-07T19:45:42.7850165Z 2025-05-07T19:45:42.7850170Z 2025-05-07T19:45:42.7850434Z libcublas-11.11.3.6 | 364.0 MB | ########## | 100%  2025-05-07T19:45:42.7850736Z 2025-05-07T19:45:42.7850740Z 2025-05-07T19:45:44.6141035Z libcublas-11.11.3.6 | 364.0 MB | ########## | 100%  2025-05-07T19:45:44.6141375Z 2025-05-07T19:45:44.6141380Z 2025-05-07T19:45:44.6141385Z 2025-05-07T19:45:44.6141390Z 2025-05-07T19:45:44.6141409Z 2025-05-07T19:45:44.6141413Z 2025-05-07T19:45:44.6141700Z libnpp-11.8.0.86 | 147.8 MB | ########## | 100%  2025-05-07T19:45:44.6141978Z 2025-05-07T19:45:44.6141983Z 2025-05-07T19:45:44.6141987Z 2025-05-07T19:45:44.6141992Z 2025-05-07T19:45:44.6141996Z 2025-05-07T19:45:44.6142001Z 2025-05-07T19:45:44.8833792Z libnpp-11.8.0.86 | 147.8 MB | ########## | 100%  2025-05-07T19:45:44.8834125Z 2025-05-07T19:45:44.8834393Z libcublas-dev-11.11. | 394.1 MB | ########## | 100%  2025-05-07T19:45:44.8834655Z 2025-05-07T19:45:45.9735605Z libcublas-dev-11.11. | 394.1 MB | ########## | 100%  2025-05-07T19:45:45.9735943Z 2025-05-07T19:45:45.9735950Z 2025-05-07T19:45:45.9735983Z 2025-05-07T19:45:45.9735988Z 2025-05-07T19:45:45.9735992Z 2025-05-07T19:45:45.9736307Z libcusparse-11.7.5.8 | 176.3 MB | ########## | 100%  2025-05-07T19:45:45.9736599Z 2025-05-07T19:45:45.9736603Z 2025-05-07T19:45:45.9736607Z 2025-05-07T19:45:45.9736610Z 2025-05-07T19:45:45.9736615Z 2025-05-07T19:45:46.6706459Z libcusparse-11.7.5.8 | 176.3 MB | ########## | 100%  2025-05-07T19:45:46.6706836Z 2025-05-07T19:45:46.6706840Z 2025-05-07T19:45:46.6706845Z 2025-05-07T19:45:46.6706849Z 2025-05-07T19:45:46.6706853Z 2025-05-07T19:45:46.6706857Z 2025-05-07T19:45:46.6706861Z 2025-05-07T19:45:46.6707163Z libnpp-dev-11.8.0.86 | 144.5 MB | ########## | 100%  2025-05-07T19:45:46.6707477Z 2025-05-07T19:45:46.6707481Z 2025-05-07T19:45:46.6707485Z 2025-05-07T19:45:46.6707490Z 2025-05-07T19:45:46.6707494Z 2025-05-07T19:45:46.6707497Z 2025-05-07T19:45:46.6707501Z 2025-05-07T19:45:48.1374530Z libnpp-dev-11.8.0.86 | 144.5 MB | ########## | 100%  2025-05-07T19:45:48.1375215Z 2025-05-07T19:45:48.1375221Z 2025-05-07T19:45:48.1375225Z 2025-05-07T19:45:48.1375230Z 2025-05-07T19:45:48.1375234Z 2025-05-07T19:45:48.1375238Z 2025-05-07T19:45:48.1375242Z 2025-05-07T19:45:48.1375246Z 2025-05-07T19:45:48.1375250Z 2025-05-07T19:45:48.1375551Z cuda-nvvp-11.8.87 | 114.4 MB | ########## | 100%  2025-05-07T19:45:48.1375842Z 2025-05-07T19:45:48.1375846Z 2025-05-07T19:45:48.1375849Z 2025-05-07T19:45:48.1375853Z 2025-05-07T19:45:48.1375856Z 2025-05-07T19:45:48.1375860Z 2025-05-07T19:45:48.1375863Z 2025-05-07T19:45:48.1375868Z 2025-05-07T19:45:48.1375872Z 2025-05-07T19:45:48.1458714Z cuda-nvvp-11.8.87 | 114.4 MB | ########## | 100%  2025-05-07T19:45:48.1459043Z 2025-05-07T19:45:48.1459049Z 2025-05-07T19:45:48.1459053Z 2025-05-07T19:45:48.1459057Z 2025-05-07T19:45:48.1459061Z 2025-05-07T19:45:48.1459064Z 2025-05-07T19:45:48.1459068Z 2025-05-07T19:45:48.1459071Z 2025-05-07T19:45:48.1459360Z libcufft-10.9.0.58 | 142.8 MB | ########## | 100%  2025-05-07T19:45:48.1459653Z 2025-05-07T19:45:48.1459656Z 2025-05-07T19:45:48.1459660Z 2025-05-07T19:45:48.1459663Z 2025-05-07T19:45:48.1459667Z 2025-05-07T19:45:48.1459670Z 2025-05-07T19:45:48.1459674Z 2025-05-07T19:45:48.1459677Z 2025-05-07T19:45:48.3892705Z libcufft-10.9.0.58 | 142.8 MB | ########## | 100%  2025-05-07T19:45:48.3893044Z 2025-05-07T19:45:48.3893049Z 2025-05-07T19:45:48.3893053Z 2025-05-07T19:45:48.3893056Z 2025-05-07T19:45:48.3893060Z 2025-05-07T19:45:48.3893063Z 2025-05-07T19:45:48.3893067Z 2025-05-07T19:45:48.3893070Z 2025-05-07T19:45:48.3893094Z 2025-05-07T19:45:48.3893097Z 2025-05-07T19:45:48.3893403Z cuda-nsight-11.8.86 | 113.6 MB | ########## | 100%  2025-05-07T19:45:48.3893708Z 2025-05-07T19:45:48.3893713Z 2025-05-07T19:45:48.3893718Z 2025-05-07T19:45:48.3893722Z 2025-05-07T19:45:48.3893726Z 2025-05-07T19:45:48.3893740Z 2025-05-07T19:45:48.3893744Z 2025-05-07T19:45:48.3893747Z 2025-05-07T19:45:48.3893750Z 2025-05-07T19:45:48.3893754Z 2025-05-07T19:45:48.3986324Z cuda-nsight-11.8.86 | 113.6 MB | ########## | 100%  2025-05-07T19:45:48.3986664Z 2025-05-07T19:45:48.3986669Z 2025-05-07T19:45:48.3986672Z 2025-05-07T19:45:48.3986921Z libcusparse-dev-11.7 | 359.7 MB | ########## | 100%  2025-05-07T19:45:48.3987215Z 2025-05-07T19:45:48.3987219Z 2025-05-07T19:45:48.3987222Z 2025-05-07T19:45:49.1314388Z libcusparse-dev-11.7 | 359.7 MB | ########## | 100%  2025-05-07T19:45:49.1314726Z 2025-05-07T19:45:49.1314771Z 2025-05-07T19:45:49.1314775Z 2025-05-07T19:45:49.1315027Z 2025-05-07T19:45:49.1315032Z 2025-05-07T19:45:49.1315039Z 2025-05-07T19:45:49.1315043Z 2025-05-07T19:45:49.1315048Z 2025-05-07T19:45:49.1315053Z 2025-05-07T19:45:49.1315057Z 2025-05-07T19:45:49.1315061Z 2025-05-07T19:45:49.1315065Z 2025-05-07T19:45:49.1315069Z 2025-05-07T19:45:49.1315446Z libcurand-dev-10.3.0 | 53.7 MB | ########## | 100%  2025-05-07T19:45:49.1315784Z 2025-05-07T19:45:49.1315787Z 2025-05-07T19:45:49.1315791Z 2025-05-07T19:45:49.1315794Z 2025-05-07T19:45:49.1315798Z 2025-05-07T19:45:49.1315801Z 2025-05-07T19:45:49.1315805Z 2025-05-07T19:45:49.1315808Z 2025-05-07T19:45:49.1315812Z 2025-05-07T19:45:49.1315815Z 2025-05-07T19:45:49.1315819Z 2025-05-07T19:45:49.1315822Z 2025-05-07T19:45:49.1315826Z 2025-05-07T19:45:49.5399335Z libcurand-dev-10.3.0 | 53.7 MB | ########## | 100%  2025-05-07T19:45:49.5399877Z nsight-compute-2022. | 610.0 MB | ########## | 100% 2025-05-07T19:45:49.6394198Z nsight-compute-2022. | 610.0 MB | ########## | 100% 2025-05-07T19:45:49.6394498Z 2025-05-07T19:45:49.6394504Z 2025-05-07T19:45:49.6394508Z 2025-05-07T19:45:49.6394511Z 2025-05-07T19:45:49.6394516Z 2025-05-07T19:45:49.6394519Z 2025-05-07T19:45:49.6394523Z 2025-05-07T19:45:49.6394527Z 2025-05-07T19:45:49.6394761Z 2025-05-07T19:45:49.6394765Z 2025-05-07T19:45:49.6394784Z 2025-05-07T19:45:49.6394788Z 2025-05-07T19:45:49.6395231Z libcusolver-dev-11.4 | 66.3 MB | ########## | 100%  2025-05-07T19:45:49.6395556Z 2025-05-07T19:45:49.6395559Z 2025-05-07T19:45:49.6395563Z 2025-05-07T19:45:49.6395567Z 2025-05-07T19:45:49.6395570Z 2025-05-07T19:45:49.6395574Z 2025-05-07T19:45:49.6395578Z 2025-05-07T19:45:49.6395581Z 2025-05-07T19:45:49.6395598Z 2025-05-07T19:45:49.6395602Z 2025-05-07T19:45:49.6395605Z 2025-05-07T19:45:49.6395609Z 2025-05-07T19:45:50.0773937Z libcusolver-dev-11.4 | 66.3 MB | ########## | 100%  2025-05-07T19:45:50.0774365Z 2025-05-07T19:45:50.0774370Z 2025-05-07T19:45:50.0774374Z 2025-05-07T19:45:50.0774379Z 2025-05-07T19:45:50.0774384Z 2025-05-07T19:45:50.0774388Z 2025-05-07T19:45:50.0774392Z 2025-05-07T19:45:50.0774396Z 2025-05-07T19:45:50.0774400Z 2025-05-07T19:45:50.0774404Z 2025-05-07T19:45:50.0774432Z 2025-05-07T19:45:50.0774436Z 2025-05-07T19:45:50.0774439Z 2025-05-07T19:45:50.0774443Z 2025-05-07T19:45:50.0774446Z 2025-05-07T19:45:50.0774810Z cuda-nvcc-11.8.89 | 50.8 MB | ########## | 100%  2025-05-07T19:45:50.0775118Z 2025-05-07T19:45:50.0775122Z 2025-05-07T19:45:50.0775126Z 2025-05-07T19:45:50.0775129Z 2025-05-07T19:45:50.0775133Z 2025-05-07T19:45:50.0775136Z 2025-05-07T19:45:50.0775139Z 2025-05-07T19:45:50.0775143Z 2025-05-07T19:45:50.0775147Z 2025-05-07T19:45:50.0775150Z 2025-05-07T19:45:50.0775154Z 2025-05-07T19:45:50.0775158Z 2025-05-07T19:45:50.0775166Z 2025-05-07T19:45:50.0775169Z 2025-05-07T19:45:50.0775179Z 2025-05-07T19:45:50.1203819Z cuda-nvcc-11.8.89 | 50.8 MB | ########## | 100%  2025-05-07T19:45:50.1204190Z 2025-05-07T19:45:50.1204194Z 2025-05-07T19:45:50.1204198Z 2025-05-07T19:45:50.1204201Z 2025-05-07T19:45:50.1204205Z 2025-05-07T19:45:50.1204237Z 2025-05-07T19:45:50.1204241Z 2025-05-07T19:45:50.1204245Z 2025-05-07T19:45:50.1204248Z 2025-05-07T19:45:50.1204251Z 2025-05-07T19:45:50.1204255Z 2025-05-07T19:45:50.1204258Z 2025-05-07T19:45:50.1204262Z 2025-05-07T19:45:50.1204265Z 2025-05-07T19:45:50.1204269Z 2025-05-07T19:45:50.1204272Z 2025-05-07T19:45:50.1204276Z 2025-05-07T19:45:50.1204595Z cuda-cupti-11.8.87 | 25.3 MB | ########## | 100%  2025-05-07T19:45:50.1204928Z 2025-05-07T19:45:50.1204932Z 2025-05-07T19:45:50.1204935Z 2025-05-07T19:45:50.1204939Z 2025-05-07T19:45:50.1204942Z 2025-05-07T19:45:50.1204946Z 2025-05-07T19:45:50.1204949Z 2025-05-07T19:45:50.1204952Z 2025-05-07T19:45:50.1205179Z 2025-05-07T19:45:50.1205185Z 2025-05-07T19:45:50.1205189Z 2025-05-07T19:45:50.1205193Z 2025-05-07T19:45:50.1205196Z 2025-05-07T19:45:50.1205200Z 2025-05-07T19:45:50.1205203Z 2025-05-07T19:45:50.1205207Z 2025-05-07T19:45:50.1205210Z 2025-05-07T19:45:50.1671679Z cuda-cupti-11.8.87 | 25.3 MB | ########## | 100%  2025-05-07T19:45:50.1672043Z 2025-05-07T19:45:50.1672143Z 2025-05-07T19:45:50.1672161Z 2025-05-07T19:45:50.1672165Z 2025-05-07T19:45:50.1672178Z 2025-05-07T19:45:50.1672182Z 2025-05-07T19:45:50.1672185Z 2025-05-07T19:45:50.1672189Z 2025-05-07T19:45:50.1672192Z 2025-05-07T19:45:50.1672196Z 2025-05-07T19:45:50.1672199Z 2025-05-07T19:45:50.1672203Z 2025-05-07T19:45:50.1672206Z 2025-05-07T19:45:50.1672214Z 2025-05-07T19:45:50.1672513Z libcurand-10.3.0.86 | 53.2 MB | ########## | 100%  2025-05-07T19:45:50.1672837Z 2025-05-07T19:45:50.1672847Z 2025-05-07T19:45:50.1672858Z 2025-05-07T19:45:50.1672862Z 2025-05-07T19:45:50.1672865Z 2025-05-07T19:45:50.1672869Z 2025-05-07T19:45:50.1672872Z 2025-05-07T19:45:50.1672876Z 2025-05-07T19:45:50.1672879Z 2025-05-07T19:45:50.1672883Z 2025-05-07T19:45:50.1672886Z 2025-05-07T19:45:50.1672890Z 2025-05-07T19:45:50.1673078Z 2025-05-07T19:45:50.1673081Z 2025-05-07T19:45:50.4073912Z libcurand-10.3.0.86 | 53.2 MB | ########## | 100%  2025-05-07T19:45:50.4074295Z 2025-05-07T19:45:50.4074300Z 2025-05-07T19:45:50.4074304Z 2025-05-07T19:45:50.4074308Z 2025-05-07T19:45:50.4074311Z 2025-05-07T19:45:50.4074315Z 2025-05-07T19:45:50.4074333Z 2025-05-07T19:45:50.4074337Z 2025-05-07T19:45:50.4074342Z 2025-05-07T19:45:50.4074346Z 2025-05-07T19:45:50.4074351Z 2025-05-07T19:45:50.4074355Z 2025-05-07T19:45:50.4074360Z 2025-05-07T19:45:50.4074364Z 2025-05-07T19:45:50.4074368Z 2025-05-07T19:45:50.4074373Z 2025-05-07T19:45:50.4074377Z 2025-05-07T19:45:50.4074384Z 2025-05-07T19:45:50.4074729Z cuda-nvrtc-11.8.89 | 19.1 MB | ########## | 100%  2025-05-07T19:45:50.4075072Z 2025-05-07T19:45:50.4075076Z 2025-05-07T19:45:50.4075080Z 2025-05-07T19:45:50.4075084Z 2025-05-07T19:45:50.4075088Z 2025-05-07T19:45:50.4075092Z 2025-05-07T19:45:50.4075108Z 2025-05-07T19:45:50.4075111Z 2025-05-07T19:45:50.4075115Z 2025-05-07T19:45:50.4075118Z 2025-05-07T19:45:50.4075122Z 2025-05-07T19:45:50.4075125Z 2025-05-07T19:45:50.4075129Z 2025-05-07T19:45:50.4075132Z 2025-05-07T19:45:50.4075135Z 2025-05-07T19:45:50.4075139Z 2025-05-07T19:45:50.4075142Z 2025-05-07T19:45:50.4075146Z 2025-05-07T19:45:50.4275330Z cuda-nvrtc-11.8.89 | 19.1 MB | ########## | 100%  2025-05-07T19:45:50.4275694Z 2025-05-07T19:45:50.4275699Z 2025-05-07T19:45:50.4275703Z 2025-05-07T19:45:50.4275706Z 2025-05-07T19:45:50.4275710Z 2025-05-07T19:45:50.4275713Z 2025-05-07T19:45:50.4275717Z 2025-05-07T19:45:50.4275732Z 2025-05-07T19:45:50.4275750Z 2025-05-07T19:45:50.4275754Z 2025-05-07T19:45:50.4275757Z 2025-05-07T19:45:50.4275760Z 2025-05-07T19:45:50.4275764Z 2025-05-07T19:45:50.4275768Z 2025-05-07T19:45:50.4275772Z 2025-05-07T19:45:50.4275775Z 2025-05-07T19:45:50.4276098Z cuda-nvdisasm-11.8.8 | 48.7 MB | ########## | 100%  2025-05-07T19:45:50.4276438Z 2025-05-07T19:45:50.4276454Z 2025-05-07T19:45:50.4276457Z 2025-05-07T19:45:50.4276461Z 2025-05-07T19:45:50.4276464Z 2025-05-07T19:45:50.4276468Z 2025-05-07T19:45:50.4276472Z 2025-05-07T19:45:50.4276475Z 2025-05-07T19:45:50.4276478Z 2025-05-07T19:45:50.4276482Z 2025-05-07T19:45:50.4276485Z 2025-05-07T19:45:50.4276489Z 2025-05-07T19:45:50.4276492Z 2025-05-07T19:45:50.4276495Z 2025-05-07T19:45:50.4276499Z 2025-05-07T19:45:50.4276502Z 2025-05-07T19:45:50.6717068Z cuda-nvdisasm-11.8.8 | 48.7 MB | ########## | 100%  2025-05-07T19:45:50.6717672Z 2025-05-07T19:45:50.6717679Z 2025-05-07T19:45:50.6717684Z 2025-05-07T19:45:50.6717688Z 2025-05-07T19:45:50.6717691Z 2025-05-07T19:45:50.6717696Z 2025-05-07T19:45:50.6717700Z 2025-05-07T19:45:50.6717703Z 2025-05-07T19:45:50.6717707Z 2025-05-07T19:45:50.6717711Z 2025-05-07T19:45:50.6717714Z 2025-05-07T19:45:50.6717726Z 2025-05-07T19:45:50.6717730Z 2025-05-07T19:45:50.6717733Z 2025-05-07T19:45:50.6717737Z 2025-05-07T19:45:50.6717740Z 2025-05-07T19:45:50.6717744Z 2025-05-07T19:45:50.6717747Z 2025-05-07T19:45:50.6717751Z 2025-05-07T19:45:50.6718039Z ... (more hidden) ... 2025-05-07T19:45:50.6718325Z 2025-05-07T19:45:50.6718329Z 2025-05-07T19:45:50.6718332Z 2025-05-07T19:45:50.6718336Z 2025-05-07T19:45:50.6718339Z 2025-05-07T19:45:50.6718343Z 2025-05-07T19:45:50.6718346Z 2025-05-07T19:45:50.6718350Z 2025-05-07T19:45:50.6718353Z 2025-05-07T19:45:50.6718356Z 2025-05-07T19:45:50.6718360Z 2025-05-07T19:45:50.6718363Z 2025-05-07T19:45:50.6718386Z 2025-05-07T19:45:50.6718389Z 2025-05-07T19:45:50.6718393Z 2025-05-07T19:45:50.6718397Z 2025-05-07T19:45:50.6718400Z 2025-05-07T19:45:50.6718404Z 2025-05-07T19:45:50.6718407Z 2025-05-07T19:45:50.7735517Z ... (more hidden) ... 2025-05-07T19:45:50.7736095Z 2025-05-07T19:45:50.7736100Z 2025-05-07T19:45:50.7736104Z 2025-05-07T19:45:50.7736108Z 2025-05-07T19:45:50.7736112Z 2025-05-07T19:45:50.7736116Z 2025-05-07T19:45:50.7736120Z 2025-05-07T19:45:50.7736123Z 2025-05-07T19:45:50.7736127Z 2025-05-07T19:45:50.7736131Z 2025-05-07T19:45:50.7736134Z 2025-05-07T19:45:50.7736435Z libcusolver-11.4.1.4 | 96.5 MB | ########## | 100%  2025-05-07T19:45:50.7736752Z 2025-05-07T19:45:50.7736756Z 2025-05-07T19:45:50.7736759Z 2025-05-07T19:45:50.7736763Z 2025-05-07T19:45:50.7736766Z 2025-05-07T19:45:50.7736770Z 2025-05-07T19:45:50.7736773Z 2025-05-07T19:45:50.7736777Z 2025-05-07T19:45:50.7736780Z 2025-05-07T19:45:50.7736794Z 2025-05-07T19:45:50.7736798Z 2025-05-07T19:45:59.6024917Z libcusolver-11.4.1.4 | 96.5 MB | ########## | 100%  2025-05-07T19:45:59.6025311Z 2025-05-07T19:45:59.6025317Z 2025-05-07T19:45:59.6025322Z 2025-05-07T19:45:59.6025327Z 2025-05-07T19:45:59.6025366Z 2025-05-07T19:45:59.6025387Z 2025-05-07T19:46:12.7925676Z libnpp-11.8.0.86 | 147.8 MB | ########## | 100%  2025-05-07T19:46:12.7926015Z 2025-05-07T19:46:12.7926021Z 2025-05-07T19:46:12.7926025Z 2025-05-07T19:46:12.7926030Z 2025-05-07T19:46:19.7607606Z libcufft-dev-10.9.0. | 275.8 MB | ########## | 100%  2025-05-07T19:46:19.7607969Z 2025-05-07T19:46:19.7607977Z 2025-05-07T19:46:29.6363462Z libcublas-11.11.3.6 | 364.0 MB | ########## | 100%  2025-05-07T19:46:29.6363817Z 2025-05-07T19:46:29.6363822Z 2025-05-07T19:46:29.6363826Z 2025-05-07T19:46:29.6363830Z 2025-05-07T19:46:29.6363834Z 2025-05-07T19:46:33.5608630Z libcusparse-11.7.5.8 | 176.3 MB | ########## | 100%  2025-05-07T19:46:33.5609007Z 2025-05-07T19:46:33.5609012Z 2025-05-07T19:46:33.5609016Z 2025-05-07T19:46:33.5609023Z 2025-05-07T19:46:33.5609030Z 2025-05-07T19:46:33.5609034Z 2025-05-07T19:46:33.5609039Z 2025-05-07T19:46:37.9393963Z libnpp-dev-11.8.0.86 | 144.5 MB | ########## | 100%  2025-05-07T19:46:37.9394351Z 2025-05-07T19:46:37.9394357Z 2025-05-07T19:46:37.9394361Z 2025-05-07T19:46:37.9394365Z 2025-05-07T19:46:37.9394369Z 2025-05-07T19:46:37.9394373Z 2025-05-07T19:46:37.9394377Z 2025-05-07T19:46:37.9394380Z 2025-05-07T19:46:37.9394384Z 2025-05-07T19:46:41.4538256Z cuda-nvvp-11.8.87 | 114.4 MB | ########## | 100%  2025-05-07T19:46:41.4538608Z 2025-05-07T19:46:45.7195428Z libcublas-dev-11.11. | 394.1 MB | ########## | 100%  2025-05-07T19:46:45.7195730Z 2025-05-07T19:46:45.7195756Z 2025-05-07T19:46:45.7195761Z 2025-05-07T19:46:45.7195781Z 2025-05-07T19:46:45.7195785Z 2025-05-07T19:46:45.7196041Z 2025-05-07T19:46:45.7196046Z 2025-05-07T19:46:45.7196050Z 2025-05-07T19:46:45.7196053Z 2025-05-07T19:46:45.7196057Z 2025-05-07T19:46:48.3975917Z cuda-nsight-11.8.86 | 113.6 MB | ########## | 100%  2025-05-07T19:46:48.3976804Z 2025-05-07T19:46:48.3976824Z 2025-05-07T19:46:48.3976863Z 2025-05-07T19:46:48.3976868Z 2025-05-07T19:46:48.3976872Z 2025-05-07T19:46:48.3976876Z 2025-05-07T19:46:48.3976880Z 2025-05-07T19:46:48.3976884Z 2025-05-07T19:46:50.3607335Z libcufft-10.9.0.58 | 142.8 MB | ########## | 100%  2025-05-07T19:46:50.3607700Z 2025-05-07T19:46:50.3607708Z 2025-05-07T19:46:50.3607713Z 2025-05-07T19:46:50.3607717Z 2025-05-07T19:46:50.3607722Z 2025-05-07T19:46:50.3607726Z 2025-05-07T19:46:50.3607730Z 2025-05-07T19:46:50.3607733Z 2025-05-07T19:46:50.3607737Z 2025-05-07T19:46:50.3607741Z 2025-05-07T19:46:50.3607744Z 2025-05-07T19:46:50.3607748Z 2025-05-07T19:46:50.3607753Z 2025-05-07T19:46:59.9075179Z libcurand-dev-10.3.0 | 53.7 MB | ########## | 100%  2025-05-07T19:46:59.9075567Z 2025-05-07T19:46:59.9075571Z 2025-05-07T19:46:59.9075576Z 2025-05-07T19:46:59.9075579Z 2025-05-07T19:46:59.9075583Z 2025-05-07T19:46:59.9075587Z 2025-05-07T19:46:59.9075591Z 2025-05-07T19:46:59.9075830Z 2025-05-07T19:46:59.9075835Z 2025-05-07T19:46:59.9075839Z 2025-05-07T19:46:59.9075842Z 2025-05-07T19:46:59.9075846Z 2025-05-07T19:47:05.6629610Z libcusolver-dev-11.4 | 66.3 MB | ########## | 100%  2025-05-07T19:47:05.6630623Z 2025-05-07T19:47:05.6630638Z 2025-05-07T19:47:05.6630650Z 2025-05-07T19:47:05.6630661Z 2025-05-07T19:47:05.6630671Z 2025-05-07T19:47:05.6630682Z 2025-05-07T19:47:05.6630692Z 2025-05-07T19:47:05.6630702Z 2025-05-07T19:47:05.6630713Z 2025-05-07T19:47:05.6630723Z 2025-05-07T19:47:05.6630733Z 2025-05-07T19:47:05.6630743Z 2025-05-07T19:47:05.6630754Z 2025-05-07T19:47:05.6630765Z 2025-05-07T19:47:05.6630775Z 2025-05-07T19:47:08.5830113Z cuda-nvcc-11.8.89 | 50.8 MB | ########## | 100%  2025-05-07T19:47:08.5830495Z 2025-05-07T19:47:08.5830500Z 2025-05-07T19:47:08.5830504Z 2025-05-07T19:47:08.5830508Z 2025-05-07T19:47:08.5830512Z 2025-05-07T19:47:08.5830516Z 2025-05-07T19:47:08.5830533Z 2025-05-07T19:47:08.5830536Z 2025-05-07T19:47:08.5830540Z 2025-05-07T19:47:08.5830543Z 2025-05-07T19:47:08.5830547Z 2025-05-07T19:47:08.5830550Z 2025-05-07T19:47:08.5830554Z 2025-05-07T19:47:08.5830557Z 2025-05-07T19:47:08.5830561Z 2025-05-07T19:47:08.5830564Z 2025-05-07T19:47:08.5830568Z 2025-05-07T19:47:13.0572934Z cuda-cupti-11.8.87 | 25.3 MB | ########## | 100%  2025-05-07T19:47:13.0573335Z 2025-05-07T19:47:13.0573344Z 2025-05-07T19:47:13.0573349Z 2025-05-07T19:47:13.0573354Z 2025-05-07T19:47:13.0573358Z 2025-05-07T19:47:13.0573362Z 2025-05-07T19:47:13.0573366Z 2025-05-07T19:47:13.0573370Z 2025-05-07T19:47:13.0573421Z 2025-05-07T19:47:13.0573425Z 2025-05-07T19:47:13.0573429Z 2025-05-07T19:47:13.0573432Z 2025-05-07T19:47:13.0573436Z 2025-05-07T19:47:13.0573440Z 2025-05-07T19:47:15.2831437Z libcurand-10.3.0.86 | 53.2 MB | ########## | 100%  2025-05-07T19:47:15.2831862Z 2025-05-07T19:47:15.2831894Z 2025-05-07T19:47:15.2831899Z 2025-05-07T19:47:15.2831903Z 2025-05-07T19:47:15.2831907Z 2025-05-07T19:47:15.2831911Z 2025-05-07T19:47:15.2831915Z 2025-05-07T19:47:15.2831919Z 2025-05-07T19:47:15.2831922Z 2025-05-07T19:47:15.2831926Z 2025-05-07T19:47:15.2831930Z 2025-05-07T19:47:15.2831933Z 2025-05-07T19:47:15.2831937Z 2025-05-07T19:47:15.2831941Z 2025-05-07T19:47:15.2831945Z 2025-05-07T19:47:15.2831948Z 2025-05-07T19:47:15.2831952Z 2025-05-07T19:47:15.2831955Z 2025-05-07T19:47:15.3997230Z cuda-nvrtc-11.8.89 | 19.1 MB | ########## | 100%  2025-05-07T19:47:15.3997602Z 2025-05-07T19:47:15.3997607Z 2025-05-07T19:47:15.3997848Z 2025-05-07T19:47:18.5698121Z libcusparse-dev-11.7 | 359.7 MB | ########## | 100%  2025-05-07T19:47:18.5698488Z 2025-05-07T19:47:18.5698494Z 2025-05-07T19:47:18.5698498Z 2025-05-07T19:47:18.5698502Z 2025-05-07T19:47:18.5698506Z 2025-05-07T19:47:18.5698511Z 2025-05-07T19:47:18.5698536Z 2025-05-07T19:47:18.5698541Z 2025-05-07T19:47:18.5698545Z 2025-05-07T19:47:18.5698549Z 2025-05-07T19:47:18.5698553Z 2025-05-07T19:47:18.5698557Z 2025-05-07T19:47:18.5698561Z 2025-05-07T19:47:18.5698565Z 2025-05-07T19:47:18.5698568Z 2025-05-07T19:47:18.5698572Z 2025-05-07T19:47:20.3292983Z cuda-nvdisasm-11.8.8 | 48.7 MB | ########## | 100%  2025-05-07T19:47:20.3293403Z 2025-05-07T19:47:20.3293408Z 2025-05-07T19:47:20.3293413Z 2025-05-07T19:47:20.3293417Z 2025-05-07T19:47:20.3293420Z 2025-05-07T19:47:20.3293424Z 2025-05-07T19:47:20.3293428Z 2025-05-07T19:47:20.3293432Z 2025-05-07T19:47:20.3293437Z 2025-05-07T19:47:20.3293442Z 2025-05-07T19:47:20.3293476Z 2025-05-07T19:47:20.3293480Z 2025-05-07T19:47:20.3293484Z 2025-05-07T19:47:20.3293487Z 2025-05-07T19:47:20.3293491Z 2025-05-07T19:47:20.3293519Z 2025-05-07T19:47:20.3293523Z 2025-05-07T19:47:20.3293527Z 2025-05-07T19:47:20.3293531Z 2025-05-07T19:47:32.8008432Z ... (more hidden) ... 2025-05-07T19:47:32.8009279Z 2025-05-07T19:47:32.8009285Z 2025-05-07T19:47:32.8009289Z 2025-05-07T19:47:32.8009293Z 2025-05-07T19:47:32.8009296Z 2025-05-07T19:47:32.8009300Z 2025-05-07T19:47:32.8009303Z 2025-05-07T19:47:32.8009308Z 2025-05-07T19:47:32.8009311Z 2025-05-07T19:47:32.8009315Z 2025-05-07T19:47:32.8009318Z 2025-05-07T19:47:35.8529803Z libcusolver-11.4.1.4 | 96.5 MB | ########## | 100%  2025-05-07T19:47:35.8541942Z nsight-compute-2022. | 610.0 MB | ########## | 100% 2025-05-07T19:47:35.8542432Z 2025-05-07T19:47:35.8542488Z 2025-05-07T19:47:35.8542494Z 2025-05-07T19:47:35.8542622Z 2025-05-07T19:47:35.8542629Z 2025-05-07T19:47:35.8542719Z 2025-05-07T19:47:35.8542725Z 2025-05-07T19:47:35.8542759Z 2025-05-07T19:47:35.8542765Z 2025-05-07T19:47:35.8542804Z 2025-05-07T19:47:35.8542811Z 2025-05-07T19:47:35.8542842Z 2025-05-07T19:47:35.8543079Z 2025-05-07T19:47:35.8543088Z 2025-05-07T19:47:35.8543119Z 2025-05-07T19:47:35.8543124Z 2025-05-07T19:47:35.8543129Z 2025-05-07T19:47:35.8543133Z 2025-05-07T19:47:35.8543138Z 2025-05-07T19:47:35.8543439Z 2025-05-07T19:47:35.8543953Z  2025-05-07T19:47:35.8544319Z 2025-05-07T19:47:35.8544578Z 2025-05-07T19:47:35.8544772Z  2025-05-07T19:47:35.8544997Z 2025-05-07T19:47:35.8545009Z 2025-05-07T19:47:35.8545223Z  2025-05-07T19:47:35.8545446Z 2025-05-07T19:47:35.8546063Z 2025-05-07T19:47:35.8546076Z 2025-05-07T19:47:35.8546327Z  2025-05-07T19:47:35.8546594Z 2025-05-07T19:47:35.8546625Z 2025-05-07T19:47:35.8546629Z 2025-05-07T19:47:35.8546633Z 2025-05-07T19:47:35.8546831Z  2025-05-07T19:47:35.8547104Z 2025-05-07T19:47:35.8547108Z 2025-05-07T19:47:35.8547111Z 2025-05-07T19:47:35.8547114Z 2025-05-07T19:47:35.8547118Z 2025-05-07T19:47:35.8547313Z  2025-05-07T19:47:35.8547552Z 2025-05-07T19:47:35.8547555Z 2025-05-07T19:47:35.8547558Z 2025-05-07T19:47:35.8547562Z 2025-05-07T19:47:35.8547599Z 2025-05-07T19:47:35.8547603Z 2025-05-07T19:47:35.8547801Z  2025-05-07T19:47:35.8548039Z 2025-05-07T19:47:35.8548042Z 2025-05-07T19:47:35.8548045Z 2025-05-07T19:47:35.8548049Z 2025-05-07T19:47:35.8548052Z 2025-05-07T19:47:35.8548056Z 2025-05-07T19:47:35.8548323Z 2025-05-07T19:47:35.8548564Z  2025-05-07T19:47:35.8548802Z 2025-05-07T19:47:35.8548806Z 2025-05-07T19:47:35.8548810Z 2025-05-07T19:47:35.8548813Z 2025-05-07T19:47:35.8548817Z 2025-05-07T19:47:35.8548821Z 2025-05-07T19:47:35.8548833Z 2025-05-07T19:47:35.8548837Z 2025-05-07T19:47:35.8549060Z  2025-05-07T19:47:35.8549296Z 2025-05-07T19:47:35.8549300Z 2025-05-07T19:47:35.8549303Z 2025-05-07T19:47:35.8549307Z 2025-05-07T19:47:35.8549310Z 2025-05-07T19:47:35.8549314Z 2025-05-07T19:47:35.8549317Z 2025-05-07T19:47:35.8549321Z 2025-05-07T19:47:35.8549324Z 2025-05-07T19:47:35.8549555Z  2025-05-07T19:47:35.8549794Z 2025-05-07T19:47:35.8549797Z 2025-05-07T19:47:35.8549801Z 2025-05-07T19:47:35.8549804Z 2025-05-07T19:47:35.8549808Z 2025-05-07T19:47:35.8549816Z 2025-05-07T19:47:35.8549819Z 2025-05-07T19:47:35.8549823Z 2025-05-07T19:47:35.8549826Z 2025-05-07T19:47:35.8549830Z 2025-05-07T19:47:35.8550087Z  2025-05-07T19:47:35.8550400Z 2025-05-07T19:47:35.8550432Z 2025-05-07T19:47:35.8550546Z 2025-05-07T19:47:35.8550549Z 2025-05-07T19:47:35.8550553Z 2025-05-07T19:47:35.8550556Z 2025-05-07T19:47:35.8550560Z 2025-05-07T19:47:35.8550564Z 2025-05-07T19:47:35.8550567Z 2025-05-07T19:47:35.8550570Z 2025-05-07T19:47:35.8550574Z 2025-05-07T19:47:35.8550793Z  2025-05-07T19:47:35.8551161Z 2025-05-07T19:47:35.8551167Z 2025-05-07T19:47:35.8551173Z 2025-05-07T19:47:35.8551178Z 2025-05-07T19:47:35.8551185Z 2025-05-07T19:47:35.8551191Z 2025-05-07T19:47:35.8551195Z 2025-05-07T19:47:35.8551198Z 2025-05-07T19:47:35.8551202Z 2025-05-07T19:47:35.8551205Z 2025-05-07T19:47:35.8551209Z 2025-05-07T19:47:35.8551217Z 2025-05-07T19:47:35.8551435Z  2025-05-07T19:47:35.8551724Z 2025-05-07T19:47:35.8551728Z 2025-05-07T19:47:35.8551732Z 2025-05-07T19:47:35.8551735Z 2025-05-07T19:47:35.8551739Z 2025-05-07T19:47:35.8551747Z 2025-05-07T19:47:35.8551752Z 2025-05-07T19:47:35.8551755Z 2025-05-07T19:47:35.8551759Z 2025-05-07T19:47:35.8551762Z 2025-05-07T19:47:35.8551766Z 2025-05-07T19:47:35.8551769Z 2025-05-07T19:47:35.8551773Z 2025-05-07T19:47:35.8551991Z  2025-05-07T19:47:35.8552293Z 2025-05-07T19:47:35.8552296Z 2025-05-07T19:47:35.8552300Z 2025-05-07T19:47:35.8552303Z 2025-05-07T19:47:35.8552307Z 2025-05-07T19:47:35.8552310Z 2025-05-07T19:47:35.8552313Z 2025-05-07T19:47:35.8552317Z 2025-05-07T19:47:35.8552320Z 2025-05-07T19:47:35.8552324Z 2025-05-07T19:47:35.8552327Z 2025-05-07T19:47:35.8552330Z 2025-05-07T19:47:35.8552337Z 2025-05-07T19:47:35.8552341Z 2025-05-07T19:47:35.8552592Z  2025-05-07T19:47:35.8552858Z 2025-05-07T19:47:35.8552862Z 2025-05-07T19:47:35.8552865Z 2025-05-07T19:47:35.8552869Z 2025-05-07T19:47:35.8552876Z 2025-05-07T19:47:35.8552879Z 2025-05-07T19:47:35.8552883Z 2025-05-07T19:47:35.8552886Z 2025-05-07T19:47:35.8552890Z 2025-05-07T19:47:35.8552893Z 2025-05-07T19:47:35.8552897Z 2025-05-07T19:47:35.8552900Z 2025-05-07T19:47:35.8552903Z 2025-05-07T19:47:35.8552907Z 2025-05-07T19:47:35.8552938Z 2025-05-07T19:47:35.8553162Z  2025-05-07T19:47:35.8553423Z 2025-05-07T19:47:35.8553427Z 2025-05-07T19:47:35.8553430Z 2025-05-07T19:47:35.8553434Z 2025-05-07T19:47:35.8553437Z 2025-05-07T19:47:35.8553440Z 2025-05-07T19:47:35.8553444Z 2025-05-07T19:47:35.8553447Z 2025-05-07T19:47:35.8553518Z 2025-05-07T19:47:35.8553554Z 2025-05-07T19:47:35.8553557Z 2025-05-07T19:47:35.8553560Z 2025-05-07T19:47:35.8553564Z 2025-05-07T19:47:35.8553567Z 2025-05-07T19:47:35.8553571Z 2025-05-07T19:47:35.8553574Z 2025-05-07T19:47:35.8553808Z  2025-05-07T19:47:35.8554074Z 2025-05-07T19:47:35.8554078Z 2025-05-07T19:47:35.8554109Z 2025-05-07T19:47:35.8554113Z 2025-05-07T19:47:35.8554116Z 2025-05-07T19:47:35.8554120Z 2025-05-07T19:47:35.8554123Z 2025-05-07T19:47:35.8554127Z 2025-05-07T19:47:35.8554130Z 2025-05-07T19:47:35.8554134Z 2025-05-07T19:47:35.8554137Z 2025-05-07T19:47:35.8554141Z 2025-05-07T19:47:35.8554145Z 2025-05-07T19:47:35.8554148Z 2025-05-07T19:47:35.8554152Z 2025-05-07T19:47:35.8554155Z 2025-05-07T19:47:35.8554159Z 2025-05-07T19:47:35.8554401Z  2025-05-07T19:47:35.8554698Z 2025-05-07T19:47:35.8554706Z 2025-05-07T19:47:35.8554709Z 2025-05-07T19:47:35.8554713Z 2025-05-07T19:47:35.8554716Z 2025-05-07T19:47:35.8554720Z 2025-05-07T19:47:35.8554723Z 2025-05-07T19:47:35.8554727Z 2025-05-07T19:47:35.8554730Z 2025-05-07T19:47:35.8554734Z 2025-05-07T19:47:35.8554737Z 2025-05-07T19:47:35.8554804Z 2025-05-07T19:47:35.8554807Z 2025-05-07T19:47:35.8554811Z 2025-05-07T19:47:35.8554814Z 2025-05-07T19:47:35.8554818Z 2025-05-07T19:47:35.8554821Z 2025-05-07T19:47:35.8554825Z 2025-05-07T19:47:35.8555104Z  2025-05-07T19:47:35.8555370Z 2025-05-07T19:47:35.8555374Z 2025-05-07T19:47:35.8555486Z  2025-05-07T19:47:35.8555638Z 2025-05-07T19:47:35.8555642Z 2025-05-07T19:47:35.8555746Z  2025-05-07T19:47:35.8555855Z 2025-05-07T19:47:35.8555858Z 2025-05-07T19:47:35.8555862Z 2025-05-07T19:47:35.8555975Z  2025-05-07T19:47:35.8556085Z 2025-05-07T19:47:35.8556088Z 2025-05-07T19:47:35.8556095Z 2025-05-07T19:47:35.8556099Z 2025-05-07T19:47:35.8556200Z  2025-05-07T19:47:35.8556330Z 2025-05-07T19:47:35.8556334Z 2025-05-07T19:47:35.8556337Z 2025-05-07T19:47:35.8556341Z 2025-05-07T19:47:35.8556344Z 2025-05-07T19:47:35.8556448Z  2025-05-07T19:47:35.8556575Z 2025-05-07T19:47:35.8556579Z 2025-05-07T19:47:35.8556582Z 2025-05-07T19:47:35.8556597Z 2025-05-07T19:47:35.8556601Z 2025-05-07T19:47:35.8556604Z 2025-05-07T19:47:35.8571854Z  2025-05-07T19:47:35.8572068Z 2025-05-07T19:47:35.8572073Z 2025-05-07T19:47:35.8572078Z 2025-05-07T19:47:35.8572081Z 2025-05-07T19:47:35.8572102Z 2025-05-07T19:47:35.8572106Z 2025-05-07T19:47:35.8572162Z 2025-05-07T19:47:35.8572374Z  2025-05-07T19:47:35.8572544Z 2025-05-07T19:47:35.8572549Z 2025-05-07T19:47:35.8572553Z 2025-05-07T19:47:35.8572557Z 2025-05-07T19:47:35.8572562Z 2025-05-07T19:47:35.8572565Z 2025-05-07T19:47:35.8572569Z 2025-05-07T19:47:35.8572573Z 2025-05-07T19:47:35.8572707Z  2025-05-07T19:47:35.8572876Z 2025-05-07T19:47:35.8572880Z 2025-05-07T19:47:35.8572883Z 2025-05-07T19:47:35.8572887Z 2025-05-07T19:47:35.8572891Z 2025-05-07T19:47:35.8572895Z 2025-05-07T19:47:35.8572898Z 2025-05-07T19:47:35.8572902Z 2025-05-07T19:47:35.8572914Z 2025-05-07T19:47:35.8573043Z  2025-05-07T19:47:35.8573219Z 2025-05-07T19:47:35.8573222Z 2025-05-07T19:47:35.8573226Z 2025-05-07T19:47:35.8573229Z 2025-05-07T19:47:35.8573233Z 2025-05-07T19:47:35.8573237Z 2025-05-07T19:47:35.8573240Z 2025-05-07T19:47:35.8573244Z 2025-05-07T19:47:35.8573248Z 2025-05-07T19:47:35.8573251Z 2025-05-07T19:47:35.8573381Z  2025-05-07T19:47:35.8573546Z 2025-05-07T19:47:35.8573568Z 2025-05-07T19:47:35.8573571Z 2025-05-07T19:47:35.8573575Z 2025-05-07T19:47:35.8573578Z 2025-05-07T19:47:35.8573582Z 2025-05-07T19:47:35.8573585Z 2025-05-07T19:47:35.8573589Z 2025-05-07T19:47:35.8573592Z 2025-05-07T19:47:35.8573730Z 2025-05-07T19:47:35.8573735Z 2025-05-07T19:47:35.8573876Z  2025-05-07T19:47:35.8574101Z 2025-05-07T19:47:35.8574105Z 2025-05-07T19:47:35.8574108Z 2025-05-07T19:47:35.8574112Z 2025-05-07T19:47:35.8574115Z 2025-05-07T19:47:35.8574119Z 2025-05-07T19:47:35.8574128Z 2025-05-07T19:47:35.8574131Z 2025-05-07T19:47:35.8574135Z 2025-05-07T19:47:35.8574138Z 2025-05-07T19:47:35.8574142Z 2025-05-07T19:47:35.8574145Z 2025-05-07T19:47:35.8574294Z  2025-05-07T19:47:35.8574497Z 2025-05-07T19:47:35.8574501Z 2025-05-07T19:47:35.8574504Z 2025-05-07T19:47:35.8574508Z 2025-05-07T19:47:35.8574512Z 2025-05-07T19:47:35.8574516Z 2025-05-07T19:47:35.8574519Z 2025-05-07T19:47:35.8574523Z 2025-05-07T19:47:35.8574526Z 2025-05-07T19:47:35.8574530Z 2025-05-07T19:47:35.8574533Z 2025-05-07T19:47:35.8574537Z 2025-05-07T19:47:35.8574541Z 2025-05-07T19:47:35.8574677Z  2025-05-07T19:47:35.8574912Z 2025-05-07T19:47:35.8574919Z 2025-05-07T19:47:35.8574923Z 2025-05-07T19:47:35.8574926Z 2025-05-07T19:47:35.8574930Z 2025-05-07T19:47:35.8574933Z 2025-05-07T19:47:35.8574937Z 2025-05-07T19:47:35.8574940Z 2025-05-07T19:47:35.8574944Z 2025-05-07T19:47:35.8574947Z 2025-05-07T19:47:35.8574951Z 2025-05-07T19:47:35.8575022Z 2025-05-07T19:47:35.8575026Z 2025-05-07T19:47:35.8575030Z 2025-05-07T19:47:35.8575174Z  2025-05-07T19:47:35.8575391Z 2025-05-07T19:47:35.8575395Z 2025-05-07T19:47:35.8575399Z 2025-05-07T19:47:35.8575402Z 2025-05-07T19:47:35.8575406Z 2025-05-07T19:47:35.8575409Z 2025-05-07T19:47:35.8575413Z 2025-05-07T19:47:35.8575417Z 2025-05-07T19:47:35.8575420Z 2025-05-07T19:47:35.8575424Z 2025-05-07T19:47:35.8575427Z 2025-05-07T19:47:35.8575431Z 2025-05-07T19:47:35.8575435Z 2025-05-07T19:47:35.8575438Z 2025-05-07T19:47:35.8575442Z 2025-05-07T19:47:35.8575604Z  2025-05-07T19:47:35.8575811Z 2025-05-07T19:47:35.8575819Z 2025-05-07T19:47:35.8575823Z 2025-05-07T19:47:35.8575826Z 2025-05-07T19:47:35.8575830Z 2025-05-07T19:47:35.8575834Z 2025-05-07T19:47:35.8575837Z 2025-05-07T19:47:35.8575841Z 2025-05-07T19:47:35.8575844Z 2025-05-07T19:47:35.8575848Z 2025-05-07T19:47:35.8575851Z 2025-05-07T19:47:35.8575858Z 2025-05-07T19:47:35.8575862Z 2025-05-07T19:47:35.8575882Z 2025-05-07T19:47:35.8575886Z 2025-05-07T19:47:35.8575889Z 2025-05-07T19:47:35.8576044Z  2025-05-07T19:47:35.8576267Z 2025-05-07T19:47:35.8576271Z 2025-05-07T19:47:35.8576274Z 2025-05-07T19:47:35.8576278Z 2025-05-07T19:47:35.8576282Z 2025-05-07T19:47:35.8576285Z 2025-05-07T19:47:35.8576289Z 2025-05-07T19:47:35.8576293Z 2025-05-07T19:47:35.8576321Z 2025-05-07T19:47:35.8576325Z 2025-05-07T19:47:35.8576328Z 2025-05-07T19:47:35.8576332Z 2025-05-07T19:47:35.8576336Z 2025-05-07T19:47:35.8576340Z 2025-05-07T19:47:35.8576343Z 2025-05-07T19:47:35.8576347Z 2025-05-07T19:47:35.8576354Z 2025-05-07T19:47:35.8576525Z  2025-05-07T19:47:35.8576750Z 2025-05-07T19:47:35.8576772Z 2025-05-07T19:47:35.8576775Z 2025-05-07T19:47:35.8576779Z 2025-05-07T19:47:35.8576782Z 2025-05-07T19:47:35.8576786Z 2025-05-07T19:47:35.8576789Z 2025-05-07T19:47:35.8576796Z 2025-05-07T19:47:35.8576801Z 2025-05-07T19:47:35.8576804Z 2025-05-07T19:47:35.8576808Z 2025-05-07T19:47:35.8576811Z 2025-05-07T19:47:35.8576815Z 2025-05-07T19:47:35.8576818Z 2025-05-07T19:47:35.8576822Z 2025-05-07T19:47:35.8576825Z 2025-05-07T19:47:35.8576829Z 2025-05-07T19:47:35.8576833Z 2025-05-07T19:47:35.8576998Z  2025-05-07T19:47:35.8577236Z 2025-05-07T19:47:35.8577239Z 2025-05-07T19:47:35.8577336Z  2025-05-07T19:47:35.8577441Z 2025-05-07T19:47:35.8577445Z 2025-05-07T19:47:35.8577558Z  2025-05-07T19:47:35.8577666Z 2025-05-07T19:47:35.8577670Z 2025-05-07T19:47:35.8577674Z 2025-05-07T19:47:35.8577849Z  2025-05-07T19:47:35.8577979Z 2025-05-07T19:47:35.8577983Z 2025-05-07T19:47:35.8577986Z 2025-05-07T19:47:35.8577990Z 2025-05-07T19:47:35.8578093Z  2025-05-07T19:47:35.8578211Z 2025-05-07T19:47:35.8578215Z 2025-05-07T19:47:35.8578218Z 2025-05-07T19:47:35.8578222Z 2025-05-07T19:47:35.8578249Z 2025-05-07T19:47:35.8578354Z  2025-05-07T19:47:35.8578476Z 2025-05-07T19:47:35.8578480Z 2025-05-07T19:47:35.8578484Z 2025-05-07T19:47:35.8578488Z 2025-05-07T19:47:35.8578491Z 2025-05-07T19:47:35.8578495Z 2025-05-07T19:47:35.8578623Z  2025-05-07T19:47:35.8578751Z 2025-05-07T19:47:35.8578755Z 2025-05-07T19:47:35.8578758Z 2025-05-07T19:47:35.8578762Z 2025-05-07T19:47:35.8578766Z 2025-05-07T19:47:35.8578769Z 2025-05-07T19:47:35.8578773Z 2025-05-07T19:47:35.8578885Z  2025-05-07T19:47:35.8579043Z 2025-05-07T19:47:35.8579046Z 2025-05-07T19:47:35.8579050Z 2025-05-07T19:47:35.8579054Z 2025-05-07T19:47:35.8579057Z 2025-05-07T19:47:35.8579064Z 2025-05-07T19:47:35.8579067Z 2025-05-07T19:47:35.8579071Z 2025-05-07T19:47:35.8579185Z  2025-05-07T19:47:35.8579352Z 2025-05-07T19:47:35.8579356Z 2025-05-07T19:47:35.8579360Z 2025-05-07T19:47:35.8579363Z 2025-05-07T19:47:35.8579367Z 2025-05-07T19:47:35.8579371Z 2025-05-07T19:47:35.8579430Z 2025-05-07T19:47:35.8579434Z 2025-05-07T19:47:35.8579438Z 2025-05-07T19:47:35.8579557Z  2025-05-07T19:47:35.8579731Z 2025-05-07T19:47:35.8579735Z 2025-05-07T19:47:35.8579739Z 2025-05-07T19:47:35.8579742Z 2025-05-07T19:47:35.8579746Z 2025-05-07T19:47:35.8579749Z 2025-05-07T19:47:35.8579753Z 2025-05-07T19:47:35.8579757Z 2025-05-07T19:47:35.8579760Z 2025-05-07T19:47:35.8579764Z 2025-05-07T19:47:35.8579887Z  2025-05-07T19:47:35.8580053Z 2025-05-07T19:47:35.8580072Z 2025-05-07T19:47:35.8580076Z 2025-05-07T19:47:35.8580079Z 2025-05-07T19:47:35.8580083Z 2025-05-07T19:47:35.8580087Z 2025-05-07T19:47:35.8580090Z 2025-05-07T19:47:35.8580098Z 2025-05-07T19:47:35.8580101Z 2025-05-07T19:47:35.8580106Z 2025-05-07T19:47:35.8580109Z 2025-05-07T19:47:35.8580234Z  2025-05-07T19:47:35.8580425Z 2025-05-07T19:47:35.8580429Z 2025-05-07T19:47:35.8580433Z 2025-05-07T19:47:35.8580436Z 2025-05-07T19:47:35.8580443Z 2025-05-07T19:47:35.8580447Z 2025-05-07T19:47:35.8580450Z 2025-05-07T19:47:35.8580454Z 2025-05-07T19:47:35.8580457Z 2025-05-07T19:47:35.8580461Z 2025-05-07T19:47:35.8580464Z 2025-05-07T19:47:35.8580468Z 2025-05-07T19:47:35.8580606Z  done 2025-05-07T19:47:35.9561920Z Preparing transaction: | done 2025-05-07T19:47:36.1578549Z Verifying transaction: - \ done 2025-05-07T19:47:36.3668313Z Executing transaction: / - done 2025-05-07T19:47:38.3500408Z [INSTALL] Appending libcuda.so path to LD_LIBRARY_PATH ... 2025-05-07T19:47:38.3868155Z [ENV] Appending to LD_LIBRARY_PATH: /github/home/miniconda/envs/build_binary/lib/stubs ... 2025-05-07T19:47:40.1649422Z ERROR conda.cli.main_run:execute(125): `conda run printenv LD_LIBRARY_PATH` failed. (See above for error) 2025-05-07T19:47:40.2235020Z + conda env config vars set -n build_binary LD_LIBRARY_PATH=/github/home/miniconda/envs/build_binary/lib/stubs 2025-05-07T19:47:40.2235557Z 2025-05-07T19:47:40.6306574Z 2025-05-07T19:47:40.6309646Z [INSTALL] Setting environment variable NVML_LIB_PATH ... 2025-05-07T19:47:40.6657060Z + conda env config vars set -n build_binary NVML_LIB_PATH=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:47:40.6657814Z 2025-05-07T19:47:41.0783250Z 2025-05-07T19:47:41.0784088Z [INSTALL] Setting environment variable CUDA_INCLUDE_DIRS ... 2025-05-07T19:47:41.0786988Z + conda env config vars set -n build_binary CUDA_INCLUDE_DIRS="/github/home/miniconda/envs/build_binary/include/:/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include/" 2025-05-07T19:47:41.0788366Z 2025-05-07T19:47:41.4929839Z 2025-05-07T19:47:43.4468017Z [CHECK] cuda_runtime.h found in CONDA_PREFIX PATH (file): /github/home/miniconda/envs/build_binary/include/cuda_runtime.h 2025-05-07T19:47:45.4016305Z [CHECK] libcuda.so found in CONDA_PREFIX PATH (file): /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:47:47.3464342Z [CHECK] libnvToolsExt.so found in CONDA_PREFIX PATH (symbolic link): /github/home/miniconda/envs/build_binary/lib/libnvToolsExt.so 2025-05-07T19:47:49.3099150Z [CHECK] libnvidia-ml.so found in CONDA_PREFIX PATH (file): /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:47:51.1114936Z /github/home/miniconda/envs/build_binary/bin/nvcc 2025-05-07T19:47:51.1115294Z 2025-05-07T19:47:51.1694357Z [CHECK] Binary nvcc found in PATH 2025-05-07T19:47:54.8155087Z /tmp/tmpmnr87y_o: line 3: clang: command not found 2025-05-07T19:47:54.8155928Z 2025-05-07T19:47:54.8157124Z ERROR conda.cli.main_run:execute(125): `conda run clang --version` failed. (See above for error) 2025-05-07T19:47:54.8926942Z + ls -la /github/home/miniconda/envs/build_binary/etc/conda/activate.d 2025-05-07T19:47:54.8927300Z 2025-05-07T19:47:54.8947369Z total 32 2025-05-07T19:47:54.8947755Z drwxr-xr-x. 2 root root 161 May 7 19:45 . 2025-05-07T19:47:54.8948246Z drwxr-xr-x. 5 root root 62 May 7 19:44 .. 2025-05-07T19:47:54.8948691Z -rw-r--r--. 2 root root 3778 Jun 10 2024 activate-binutils_linux-64.sh 2025-05-07T19:47:54.8949598Z -rw-r--r--. 2 root root 11630 Jun 10 2024 activate-gcc_linux-64.sh 2025-05-07T19:47:54.8950047Z -rw-r--r--. 2 root root 5190 Jun 10 2024 activate-gxx_linux-64.sh 2025-05-07T19:47:54.8950495Z -rw-r--r--. 2 root root 136 Mar 27 01:27 libglib_activate.sh 2025-05-07T19:47:54.8950912Z -rw-r--r--. 2 root root 499 Mar 28 22:35 openjdk_activate.sh 2025-05-07T19:47:54.8951320Z 2025-05-07T19:47:54.8951483Z + conda run -n build_binary c++ --version | grep -i clang 2025-05-07T19:47:54.8951743Z 2025-05-07T19:47:56.8072393Z 2025-05-07T19:47:56.8072820Z [BUILD] Setting prepend flags for NVCC ... 2025-05-07T19:47:56.8073451Z + conda env config vars set -n build_binary NVCC_PREPEND_FLAGS="-allow-unsupported-compiler" 2025-05-07T19:47:56.8073856Z 2025-05-07T19:47:57.2247554Z 2025-05-07T19:47:57.2247914Z + conda run -n build_binary printenv NVCC_PREPEND_FLAGS 2025-05-07T19:47:57.2248200Z 2025-05-07T19:47:59.0360032Z -allow-unsupported-compiler 2025-05-07T19:47:59.0360298Z 2025-05-07T19:47:59.1153944Z 2025-05-07T19:47:59.1154386Z [INFO] Printing out all preprocessor defines in nvcc ... 2025-05-07T19:47:59.1154957Z + conda run -n build_binary nvcc --compiler-options -dM -E -x cu - < /dev/null 2025-05-07T19:47:59.1155302Z 2025-05-07T19:48:00.9749023Z #define _GLIBCXX_DEPRECATED_SUGGEST(ALT) __attribute__ ((__deprecated__ ("use '" ALT "' instead"))) 2025-05-07T19:48:00.9749680Z #define M_PIl 3.141592653589793238462643383279502884L 2025-05-07T19:48:00.9750033Z #define _IO_CURRENTLY_PUTTING 0x800 2025-05-07T19:48:00.9750379Z #define __W_EXITCODE(ret,sig) ((ret) << 8 | (sig)) 2025-05-07T19:48:00.9750757Z #define __DBL_MIN_EXP__ (-1021) 2025-05-07T19:48:00.9751124Z #define _STL_PAIR_H 1 2025-05-07T19:48:00.9751426Z #define _GLIBCXX_PURE __attribute__ ((__pure__)) 2025-05-07T19:48:00.9751766Z #define __cpp_attributes 200809L 2025-05-07T19:48:00.9752131Z #define __cpp_nontype_template_parameter_auto 201606L 2025-05-07T19:48:00.9752484Z #define __DELETE_THROW throw() 2025-05-07T19:48:00.9752790Z #define _PTRDIFF_T_ 2025-05-07T19:48:00.9753030Z #define M_PI_4 0.78539816339744830962 2025-05-07T19:48:00.9753342Z #define __UINT_LEAST16_MAX__ 0xffff 2025-05-07T19:48:00.9753636Z #define _IO_LEFT 02 2025-05-07T19:48:00.9753862Z #define __ATOMIC_ACQUIRE 2 2025-05-07T19:48:00.9754135Z #define _POSIX2_BC_SCALE_MAX 99 2025-05-07T19:48:00.9754410Z #define _GLIBCXX_USE_RANDOM_TR1 1 2025-05-07T19:48:00.9754870Z #define _GLIBCXX_MOVE_BACKWARD3(_Tp,_Up,_Vp) std::move_backward(_Tp, _Up, _Vp) 2025-05-07T19:48:00.9755312Z #define __FLT128_MAX_10_EXP__ 4932 2025-05-07T19:48:00.9755618Z #define RE_DUP_MAX (0x7fff) 2025-05-07T19:48:00.9756126Z #define _IOS_OUTPUT 2 2025-05-07T19:48:00.9756451Z #define __FLT_MIN__ 1.17549435082228750796873653722224568e-38F 2025-05-07T19:48:00.9756834Z #define __GCC_IEC_559_COMPLEX 2 2025-05-07T19:48:00.9757109Z #define _GLIBCXX_USE_FCHMOD 1 2025-05-07T19:48:00.9757416Z #define __cpp_aggregate_nsdmi 201304L 2025-05-07T19:48:00.9758263Z #define __bswap_16(x) (__extension__ ({ unsigned short int __v, __x = (unsigned short int) (x); if (__builtin_constant_p (__x)) __v = __bswap_constant_16 (__x); else __asm__ ("rorw $8, %w0" : "=r" (__v) : "0" (__x) : "cc"); __v; })) 2025-05-07T19:48:00.9759204Z #define __UINT_LEAST8_TYPE__ unsigned char 2025-05-07T19:48:00.9759533Z #define __SIZEOF_FLOAT80__ 16 2025-05-07T19:48:00.9759839Z #define cudaTextureTypeCubemapLayered 0xFC 2025-05-07T19:48:00.9760164Z #define _T_WCHAR_ 2025-05-07T19:48:00.9760396Z #define __WCLONE 0x80000000 2025-05-07T19:48:00.9760668Z #define stdout stdout 2025-05-07T19:48:00.9761003Z #define _GLIBCXX_ABI_TAG_CXX11 __attribute ((__abi_tag__ ("cxx11"))) 2025-05-07T19:48:00.9761411Z #define CHAR_BIT __CHAR_BIT__ 2025-05-07T19:48:00.9761668Z #define __flexarr [] 2025-05-07T19:48:00.9761929Z #define __SM_32_ATOMIC_FUNCTIONS_HPP__ 2025-05-07T19:48:00.9762246Z #define _GLIBCXX_HAVE_FINITEF 1 2025-05-07T19:48:00.9762515Z #define _IO_FLAGS2_USER_WBUF 8 2025-05-07T19:48:00.9762796Z #define _MATH_H 1 2025-05-07T19:48:00.9763175Z #define cudaOccupancyDisableCachingOverride 0x01 2025-05-07T19:48:00.9763533Z #define __S64_TYPE long int 2025-05-07T19:48:00.9763786Z #define __stub_fchflags 2025-05-07T19:48:00.9764065Z #define cudaDeviceScheduleMask 0x07 2025-05-07T19:48:00.9764359Z #define __SQUAD_TYPE long int 2025-05-07T19:48:00.9764641Z #define __INTMAX_C(c) c ## L 2025-05-07T19:48:00.9764900Z #define _BSD_SIZE_T_DEFINED_ 2025-05-07T19:48:00.9765172Z #define NL_NMAX INT_MAX 2025-05-07T19:48:00.9765430Z #define _BITS_TIME_H 1 2025-05-07T19:48:00.9765702Z #define M_LN10l 2.302585092994045684017991454684364208L 2025-05-07T19:48:00.9766079Z #define M_El 2.718281828459045235360287471352662498L 2025-05-07T19:48:00.9766489Z #define _PSTL_PRAGMA_DECLARE_SIMD _PSTL_PRAGMA(omp declare simd) 2025-05-07T19:48:00.9766881Z #define __CHAR_BIT__ 8 2025-05-07T19:48:00.9767141Z #define __FSWORD_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:48:00.9767476Z #define _PSTL_STRING_CONCAT(x,y) x #y 2025-05-07T19:48:00.9767775Z #define _GLIBCXX98_USE_C99_MATH 1 2025-05-07T19:48:00.9768057Z #define FP_NAN 0 2025-05-07T19:48:00.9768321Z #define makedev(maj,min) gnu_dev_makedev (maj, min) 2025-05-07T19:48:00.9768787Z #define __glibcxx_requires_sorted_set_pred(_First1,_Last1,_First2,_Pred) 2025-05-07T19:48:00.9769221Z #define SHRT_MAX __SHRT_MAX__ 2025-05-07T19:48:00.9769486Z #define _GLIBCXX_X86_RDSEED 1 2025-05-07T19:48:00.9769751Z #define _NEW 2025-05-07T19:48:00.9769967Z #define __UINT8_MAX__ 0xff 2025-05-07T19:48:00.9770245Z #define __SCHAR_WIDTH__ 8 2025-05-07T19:48:00.9770500Z #define _GLIBCXX_HAVE_LOCALE_H 1 2025-05-07T19:48:00.9770777Z #define __USE_ANSI 1 2025-05-07T19:48:00.9771064Z #define _IO_BE(expr,res) __builtin_expect ((expr), res) 2025-05-07T19:48:00.9771422Z #define __WINT_MAX__ 0xffffffffU 2025-05-07T19:48:00.9771714Z #define __SIZEOF_PTHREAD_ATTR_T 56 2025-05-07T19:48:00.9772009Z #define __FLT32_MIN_EXP__ (-125) 2025-05-07T19:48:00.9772287Z #define _GLIBCXX_END_NAMESPACE_LDBL 2025-05-07T19:48:00.9772587Z #define PIPE_BUF 4096 2025-05-07T19:48:00.9772921Z #define _PSTL_PRAGMA_SIMD_ORDERED_MONOTONIC_2ARGS(PRM1,PRM2) 2025-05-07T19:48:00.9773398Z #define EXIT_FAILURE 1 2025-05-07T19:48:00.9773678Z #define _PSTL_VERSION_PATCH (_PSTL_VERSION % 10) 2025-05-07T19:48:00.9773994Z #define MQ_PRIO_MAX 32768 2025-05-07T19:48:00.9774264Z #define __SIZEOF_PTHREAD_MUTEXATTR_T 4 2025-05-07T19:48:00.9774575Z #define __WAIT_INT(status) (*(int *) &(status)) 2025-05-07T19:48:00.9775050Z #define __GLIBC_PREREQ(maj,min) ((__GLIBC__ << 16) + __GLIBC_MINOR__ >= ((maj) << 16) + (min)) 2025-05-07T19:48:00.9775570Z #define cudaCooperativeLaunchMultiDeviceNoPreSync 0x01 2025-05-07T19:48:00.9775939Z #define _XOPEN_SOURCE 700 2025-05-07T19:48:00.9776276Z #define _POSIX2_BC_DIM_MAX 2048 2025-05-07T19:48:00.9776547Z #define __VECTOR_FUNCTIONS_HPP__ 2025-05-07T19:48:00.9776844Z #define __cpp_static_assert 201411L 2025-05-07T19:48:00.9777118Z #define __need_timer_t 2025-05-07T19:48:00.9777423Z #define __WEXITSTATUS(status) (((status) & 0xff00) >> 8) 2025-05-07T19:48:00.9777774Z #define _GLIBCXX_HAVE_STRXFRM_L 1 2025-05-07T19:48:00.9778063Z #define _POSIX_TTY_NAME_MAX 9 2025-05-07T19:48:00.9778339Z #define _GLIBCXX_USE_WEAK_REF __GXX_WEAK__ 2025-05-07T19:48:00.9781025Z #define __ORDER_LITTLE_ENDIAN__ 1234 2025-05-07T19:48:00.9781331Z #define __SIZE_MAX__ 0xffffffffffffffffUL 2025-05-07T19:48:00.9781645Z #define __WCHAR_MAX__ 0x7fffffff 2025-05-07T19:48:00.9781944Z #define __BLKCNT_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:48:00.9782257Z #define cudaNvSciSyncAttrSignal 0x1 2025-05-07T19:48:00.9782558Z #define _GLIBCXX_USE_LONG_LONG 1 2025-05-07T19:48:00.9782841Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_1 1 2025-05-07T19:48:00.9783185Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_2 1 2025-05-07T19:48:00.9783501Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_4 1 2025-05-07T19:48:00.9783920Z #define __DBL_DENORM_MIN__ double(4.94065645841246544176568792868221372e-324L) 2025-05-07T19:48:00.9784337Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_8 1 2025-05-07T19:48:00.9785932Z #define ADJ_ESTERROR 0x0008 2025-05-07T19:48:00.9786212Z #define __GCC_ATOMIC_CHAR_LOCK_FREE 2 2025-05-07T19:48:00.9786488Z #define __GCC_IEC_559 2 2025-05-07T19:48:00.9786784Z #define __cpp_lib_transformation_trait_aliases 201304 2025-05-07T19:48:00.9787116Z #define _IO_flockfile(_fp) 2025-05-07T19:48:00.9787384Z #define CLOCK_MONOTONIC_RAW 4 2025-05-07T19:48:00.9787645Z #define __FLT32X_DECIMAL_DIG__ 17 2025-05-07T19:48:00.9787917Z #define _IOFBF 0 2025-05-07T19:48:00.9788121Z #define __USE_BSD 1 2025-05-07T19:48:00.9788351Z #define __FLT_EVAL_METHOD__ 0 2025-05-07T19:48:00.9788615Z #define SHRT_MIN (-SHRT_MAX - 1) 2025-05-07T19:48:00.9788895Z #define _IO_USER_LOCK 0x8000 2025-05-07T19:48:00.9789163Z #define _IO_NO_WRITES 8 2025-05-07T19:48:00.9789409Z #define _GLIBCXX_PSEUDO_VISIBILITY(V) 2025-05-07T19:48:00.9789775Z #define __ASMNAME2(prefix,cname) __STRING (prefix) cname 2025-05-07T19:48:00.9790133Z #define _GLIBCXX_HAVE_SYS_STAT_H 1 2025-05-07T19:48:00.9790439Z #define __cpp_binary_literals 201304L 2025-05-07T19:48:00.9791108Z #define _CPP_TYPE_TRAITS_H 1 2025-05-07T19:48:00.9791395Z #define __FLT64_DECIMAL_DIG__ 17 2025-05-07T19:48:00.9791798Z #define _GLIBCXX_SYNCHRONIZATION_HAPPENS_AFTER(A) 2025-05-07T19:48:00.9792261Z #define _G_HAVE_ST_BLKSIZE defined (_STATBUF_ST_BLKSIZE) 2025-05-07T19:48:00.9792653Z #define __cpp_noexcept_function_type 201510L 2025-05-07T19:48:00.9792994Z #define __cpp_lib_make_reverse_iterator 201402 2025-05-07T19:48:00.9793323Z #define M_PI 3.14159265358979323846 2025-05-07T19:48:00.9793630Z #define _GLIBCXX_PACKAGE_NAME "package-unused" 2025-05-07T19:48:00.9793972Z #define _GLIBCXX_HAVE_BUILTIN_IS_SAME 1 2025-05-07T19:48:00.9794429Z #define __NV_GLIBCXX_VERSION (__GNUC__ * 10000 + __GNUC_MINOR__ * 100 + __GNUC_PATCHLEVEL__) 2025-05-07T19:48:00.9794914Z #define __GCC_ATOMIC_CHAR32_T_LOCK_FREE 2 2025-05-07T19:48:00.9795223Z #define _POSIX_DELAYTIMER_MAX 32 2025-05-07T19:48:00.9795517Z #define _GLIBCXX_USE_UTIME 1 2025-05-07T19:48:00.9795799Z #define _STL_ITERATOR_BASE_FUNCS_H 1 2025-05-07T19:48:00.9796414Z #define _IO_peekc_unlocked(_fp) (_IO_BE ((_fp)->_IO_read_ptr >= (_fp)->_IO_read_end, 0) && __underflow (_fp) == EOF ? EOF : *(unsigned char *) (_fp)->_IO_read_ptr) 2025-05-07T19:48:00.9797053Z #define _GLIBCXX_TR1_ELL_INTEGRAL_TCC 1 2025-05-07T19:48:00.9797383Z #define w_termsig __wait_terminated.__w_termsig 2025-05-07T19:48:00.9797724Z #define __FLOAT_WORD_ORDER __BYTE_ORDER 2025-05-07T19:48:00.9798020Z #define XATTR_SIZE_MAX 65536 2025-05-07T19:48:00.9798414Z #define be64toh(x) __bswap_64 (x) 2025-05-07T19:48:00.9798712Z #define __ASSERT_VOID_CAST static_cast 2025-05-07T19:48:00.9799042Z #define __cpp_variadic_templates 200704L 2025-05-07T19:48:00.9799466Z #define RAND_MAX 2147483647 2025-05-07T19:48:00.9799729Z #define _GLIBCXX_USE_C99_COMPLEX_TR1 1 2025-05-07T19:48:00.9800064Z #define __UINT_FAST64_MAX__ 0xffffffffffffffffUL 2025-05-07T19:48:00.9800374Z #define __SM_90_RT_H__ 2025-05-07T19:48:00.9800630Z #define __SIG_ATOMIC_TYPE__ int 2025-05-07T19:48:00.9800888Z #define __COMPAR_FN_T 2025-05-07T19:48:00.9801139Z #define __GID_T_TYPE __U32_TYPE 2025-05-07T19:48:00.9801398Z #define _IO_BAD_SEEN 0x4000 2025-05-07T19:48:00.9801894Z #define _PSTL_PRAGMA_MESSAGE_IMPL(x) _PSTL_PRAGMA(message(_PSTL_STRING_CONCAT(_PSTL_PRAGMA_LOCATION, x))) 2025-05-07T19:48:00.9802432Z #define __DBL_MIN_10_EXP__ (-307) 2025-05-07T19:48:00.9802767Z #define __glibcxx_requires_sorted_pred(_First,_Last,_Pred) 2025-05-07T19:48:00.9803135Z #define __FINITE_MATH_ONLY__ 0 2025-05-07T19:48:00.9803422Z #define _PSTL_PRAGMA_SIMD_INCLUSIVE_SCAN(PRM) 2025-05-07T19:48:00.9803767Z #define cudaArrayColorAttachment 0x20 2025-05-07T19:48:00.9804070Z #define __cpp_variable_templates 201304L 2025-05-07T19:48:00.9804420Z #define __cpp_lib_integral_constant_callable 201304 2025-05-07T19:48:00.9804743Z #define _GLIBCXX_HAVE_SINHF 1 2025-05-07T19:48:00.9805018Z #define MOD_TIMECONST ADJ_TIMECONST 2025-05-07T19:48:00.9805320Z #define __cpp_lib_result_of_sfinae 201210 2025-05-07T19:48:00.9805613Z #define __SM_30_INTRINSICS_H__ 2025-05-07T19:48:00.9805971Z #define __FLT32X_MAX_EXP__ 1024 2025-05-07T19:48:00.9806229Z #define _GLIBCXX_USE_WCHAR_T 1 2025-05-07T19:48:00.9806495Z #define _GLIBCXX_MATH_H 1 2025-05-07T19:48:00.9806731Z #define __u_char_defined 2025-05-07T19:48:00.9807051Z #define WIFEXITED(status) __WIFEXITED (__WAIT_INT (status)) 2025-05-07T19:48:00.9807406Z #define STA_PPSERROR 0x0800 2025-05-07T19:48:00.9807670Z #define _GLIBCXX_STD_A std 2025-05-07T19:48:00.9807919Z #define __FLT32_HAS_DENORM__ 1 2025-05-07T19:48:00.9808196Z #define _POSIX_SYMLINK_MAX 255 2025-05-07T19:48:00.9808484Z #define _GLIBCXX_BEGIN_NAMESPACE_VERSION 2025-05-07T19:48:00.9808926Z #define __device_builtin_texture_type__ __location__(device_builtin_texture_type) 2025-05-07T19:48:00.9809369Z #define FP_INFINITE 1 2025-05-07T19:48:00.9809730Z #define _GLIBCXX11_DEPRECATED_SUGGEST(ALT) _GLIBCXX_DEPRECATED_SUGGEST(ALT) 2025-05-07T19:48:00.9810159Z #define _IO_pid_t __pid_t 2025-05-07T19:48:00.9810401Z #define __UINT_FAST8_MAX__ 0xff 2025-05-07T19:48:00.9810671Z #define __LEAF , __leaf__ 2025-05-07T19:48:00.9810906Z #define PATH_MAX 4096 2025-05-07T19:48:00.9811164Z #define __cpp_rvalue_reference 200610L 2025-05-07T19:48:00.9811493Z #define __LDBL_REDIR1(name,proto,alias) name proto 2025-05-07T19:48:00.9811825Z #define _LIMITS_H___ 2025-05-07T19:48:00.9812056Z #define __size_t 2025-05-07T19:48:00.9812277Z #define _GLIBCXX_HAVE_FREXPF 1 2025-05-07T19:48:00.9812843Z #define STA_RONLY (STA_PPSSIGNAL | STA_PPSJITTER | STA_PPSWANDER | STA_PPSERROR | STA_CLOCKERR | STA_NANO | STA_MODE | STA_CLK) 2025-05-07T19:48:00.9813420Z #define _GLIBCXX_HAVE_FREXPL 1 2025-05-07T19:48:00.9813739Z #define __cpp_nested_namespace_definitions 201411L 2025-05-07T19:48:00.9814064Z #define __DEC64_MAX_EXP__ 385 2025-05-07T19:48:00.9814329Z #define _WCHAR_T_DEFINED 2025-05-07T19:48:00.9814679Z #define __glibcxx_requires_can_decrement_range(_First1,_Last1,_First2) 2025-05-07T19:48:00.9815091Z #define MOD_STATUS ADJ_STATUS 2025-05-07T19:48:00.9815365Z #define _GLIBCXX_HAVE_STDINT_H 1 2025-05-07T19:48:00.9815642Z #define __SIZEOF_PTHREAD_CONDATTR_T 4 2025-05-07T19:48:00.9815926Z #define __INT8_C(c) c 2025-05-07T19:48:00.9816152Z #define _GLIBCXX_HAVE_COSHF 1 2025-05-07T19:48:00.9816421Z #define _GLIBCXX_HAVE_COSHL 1 2025-05-07T19:48:00.9816670Z #define __SM_70_RT_HPP__ 2025-05-07T19:48:00.9816927Z #define __INT_LEAST8_WIDTH__ 8 2025-05-07T19:48:00.9817191Z #define __cpp_variadic_using 201611L 2025-05-07T19:48:00.9817522Z #define __UINT_LEAST64_MAX__ 0xffffffffffffffffUL 2025-05-07T19:48:00.9817856Z #define __INT_LEAST8_MAX__ 0x7f 2025-05-07T19:48:00.9818121Z #define __SM_61_INTRINSICS_HPP__ 2025-05-07T19:48:00.9818404Z #define _IO_FLAGS2_MMAP 1 2025-05-07T19:48:00.9818746Z #define __cpp_capture_star_this 201603L 2025-05-07T19:48:00.9819058Z #define _GLIBCXX_HAVE_ENDIAN_H 1 2025-05-07T19:48:00.9819411Z #define __always_inline __inline __attribute__ ((__always_inline__)) 2025-05-07T19:48:00.9819802Z #define NFDBITS __NFDBITS 2025-05-07T19:48:00.9820052Z #define _PSTL_PRAGMA_FORCEINLINE 2025-05-07T19:48:00.9820350Z #define _GLIBCXX_HAVE_SYS_STATVFS_H 1 2025-05-07T19:48:00.9820662Z #define __glibcxx_requires_sorted(_First,_Last) 2025-05-07T19:48:00.9820995Z #define __SHRT_MAX__ 0x7fff 2025-05-07T19:48:00.9821256Z #define _GLIBCXX_SYMVER_GNU 1 2025-05-07T19:48:00.9821536Z #define w_stopval __wait_stopped.__w_stopval 2025-05-07T19:48:00.9821908Z #define __LDBL_MAX__ 1.18973149535723176502126385303097021e+4932L 2025-05-07T19:48:00.9822320Z #define _GLIBCXX_USE_C99_COMPLEX _GLIBCXX11_USE_C99_COMPLEX 2025-05-07T19:48:00.9822688Z #define __FLT64X_MAX_10_EXP__ 4932 2025-05-07T19:48:00.9822967Z #define __cpp_if_constexpr 201606L 2025-05-07T19:48:00.9823261Z #define _GLIBCXX_HAVE_WCHAR_H 1 2025-05-07T19:48:00.9823573Z #define _GLIBCXX_USE_C99_STDIO _GLIBCXX11_USE_C99_STDIO 2025-05-07T19:48:00.9823921Z #define __daddr_t_defined 2025-05-07T19:48:00.9824181Z #define __LDBL_IS_IEC_60559__ 2 2025-05-07T19:48:00.9824445Z #define _GLIBCXX_TR1_RIEMANN_ZETA_TCC 1 2025-05-07T19:48:00.9824836Z #define _GLIBCXX_HAVE_STRUCT_DIRENT_D_TYPE 1 2025-05-07T19:48:00.9825351Z #define _PSTL_CPP11_STD_ROTATE_BROKEN ((__GLIBCXX__ && __GLIBCXX__ < 20150716) || (_MSC_VER && _MSC_VER < 1800)) 2025-05-07T19:48:00.9825860Z #define _ACRTIMP 2025-05-07T19:48:00.9826076Z #define _IO_EOF_SEEN 0x10 2025-05-07T19:48:00.9826347Z #define _GLIBCXX_TR1_POLY_LAGUERRE_TCC 1 2025-05-07T19:48:00.9826629Z #define _IOS_BIN 128 2025-05-07T19:48:00.9826996Z #define __fortify_function __extern_always_inline __attribute_artificial__ 2025-05-07T19:48:00.9827433Z #define _GLIBCXX_HAVE_VFWSCANF 1 2025-05-07T19:48:00.9827705Z #define __FLT64X_HAS_QUIET_NAN__ 1 2025-05-07T19:48:00.9827981Z #define UNDERFLOW 4 2025-05-07T19:48:00.9828196Z #define NAME_MAX 255 2025-05-07T19:48:00.9828438Z #define SCHAR_MAX __SCHAR_MAX__ 2025-05-07T19:48:00.9828701Z #define __UINT_LEAST8_MAX__ 0xff 2025-05-07T19:48:00.9828984Z #define __GCC_ATOMIC_BOOL_LOCK_FREE 2 2025-05-07T19:48:00.9829372Z #define __FLT128_DENORM_MIN__ 6.47517511943802511092443895822764655e-4966F128 2025-05-07T19:48:00.9829779Z #define __ptr_t void * 2025-05-07T19:48:00.9830024Z #define M_E 2.7182818284590452354 2025-05-07T19:48:00.9830291Z #define cudaSurfaceType1D 0x01 2025-05-07T19:48:00.9830563Z #define __USE_ISOCXX11 1 2025-05-07T19:48:00.9830817Z #define __UINTMAX_TYPE__ long unsigned int 2025-05-07T19:48:00.9831213Z #define _GLIBCXX_HAVE_OBSOLETE_ISINF 1 2025-05-07T19:48:00.9831681Z #define cudaDeviceBlockingSync 0x04 2025-05-07T19:48:00.9831994Z #define CLOCK_MONOTONIC_COARSE 6 2025-05-07T19:48:00.9832296Z #define _GLIBCXX_OS_DEFINES 1 2025-05-07T19:48:00.9832601Z #define _GLIBCXX_NODISCARD [[__nodiscard__]] 2025-05-07T19:48:00.9832918Z #define cudaSurfaceType2D 0x02 2025-05-07T19:48:00.9833198Z #define __linux 1 2025-05-07T19:48:00.9833443Z #define __DEC32_EPSILON__ 1E-6DF 2025-05-07T19:48:00.9833734Z #define cudaDeviceMask 0x1f 2025-05-07T19:48:00.9834013Z #define _GLIBCXX_END_NAMESPACE_ALGO 2025-05-07T19:48:00.9834301Z #define __CUDA_API_VER_MAJOR__ 11 2025-05-07T19:48:00.9834597Z #define htobe16(x) __bswap_16 (x) 2025-05-07T19:48:00.9834902Z #define _GLIBCXX_USE_C99_INTTYPES_WCHAR_T_TR1 1 2025-05-07T19:48:00.9835251Z #define HUGE_VALF (__builtin_huge_valf()) 2025-05-07T19:48:00.9835557Z #define __FLT_EVAL_METHOD_TS_18661_3__ 0 2025-05-07T19:48:00.9835875Z #define HUGE_VALL (__builtin_huge_vall()) 2025-05-07T19:48:00.9836167Z #define _BITS_TYPES_H 1 2025-05-07T19:48:00.9836464Z #define ULONG_LONG_MAX (LONG_LONG_MAX * 2ULL + 1ULL) 2025-05-07T19:48:00.9836822Z #define _IO_cleanup_region_end(_Doit) 2025-05-07T19:48:00.9837120Z #define cudaSurfaceType3D 0x03 2025-05-07T19:48:00.9837424Z #define _GLIBCXX_HAVE_SYS_TIME_H 1 2025-05-07T19:48:00.9837799Z #define __cudaGet_blockIdx() blockIdx 2025-05-07T19:48:00.9838096Z #define _IO_DONT_CLOSE 0100000 2025-05-07T19:48:00.9838943Z #define __MATHDECLX(type,function,suffix,args,attrib) __MATHDECL_1(type, function,suffix, args) __attribute__ (attrib); __MATHDECL_1(type, __CONCAT(__,function),suffix, args) __attribute__ (attrib) 2025-05-07T19:48:00.9839816Z #define cudaHostRegisterDefault 0x00 2025-05-07T19:48:00.9840119Z #define __unix 1 2025-05-07T19:48:00.9840334Z #define MATH_ERRNO 1 2025-05-07T19:48:00.9840603Z #define _GLIBCXX_STDIO_SEEK_END 2 2025-05-07T19:48:00.9840887Z #define _GLIBCXX_USE_FCHMODAT 1 2025-05-07T19:48:00.9841178Z #define __UINT32_MAX__ 0xffffffffU 2025-05-07T19:48:00.9841480Z #define __GXX_EXPERIMENTAL_CXX0X__ 1 2025-05-07T19:48:00.9841773Z #define __UID_T_TYPE __U32_TYPE 2025-05-07T19:48:00.9842075Z #define _GLIBCXX_HAVE_ATOMIC_LOCK_POLICY 1 2025-05-07T19:48:00.9842555Z #define __CUDART_API_VERSION ((__CUDA_API_VER_MAJOR__ * 1000) + (__CUDA_API_VER_MINOR__ * 10)) 2025-05-07T19:48:00.9843052Z #define __SM_20_INTRINSICS_H__ 2025-05-07T19:48:00.9843324Z #define CUDARTAPI_CDECL 2025-05-07T19:48:00.9843594Z #define _PSTL_USAGE_WARNINGS 0 2025-05-07T19:48:00.9843968Z #define _GLIBCXX98_USE_C99_COMPLEX 1 2025-05-07T19:48:00.9844263Z #define __cpp_lib_void_t 201411 2025-05-07T19:48:00.9844515Z #define _POSIX_AIO_MAX 1 2025-05-07T19:48:00.9844817Z #define __SIZE_T 2025-05-07T19:48:00.9845060Z #define _GLIBCXX_FULLY_DYNAMIC_STRING 0 2025-05-07T19:48:00.9845347Z #define _POSIX_PIPE_BUF 512 2025-05-07T19:48:00.9845619Z #define _GLIBCXX_HAVE_STRTOLD 1 2025-05-07T19:48:00.9845873Z #define _ATFILE_SOURCE 1 2025-05-07T19:48:00.9846268Z #define __glibcxx_assert(cond) do { __glibcxx_constexpr_assert(cond); } while (false) 2025-05-07T19:48:00.9846704Z #define __WAIT_STATUS void * 2025-05-07T19:48:00.9846973Z #define __MATH_FUNCTIONS_H__ 2025-05-07T19:48:00.9847230Z #define _GLIBCXX_HAVE_WCSTOF 1 2025-05-07T19:48:00.9847502Z #define __FLT128_MIN_EXP__ (-16381) 2025-05-07T19:48:00.9847780Z #define __WINT_MIN__ 0U 2025-05-07T19:48:00.9848370Z #define _PSTL_CPP14_VARIABLE_TEMPLATES_PRESENT (!__INTEL_COMPILER || __INTEL_COMPILER >= 1700) && (_MSC_FULL_VER >= 190023918 || __cplusplus >= 201402L) 2025-05-07T19:48:00.9849020Z #define WUNTRACED 2 2025-05-07T19:48:00.9849251Z #define __SIZEOF_PTHREAD_RWLOCKATTR_T 8 2025-05-07T19:48:00.9849545Z #define NZERO 20 2025-05-07T19:48:00.9849850Z #define WEXITSTATUS(status) __WEXITSTATUS (__WAIT_INT (status)) 2025-05-07T19:48:00.9850241Z #define _GLIBCXX_HAVE_MEMALIGN 1 2025-05-07T19:48:00.9850512Z #define _PSTL_PRAGMA(x) _Pragma(#x) 2025-05-07T19:48:00.9850812Z #define MOD_CLKA ADJ_OFFSET_SINGLESHOT 2025-05-07T19:48:00.9851116Z #define MOD_CLKB ADJ_TICK 2025-05-07T19:48:00.9851368Z #define __FLT128_MIN_10_EXP__ (-4931) 2025-05-07T19:48:00.9851661Z #define __FLT32X_IS_IEC_60559__ 2 2025-05-07T19:48:00.9851928Z #define __DEVICE_FUNCTIONS_H__ 2025-05-07T19:48:00.9852209Z #define SCHAR_MIN (-SCHAR_MAX - 1) 2025-05-07T19:48:00.9852479Z #define ADJ_MAXERROR 0x0004 2025-05-07T19:48:00.9852746Z #define __INT_LEAST16_WIDTH__ 16 2025-05-07T19:48:00.9853003Z #define _SIZE_T_DEFINED_ 2025-05-07T19:48:00.9853261Z #define _POSIX_AIO_LISTIO_MAX 2 2025-05-07T19:48:00.9853572Z #define __LDBL_REDIR_NTH(name,proto) name proto __THROW 2025-05-07T19:48:00.9853922Z #define __SCHAR_MAX__ 0x7f 2025-05-07T19:48:00.9854182Z #define __FLT128_MANT_DIG__ 113 2025-05-07T19:48:00.9854448Z #define __USING_NAMESPACE_STD(name) 2025-05-07T19:48:00.9854748Z #define _IO_UNIFIED_JUMPTABLES 1 2025-05-07T19:48:00.9855030Z #define __WCHAR_MIN__ (-__WCHAR_MAX__ - 1) 2025-05-07T19:48:00.9855342Z #define __KERNEL_STRICT_NAMES 2025-05-07T19:48:00.9855636Z #define _IO_stderr ((_IO_FILE*)(&_IO_2_1_stderr_)) 2025-05-07T19:48:00.9856069Z #define _IO_ferror_unlocked(__fp) (((__fp)->_flags & _IO_ERR_SEEN) != 0) 2025-05-07T19:48:00.9856463Z #define _FUNCTEXCEPT_H 1 2025-05-07T19:48:00.9856718Z #define __INT64_C(c) c ## L 2025-05-07T19:48:00.9856982Z #define __NTH(fct) __LEAF_ATTR fct throw () 2025-05-07T19:48:00.9857389Z #define _GLIBCXX_CONST __attribute__ ((__const__)) 2025-05-07T19:48:00.9857728Z #define _GLIBCXX_HAVE_LINK 1 2025-05-07T19:48:00.9857998Z #define cudaNvSciSyncAttrWait 0x2 2025-05-07T19:48:00.9858307Z #define __GCC_ATOMIC_POINTER_LOCK_FREE 2 2025-05-07T19:48:00.9858605Z #define STA_PPSWANDER 0x0400 2025-05-07T19:48:00.9858876Z #define __INT_WCHAR_T_H 2025-05-07T19:48:00.9859127Z #define _POSIX_THREAD_THREADS_MAX 64 2025-05-07T19:48:00.9859424Z #define _POSIX_MQ_OPEN_MAX 8 2025-05-07T19:48:00.9859679Z #define __have_pthread_attr_t 1 2025-05-07T19:48:00.9859942Z #define FP_NORMAL 4 2025-05-07T19:48:00.9860156Z #define _BITS_TIMEX_H 1 2025-05-07T19:48:00.9860400Z #define _POSIX_LINK_MAX 8 2025-05-07T19:48:00.9860662Z #define _GLIBCXX_HAVE_LIMIT_FSIZE 1 2025-05-07T19:48:00.9860936Z #define _GLIBCXX_HAVE_ATAN2F 1 2025-05-07T19:48:00.9861212Z #define cudaTextureType1D 0x01 2025-05-07T19:48:00.9861471Z #define _GLIBCXX_HAVE_ATAN2L 1 2025-05-07T19:48:00.9861737Z #define COLL_WEIGHTS_MAX 255 2025-05-07T19:48:00.9862153Z #define __attribute_format_strfmon__(a,b) __attribute__ ((__format__ (__strfmon__, a, b))) 2025-05-07T19:48:00.9862617Z #define _IO_MAGIC 0xFBAD0000 2025-05-07T19:48:00.9862871Z #define _GLIBCXX_USE_SENDFILE 1 2025-05-07T19:48:00.9863140Z #define _POSIX_SOURCE 1 2025-05-07T19:48:00.9863394Z #define cudaTextureType2D 0x02 2025-05-07T19:48:00.9863712Z #define _PTR_TRAITS_H 1 2025-05-07T19:48:00.9863987Z #define _GLIBCXX_NOEXCEPT_QUAL noexcept (_NE) 2025-05-07T19:48:00.9864302Z #define __CUDA_TEXTURE_TYPES_H__ 2025-05-07T19:48:00.9864589Z #define _POSIX2_BC_STRING_MAX 1000 2025-05-07T19:48:00.9864903Z #define __attribute_used__ __attribute__ ((__used__)) 2025-05-07T19:48:00.9865252Z #define cudaTextureType3D 0x03 2025-05-07T19:48:00.9865516Z #define _STDIO_USES_IOSTREAM 2025-05-07T19:48:00.9865783Z #define CLOCK_REALTIME 0 2025-05-07T19:48:00.9866022Z #define __FLT32X_MANT_DIG__ 53 2025-05-07T19:48:00.9866306Z #define __GCC_ATOMIC_CHAR16_T_LOCK_FREE 2 2025-05-07T19:48:00.9866623Z #define __cpp_aligned_new 201606L 2025-05-07T19:48:00.9866896Z #define __USER_LABEL_PREFIX__ 2025-05-07T19:48:00.9867197Z #define __MODE_T_TYPE __U32_TYPE 2025-05-07T19:48:00.9867490Z #define CLOCK_REALTIME_ALARM 8 2025-05-07T19:48:00.9867814Z #define cudaEventBlockingSync 0x01 2025-05-07T19:48:00.9868133Z #define _GLIBCXX_USE_PTHREAD_RWLOCK_T 1 2025-05-07T19:48:00.9868493Z #define _GLIBCXX_HAVE_LINUX_RANDOM_H 1 2025-05-07T19:48:00.9868801Z #define _GLIBCXX_USE_C99_FENV_TR1 1 2025-05-07T19:48:00.9869132Z #define __FLT32_MAX_10_EXP__ 38 2025-05-07T19:48:00.9869405Z #define __GLIBC__ 2 2025-05-07T19:48:00.9869678Z #define __END_DECLS } 2025-05-07T19:48:00.9869968Z #define FP_ILOGB0 (-2147483647 - 1) 2025-05-07T19:48:00.9870355Z #define __FLT64X_EPSILON__ 1.08420217248550443400745280086994171e-19F64x 2025-05-07T19:48:00.9870805Z #define __CONCAT(x,y) x ## y 2025-05-07T19:48:00.9871157Z #define __STDC_HOSTED__ 1 2025-05-07T19:48:00.9871642Z #define _GLIBCXX_HAVE_ARPA_INET_H 1 2025-05-07T19:48:00.9871952Z #define _ALLOCA_H 1 2025-05-07T19:48:00.9872279Z #define __host__ __location__(host) 2025-05-07T19:48:00.9872749Z #define __warndecl(name,msg) extern void name (void) __attribute__((__warning__ (msg))) 2025-05-07T19:48:00.9873271Z #define __SLONG32_TYPE int 2025-05-07T19:48:00.9873579Z #define __SSIZE_T_TYPE __SWORD_TYPE 2025-05-07T19:48:00.9873909Z #define _GLIBCXX_DEBUG_ASSERTIONS_H 1 2025-05-07T19:48:00.9874264Z #define _SYS_SELECT_H 1 2025-05-07T19:48:00.9874540Z #define _IO_LINE_BUF 0x200 2025-05-07T19:48:00.9874848Z #define _IOS_NOCREATE 32 2025-05-07T19:48:00.9875128Z #define __DEC64_MIN_EXP__ (-382) 2025-05-07T19:48:00.9875472Z #define __cudaGet_warpSize() warpSize 2025-05-07T19:48:00.9875817Z #define __RLIM_T_TYPE __SYSCALL_ULONG_TYPE 2025-05-07T19:48:00.9876189Z #define _GLIBCXX_HAVE_LIMIT_VMEM 0 2025-05-07T19:48:00.9876506Z #define __global__ __location__(global) 2025-05-07T19:48:00.9876861Z #define __GNU_LIBRARY__ 6 2025-05-07T19:48:00.9877184Z #define __cpp_decltype_auto 201304L 2025-05-07T19:48:00.9877498Z #define __DBL_DIG__ 15 2025-05-07T19:48:00.9877858Z #define TIME_UTC 1 2025-05-07T19:48:00.9878098Z #define __FLT32_DIG__ 6 2025-05-07T19:48:00.9878486Z #define __forceinline__ __inline__ __attribute__((always_inline)) 2025-05-07T19:48:00.9878914Z #define cudaHostAllocWriteCombined 0x04 2025-05-07T19:48:00.9879284Z #define cudaDeviceScheduleAuto 0x00 2025-05-07T19:48:00.9879603Z #define _G_BUFSIZ 8192 2025-05-07T19:48:00.9879968Z #define __FLT_EPSILON__ 1.19209289550781250000000000000000000e-7F 2025-05-07T19:48:00.9880366Z #define cudaTextureTypeCubemap 0x0C 2025-05-07T19:48:00.9880705Z #define STA_CLOCKERR 0x1000 2025-05-07T19:48:00.9881003Z #define __GXX_WEAK__ 1 2025-05-07T19:48:00.9881261Z #define _GLIBCXX_HAVE_ISNANF 1 2025-05-07T19:48:00.9881574Z #define __SHRT_WIDTH__ 16 2025-05-07T19:48:00.9881898Z #define __cpp_lib_robust_nonmodifying_seq_ops 201304 2025-05-07T19:48:00.9882297Z #define _GLIBCXX_BITS_SPECFUN_H 1 2025-05-07T19:48:00.9882601Z #define _GLIBCXX_HAVE_ISNANL 1 2025-05-07T19:48:00.9882972Z #define M_LOG2El 1.442695040888963407359924681001892137L 2025-05-07T19:48:00.9883349Z #define ADJ_OFFSET_SINGLESHOT 0x8001 2025-05-07T19:48:00.9883787Z #define _GCC_WCHAR_T 2025-05-07T19:48:00.9884025Z #define TMP_MAX 238328 2025-05-07T19:48:00.9884258Z #define __FLT32_IS_IEC_60559__ 2 2025-05-07T19:48:00.9884519Z #define __DEVICE_TYPES_H__ 2025-05-07T19:48:00.9884818Z #define __DEV_T_TYPE __UQUAD_TYPE 2025-05-07T19:48:00.9885088Z #define _EXT_NUMERIC_TRAITS 1 2025-05-07T19:48:00.9885346Z #define _GLIBCXX_BEGIN_NAMESPACE_ALGO 2025-05-07T19:48:00.9885629Z #define _IO_SKIPWS 01 2025-05-07T19:48:00.9885854Z #define _IO_SCIENTIFIC 04000 2025-05-07T19:48:00.9886120Z #define _GLIBCXX_HAVE_STRING_H 1 2025-05-07T19:48:00.9886432Z #define __LDBL_MIN__ 3.36210314311209350626267781732175260e-4932L 2025-05-07T19:48:00.9886795Z #define cudaDeviceScheduleSpin 0x01 2025-05-07T19:48:00.9887154Z #define __nonnull(params) __attribute__ ((__nonnull__ params)) 2025-05-07T19:48:00.9887497Z #define __DBL_IS_IEC_60559__ 2 2025-05-07T19:48:00.9887746Z #define le32toh(x) (x) 2025-05-07T19:48:00.9887964Z #define _SIZE_T_DEFINED 2025-05-07T19:48:00.9888259Z #define cudaArraySparsePropertiesSingleMipTail 0x1 2025-05-07T19:48:00.9888585Z #define __DEC32_MAX__ 9.999999E96DF 2025-05-07T19:48:00.9888860Z #define _GLIBCXX_HAVE_FMODL 1 2025-05-07T19:48:00.9889105Z #define _GLIBCXX_HAVE_POLL 1 2025-05-07T19:48:00.9889364Z #define __SM_32_INTRINSICS_H__ 2025-05-07T19:48:00.9889847Z #define cudaSignalExternalSemaphoresAsync __CUDART_API_PTSZ(cudaSignalExternalSemaphoresAsync_v2) 2025-05-07T19:48:00.9890372Z #define _POSIX_NAME_MAX 14 2025-05-07T19:48:00.9891471Z #define __cpp_threadsafe_static_init 200806L 2025-05-07T19:48:00.9892019Z #define _GLIBCXX_MAKE_MOVE_IF_NOEXCEPT_ITERATOR(_Iter) std::__make_move_if_noexcept_iterator(_Iter) 2025-05-07T19:48:00.9892558Z #define _GLIBCXX_USE_CLOCK_REALTIME 1 2025-05-07T19:48:00.9892873Z #define __cpp_enumerator_attributes 201411L 2025-05-07T19:48:00.9893184Z #define _WCHAR_T_ 2025-05-07T19:48:00.9893409Z #define _GLIBCXX_FAST_MATH 0 2025-05-07T19:48:00.9893801Z #define __FLT64X_DENORM_MIN__ 3.64519953188247460252840593361941982e-4951F64x 2025-05-07T19:48:00.9894197Z #define RTSIG_MAX 32 2025-05-07T19:48:00.9894433Z #define _STDDEF_H 2025-05-07T19:48:00.9894672Z #define CU_UUID_HAS_BEEN_DEFINED 2025-05-07T19:48:00.9894942Z #define _VA_LIST_DEFINED 2025-05-07T19:48:00.9895210Z #define __FLT32X_HAS_INFINITY__ 1 2025-05-07T19:48:00.9895542Z #define __glibcxx_requires_non_empty_range(_First,_Last) 2025-05-07T19:48:00.9895951Z #define __grid_constant__ __location__(grid_constant) 2025-05-07T19:48:00.9896285Z #define __INT32_MAX__ 0x7fffffff 2025-05-07T19:48:00.9896732Z #define _PSTL_CPP14_INTEGER_SEQUENCE_PRESENT (_MSC_VER >= 1900 || __cplusplus >= 201402L) 2025-05-07T19:48:00.9897277Z #define __glibcxx_digits_b(T,B) (B - __glibcxx_signed_b (T,B)) 2025-05-07T19:48:00.9897661Z #define __SIZEOF_PTHREAD_COND_T 48 2025-05-07T19:48:00.9897943Z #define __unix__ 1 2025-05-07T19:48:00.9898177Z #define __SM_60_ATOMIC_FUNCTIONS_H__ 2025-05-07T19:48:00.9898612Z #define __INT_WIDTH__ 32 2025-05-07T19:48:00.9898858Z #define __SIZEOF_LONG__ 8 2025-05-07T19:48:00.9899109Z #define _IONBF 2 2025-05-07T19:48:00.9899562Z #define __MATHCALLX(function,suffix,args,attrib) __MATHDECLX (_Mdouble_,function,suffix, args, attrib) 2025-05-07T19:48:00.9900388Z #define _IO_getc_unlocked(_fp) (_IO_BE ((_fp)->_IO_read_ptr >= (_fp)->_IO_read_end, 0) ? __uflow (_fp) : *(unsigned char *) (_fp)->_IO_read_ptr++) 2025-05-07T19:48:00.9900974Z #define __STDC_IEC_559__ 1 2025-05-07T19:48:00.9901231Z #define __STDC_ISO_10646__ 201103L 2025-05-07T19:48:00.9901515Z #define __UINT16_C(c) c 2025-05-07T19:48:00.9901750Z #define M_2_PI 0.63661977236758134308 2025-05-07T19:48:00.9902035Z #define STA_DEL 0x0020 2025-05-07T19:48:00.9902318Z #define __glibcxx_requires_valid_range(_First,_Last) 2025-05-07T19:48:00.9902670Z #define __CUDACC_VER_MINOR__ 8 2025-05-07T19:48:00.9902924Z #define __id_t_defined 2025-05-07T19:48:00.9903208Z #define w_retcode __wait_terminated.__w_retcode 2025-05-07T19:48:00.9903674Z #define _IO_PENDING_OUTPUT_COUNT(_fp) ((_fp)->_IO_write_ptr - (_fp)->_IO_write_base) 2025-05-07T19:48:00.9904343Z #define _GLIBCXX_HAVE_MODFF 1 2025-05-07T19:48:00.9904608Z #define _GLIBCXX_HAVE_MODFL 1 2025-05-07T19:48:00.9904854Z #define __DECIMAL_DIG__ 21 2025-05-07T19:48:00.9905110Z #define _POSIX2_RE_DUP_MAX 255 2025-05-07T19:48:00.9905436Z #define __USE_FORTIFY_LEVEL 0 2025-05-07T19:48:00.9905702Z #define __STDC_IEC_559_COMPLEX__ 1 2025-05-07T19:48:00.9905974Z #define __SIZEOF_PTHREAD_BARRIER_T 32 2025-05-07T19:48:00.9906250Z #define SING 2 2025-05-07T19:48:00.9906457Z #define STA_FREQHOLD 0x0080 2025-05-07T19:48:00.9906724Z #define cudaStreamDefault 0x00 2025-05-07T19:48:00.9907056Z #define __FLT64_EPSILON__ 2.22044604925031308084726333618164062e-16F64 2025-05-07T19:48:00.9907432Z #define _GLIBCXX_HAVE_HYPOTL 1 2025-05-07T19:48:00.9907701Z #define _GLIBCXX_HAVE_SYS_UIO_H 1 2025-05-07T19:48:00.9907957Z #define __gnu_linux__ 1 2025-05-07T19:48:00.9908198Z #define __INT16_MAX__ 0x7fff 2025-05-07T19:48:00.9908443Z #define _LARGEFILE_SOURCE 1 2025-05-07T19:48:00.9908696Z #define MAX_INPUT 255 2025-05-07T19:48:00.9908910Z #define __need_clock_t 2025-05-07T19:48:00.9909154Z #define __FLT64_MIN_EXP__ (-1021) 2025-05-07T19:48:00.9909401Z #define SEEK_DATA 3 2025-05-07T19:48:00.9909650Z #define __glibcxx_requires_heap(_First,_Last) 2025-05-07T19:48:00.9909955Z #define _GLIBCXX_CPU_DEFINES 1 2025-05-07T19:48:00.9910226Z #define _GLIBCXX_HAVE_POLL_H 1 2025-05-07T19:48:00.9910618Z #define __attribute_warn_unused_result__ __attribute__ ((__warn_unused_result__)) 2025-05-07T19:48:00.9911093Z #define _IO_SHOWPOS 02000 2025-05-07T19:48:00.9911415Z #define _GLIBCXX_HAVE_SYMVER_SYMBOL_RENAMING_RUNTIME_SUPPORT 1 2025-05-07T19:48:00.9911949Z #define _Mfloat_ float 2025-05-07T19:48:00.9912226Z #define __glibcxx_requires_cond(_Cond,_Msg) 2025-05-07T19:48:00.9912539Z #define __FLT64X_MIN_10_EXP__ (-4931) 2025-05-07T19:48:00.9912844Z #define DELAYTIMER_MAX 2147483647 2025-05-07T19:48:00.9913350Z #define __glibcxx_max_b(T,B) (__glibcxx_signed_b (T,B) ? (((((T)1 << (__glibcxx_digits_b (T,B) - 1)) - 1) << 1) + 1) : ~(T)0) 2025-05-07T19:48:00.9913878Z #define __LDBL_HAS_QUIET_NAN__ 1 2025-05-07T19:48:00.9914166Z #define _GLIBCXX98_USE_C99_STDIO 1 2025-05-07T19:48:00.9914491Z #define cudaKernelNodeAttrID cudaLaunchAttributeID 2025-05-07T19:48:00.9914878Z #define __glibcxx_class_requires2(_a,_b,_c) 2025-05-07T19:48:00.9915175Z #define __USE_ISOC11 1 2025-05-07T19:48:00.9915418Z #define _BSD_SIZE_T_ 2025-05-07T19:48:00.9915648Z #define ADJ_MICRO 0x1000 2025-05-07T19:48:00.9915914Z #define _GLIBCXX_HAVE_FABSF 1 2025-05-07T19:48:00.9916181Z #define _GLIBCXX_HAVE_FABSL 1 2025-05-07T19:48:00.9916499Z #define _PSTL_PRAGMA_SIMD _PSTL_PRAGMA(omp simd) 2025-05-07T19:48:00.9916838Z #define __FLT64_MANT_DIG__ 53 2025-05-07T19:48:00.9917148Z #define __attribute_const__ __attribute__ ((__const__)) 2025-05-07T19:48:00.9917496Z #define __THROW throw () 2025-05-07T19:48:00.9917745Z #define __cudaGet_gridDim() gridDim 2025-05-07T19:48:00.9918125Z #define __SM_60_ATOMIC_FUNCTIONS_HPP__ 2025-05-07T19:48:00.9918482Z #define __glibcxx_requires_heap_pred(_First,_Last,_Pred) 2025-05-07T19:48:00.9918854Z #define _GLIBCXX_HAVE_POWF 1 2025-05-07T19:48:00.9919121Z #define _GLIBCXX_HAVE_POWL 1 2025-05-07T19:48:00.9919399Z #define __FLT64X_MANT_DIG__ 64 2025-05-07T19:48:00.9919671Z #define __GLIBC_HAVE_LONG_LONG 1 2025-05-07T19:48:00.9919954Z #define L_tmpnam 20 2025-05-07T19:48:00.9920194Z #define ___int_wchar_t_h 2025-05-07T19:48:00.9920540Z #define WIFCONTINUED(status) __WIFCONTINUED (__WAIT_INT (status)) 2025-05-07T19:48:00.9920937Z #define _T_PTRDIFF 2025-05-07T19:48:00.9921153Z #define __GNUC__ 11 2025-05-07T19:48:00.9921416Z #define __SYSCALL_ULONG_TYPE __ULONGWORD_TYPE 2025-05-07T19:48:00.9921720Z #define __GXX_RTTI 1 2025-05-07T19:48:00.9921954Z #define __pie__ 2 2025-05-07T19:48:00.9922161Z #define __MMX__ 1 2025-05-07T19:48:00.9922396Z #define __timespec_defined 1 2025-05-07T19:48:00.9922645Z #define L_ctermid 9 2025-05-07T19:48:00.9922889Z #define __OFF64_T_TYPE __SQUAD_TYPE 2025-05-07T19:48:00.9923270Z #define offsetof(TYPE,MEMBER) __builtin_offsetof (TYPE, MEMBER) 2025-05-07T19:48:00.9923760Z #define _BITS_POSIX2_LIM_H 1 2025-05-07T19:48:00.9924020Z #define _GLIBCXX98_USE_C99_STDLIB 1 2025-05-07T19:48:00.9924291Z #define cudaMemAttachGlobal 0x01 2025-05-07T19:48:00.9924647Z #define FD_SET(fd,fdsetp) __FD_SET (fd, fdsetp) 2025-05-07T19:48:00.9924943Z #define __FLT_HAS_DENORM__ 1 2025-05-07T19:48:00.9925205Z #define __SIZEOF_LONG_DOUBLE__ 16 2025-05-07T19:48:00.9925619Z #define _GLIBCXX_NATIVE_THREAD_ID (__gthread_active_p() ? __gthread_self() : (__gthread_t)1) 2025-05-07T19:48:00.9926373Z #define assert_perror(errnum) (!(errnum) ? __ASSERT_VOID_CAST (0) : __assert_perror_fail ((errnum), __FILE__, __LINE__, __ASSERT_FUNCTION)) 2025-05-07T19:48:00.9926954Z #define _GLIBCXX_HAVE_LOGF 1 2025-05-07T19:48:00.9927225Z #define _IO_HAVE_ST_BLKSIZE _G_HAVE_ST_BLKSIZE 2025-05-07T19:48:00.9927521Z #define __USE_SVID 1 2025-05-07T19:48:00.9927756Z #define __constant__ __location__(constant) 2025-05-07T19:48:00.9928063Z #define _GLIBCXX_HAVE_POSIX_MEMALIGN 1 2025-05-07T19:48:00.9928340Z #define __device__ __location__(device) 2025-05-07T19:48:00.9928626Z #define _GLIBCXX_RES_LIMITS 1 2025-05-07T19:48:00.9928870Z #define M_1_PI 0.31830988618379067154 2025-05-07T19:48:00.9929146Z #define CUDART_DEVICE __device__ 2025-05-07T19:48:00.9929447Z #define M_PI_4l 0.785398163397448309615660845819875721L 2025-05-07T19:48:00.9929826Z #define __LDBL_REDIR1_NTH(name,proto,alias) name proto __THROW 2025-05-07T19:48:00.9930184Z #define M_PI_2 1.57079632679489661923 2025-05-07T19:48:00.9930450Z #define __BIGGEST_ALIGNMENT__ 16 2025-05-07T19:48:00.9930804Z #define cudaExternalSemaphoreWaitSkipNvSciBufMemSync 0x02 2025-05-07T19:48:00.9931154Z #define __STDC_UTF_16__ 1 2025-05-07T19:48:00.9931398Z #define LONG_MAX __LONG_MAX__ 2025-05-07T19:48:00.9931744Z #define __glibcxx_digits10_b(T,B) (__glibcxx_digits_b (T,B) * 643L / 2136) 2025-05-07T19:48:00.9932173Z #define _POSIX_THREAD_DESTRUCTOR_ITERATIONS 4 2025-05-07T19:48:00.9932490Z #define _POSIX_HOST_NAME_MAX 255 2025-05-07T19:48:00.9932743Z #define __FLT64_MAX_10_EXP__ 308 2025-05-07T19:48:00.9933005Z #define NGROUPS_MAX 65536 2025-05-07T19:48:00.9933231Z #define __USE_ISOC95 1 2025-05-07T19:48:00.9933453Z #define _TIME_H 1 2025-05-07T19:48:00.9933705Z #define M_LOG10El 0.434294481903251827651128918916605082L 2025-05-07T19:48:00.9934024Z #define __USE_ISOC99 1 2025-05-07T19:48:00.9934324Z #define __ASMNAME(cname) __ASMNAME2 (__USER_LABEL_PREFIX__, cname) 2025-05-07T19:48:00.9934688Z #define HOST_NAME_MAX 64 2025-05-07T19:48:00.9934923Z #define _POSIX_SEM_NSEMS_MAX 256 2025-05-07T19:48:00.9935183Z #define _IOS_ATEND 4 2025-05-07T19:48:00.9935421Z #define __U64_TYPE unsigned long int 2025-05-07T19:48:00.9935688Z #define __SM_35_INTRINSICS_H__ 2025-05-07T19:48:00.9936011Z #define WTERMSIG(status) __WTERMSIG (__WAIT_INT (status)) 2025-05-07T19:48:00.9936396Z #define cudaStreamAttrValue cudaLaunchAttributeValue 2025-05-07T19:48:00.9936802Z #define _GLIBCXX_HAVE_S_ISREG 1 2025-05-07T19:48:00.9937069Z #define cudaSurfaceTypeCubemap 0x0C 2025-05-07T19:48:00.9937356Z #define _IO_uid_t __uid_t 2025-05-07T19:48:00.9937616Z #define __cpp_delegating_constructors 200604L 2025-05-07T19:48:00.9937936Z #define __FLT32_HAS_INFINITY__ 1 2025-05-07T19:48:00.9938197Z #define _STDIO_H 1 2025-05-07T19:48:00.9938423Z #define _GLIBCXX_PREDEFINED_OPS_H 1 2025-05-07T19:48:00.9938782Z #define __DBL_MAX__ double(1.79769313486231570814527423731704357e+308L) 2025-05-07T19:48:00.9939145Z #define _G_IO_IO_FILE_VERSION 0x20001 2025-05-07T19:48:00.9939439Z #define _POSIX_SIGQUEUE_MAX 32 2025-05-07T19:48:00.9939691Z #define _GLIBCXX_HAVE_GETS 1 2025-05-07T19:48:00.9939962Z #define _GLIBCXX_HAVE_LINUX_TYPES_H 1 2025-05-07T19:48:00.9940238Z #define __cpp_raw_strings 200710L 2025-05-07T19:48:00.9940536Z #define __INT_FAST32_MAX__ 0x7fffffffffffffffL 2025-05-07T19:48:00.9940834Z #define __DBL_HAS_INFINITY__ 1 2025-05-07T19:48:00.9941119Z #define __STDCPP_MATH_SPEC_FUNCS__ 201003L 2025-05-07T19:48:00.9941424Z #define _GLIBCXX_STDIO_EOF -1 2025-05-07T19:48:00.9941674Z #define __SIZEOF_PTHREAD_MUTEX_T 40 2025-05-07T19:48:00.9941964Z #define __CHANNEL_DESCRIPTOR_H__ 2025-05-07T19:48:00.9942223Z #define _POSIX_NGROUPS_MAX 8 2025-05-07T19:48:00.9942602Z #define __intN_t(N,MODE) typedef int int ##N ##_t __attribute__ ((__mode__ (MODE))) 2025-05-07T19:48:00.9943059Z #define __SIZEOF_FLOAT__ 4 2025-05-07T19:48:00.9943300Z #define __USE_XOPEN 1 2025-05-07T19:48:00.9943514Z #define __USE_XOPEN2K 1 2025-05-07T19:48:00.9943750Z #define _PSTL_UDR_PRESENT 1 2025-05-07T19:48:00.9943998Z #define __HAVE_SPECULATION_SAFE_VALUE 1 2025-05-07T19:48:00.9944288Z #define _GLIBCXX_HAVE_COSF 1 2025-05-07T19:48:00.9944555Z #define __cpp_fold_expressions 201603L 2025-05-07T19:48:00.9945047Z #define cudaWaitExternalSemaphoresAsync __CUDART_API_PTSZ(cudaWaitExternalSemaphoresAsync_v2) 2025-05-07T19:48:00.9945560Z #define NL_LANGMAX _POSIX2_LINE_MAX 2025-05-07T19:48:00.9945825Z #define __DEC32_MIN_EXP__ (-94) 2025-05-07T19:48:00.9946177Z #define __glibcxx_requires_partitioned_upper(_First,_Last,_Value) 2025-05-07T19:48:00.9946547Z #define __DADDR_T_TYPE __S32_TYPE 2025-05-07T19:48:00.9946910Z #define cudaExternalSemaphoreSignalSkipNvSciBufMemSync 0x01 2025-05-07T19:48:00.9947276Z #define __END_NAMESPACE_C99 2025-05-07T19:48:00.9947552Z #define __glibcxx_integral_traps true 2025-05-07T19:48:00.9947840Z #define _POSIX_PATH_MAX 256 2025-05-07T19:48:00.9948079Z #define __INTPTR_WIDTH__ 64 2025-05-07T19:48:00.9948330Z #define __FLT64X_HAS_INFINITY__ 1 2025-05-07T19:48:00.9948578Z #define _ISOC11_SOURCE 1 2025-05-07T19:48:00.9948821Z #define _GLIBCXX_HAVE_COSL 1 2025-05-07T19:48:00.9949066Z #define _GLIBCXX_HAVE_LINUX_FUTEX 1 2025-05-07T19:48:00.9949351Z #define __UINT_LEAST32_MAX__ 0xffffffffU 2025-05-07T19:48:00.9949632Z #define _GLIBCXX_HAVE_QUICK_EXIT 1 2025-05-07T19:48:00.9949989Z #define __glibcxx_requires_irreflexive_pred2(_First,_Last,_Pred) 2025-05-07T19:48:00.9950365Z #define LONG_MIN (-LONG_MAX - 1L) 2025-05-07T19:48:00.9950628Z #define _GLIBCXX_HAVE_SINCOSF 1 2025-05-07T19:48:00.9950889Z #define _IO_UNITBUF 020000 2025-05-07T19:48:00.9951199Z #define _GLIBCXX_HAVE_SINCOSL 1 2025-05-07T19:48:00.9951628Z #define __FD_SETSIZE 1024 2025-05-07T19:48:00.9951883Z #define getc(_fp) _IO_getc (_fp) 2025-05-07T19:48:00.9952242Z #define be32toh(x) __bswap_32 (x) 2025-05-07T19:48:00.9952593Z #define _GLIBCXX_PACKAGE__GLIBCXX_VERSION "version-unused" 2025-05-07T19:48:00.9952975Z #define __FLT32X_HAS_DENORM__ 1 2025-05-07T19:48:00.9953242Z #define __INT_FAST16_TYPE__ long int 2025-05-07T19:48:00.9953544Z #define _GLIBCXX_HAVE_GETIPINFO 1 2025-05-07T19:48:00.9953828Z #define __MMX_WITH_SSE__ 1 2025-05-07T19:48:00.9954073Z #define _WCHAR_T_DEFINED_ 2025-05-07T19:48:00.9954369Z #define cudaIpcMemLazyEnablePeerAccess 0x01 2025-05-07T19:48:00.9954695Z #define _GLIBCXX_HAVE_AT_QUICK_EXIT 1 2025-05-07T19:48:00.9955000Z #define __INO_T_MATCHES_INO64_T 1 2025-05-07T19:48:00.9955270Z #define __USE_POSIX199506 1 2025-05-07T19:48:00.9955599Z #define _FEATURES_H 1 2025-05-07T19:48:00.9955836Z #define __LDBL_HAS_DENORM__ 1 2025-05-07T19:48:00.9956258Z #define _PSTL_PRAGMA_SIMD_REDUCTION(PRM) _PSTL_PRAGMA(omp simd reduction(PRM)) 2025-05-07T19:48:00.9956688Z #define __stub_getmsg 2025-05-07T19:48:00.9956939Z #define _IO_FIXED 010000 2025-05-07T19:48:00.9957231Z #define __cpp_lib_addressof_constexpr 201603 2025-05-07T19:48:00.9957546Z #define _GLIBCXX11_USE_C99_STDIO 1 2025-05-07T19:48:00.9957833Z #define __stub_setlogin 2025-05-07T19:48:00.9958070Z #define __stub_fattach 2025-05-07T19:48:00.9958321Z #define __cplusplus 201703L 2025-05-07T19:48:00.9958584Z #define __cpp_ref_qualifiers 200710L 2025-05-07T19:48:00.9958878Z #define _STRUCT_TIMEVAL 1 2025-05-07T19:48:00.9959133Z #define INFINITY (__builtin_inff()) 2025-05-07T19:48:00.9959426Z #define _IO_UNBUFFERED 2 2025-05-07T19:48:00.9959911Z #define cudaStreamAttributeSynchronizationPolicy cudaLaunchAttributeSynchronizationPolicy 2025-05-07T19:48:00.9960463Z #define _IO_INTERNAL 010 2025-05-07T19:48:00.9960723Z #define __DEC32_MIN__ 1E-95DF 2025-05-07T19:48:00.9960987Z #define _GLIBCXX_HAVE_SYS_IPC_H 1 2025-05-07T19:48:00.9961348Z #define cudaKernelNodeAttrValue cudaLaunchAttributeValue 2025-05-07T19:48:00.9961705Z #define __dev_t_defined 2025-05-07T19:48:00.9961956Z #define __DEPRECATED 1 2025-05-07T19:48:00.9962264Z #define __S32_TYPE int 2025-05-07T19:48:00.9962529Z #define __cpp_rvalue_references 200610L 2025-05-07T19:48:00.9962832Z #define __DBL_MAX_EXP__ 1024 2025-05-07T19:48:00.9963112Z #define _IO_fpos_t _G_fpos_t 2025-05-07T19:48:00.9963387Z #define __WCHAR_WIDTH__ 32 2025-05-07T19:48:00.9963737Z #define _G_HAVE_MREMAP 1 2025-05-07T19:48:00.9964041Z #define __FLT32_MAX__ 3.40282346638528859811704183484516925e+38F32 2025-05-07T19:48:00.9964366Z #define OVERFLOW 3 2025-05-07T19:48:00.9964597Z #define _GLIBCXX_HAVE_STRERROR_L 1 2025-05-07T19:48:00.9964863Z #define _GLIBCXX20_DEPRECATED(MSG) 2025-05-07T19:48:00.9965146Z #define __DEC128_EPSILON__ 1E-33DL 2025-05-07T19:48:00.9965412Z #define __SM_32_ATOMIC_FUNCTIONS_H__ 2025-05-07T19:48:00.9965752Z #define _GLIBCXX_DEFAULT_ABI_TAG _GLIBCXX_ABI_TAG_CXX11 2025-05-07T19:48:00.9966065Z #define __SSE2_MATH__ 1 2025-05-07T19:48:00.9966315Z #define __ATOMIC_HLE_RELEASE 131072 2025-05-07T19:48:00.9966620Z #define __FSFILCNT_T_TYPE __SYSCALL_ULONG_TYPE 2025-05-07T19:48:00.9966923Z #define __DRIVER_FUNCTIONS_H__ 2025-05-07T19:48:00.9967198Z #define PDP_ENDIAN __PDP_ENDIAN 2025-05-07T19:48:00.9967471Z #define __PTRDIFF_MAX__ 0x7fffffffffffffffL 2025-05-07T19:48:00.9967785Z #define _GLIBCXX_HAVE_STRERROR_R 1 2025-05-07T19:48:00.9968038Z #define __amd64 1 2025-05-07T19:48:00.9968264Z #define _POSIX_TZNAME_MAX 6 2025-05-07T19:48:00.9968502Z #define __need_timespec 2025-05-07T19:48:00.9968755Z #define __SYSCALL_WORDSIZE 64 2025-05-07T19:48:00.9969006Z #define _EXT_TYPE_TRAITS 1 2025-05-07T19:48:00.9969275Z #define _GLIBCXX_HAVE_POSIX_SEMAPHORE 1 2025-05-07T19:48:00.9969574Z #define _POSIX_RE_DUP_MAX 255 2025-05-07T19:48:00.9969821Z #define __STDC_NO_THREADS__ 1 2025-05-07T19:48:00.9970076Z #define __bounded 2025-05-07T19:48:00.9970294Z #define __USECONDS_T_TYPE __U32_TYPE 2025-05-07T19:48:00.9970582Z #define _IO_DELETE_DONT_CLOSE 0x40 2025-05-07T19:48:00.9970844Z #define __BEGIN_NAMESPACE_STD 2025-05-07T19:48:00.9971109Z #define _PTRDIFF_T_DECLARED 2025-05-07T19:48:00.9971370Z #define __OFF_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:48:00.9971793Z #define __WIFSIGNALED(status) (((signed char) (((status) & 0x7f) + 1) >> 1) > 0) 2025-05-07T19:48:00.9972192Z #define _GLIBCXX_HAVE_NETDB_H 1 2025-05-07T19:48:00.9972456Z #define __SM_20_INTRINSICS_HPP__ 2025-05-07T19:48:00.9972784Z #define __cpp_lib_has_unique_object_representations 201606 2025-05-07T19:48:00.9973110Z #define STA_PLL 0x0001 2025-05-07T19:48:00.9973349Z #define __ATOMIC_HLE_ACQUIRE 65536 2025-05-07T19:48:00.9973593Z #define __GNUG__ 11 2025-05-07T19:48:00.9973816Z #define _GLIBCXX_USE_GET_NPROCS 1 2025-05-07T19:48:00.9974059Z #define _T_WCHAR 2025-05-07T19:48:00.9974284Z #define __specialization_static 2025-05-07T19:48:00.9974593Z #define __key_t_defined 2025-05-07T19:48:00.9974860Z #define __LONG_LONG_MAX__ 0x7fffffffffffffffLL 2025-05-07T19:48:00.9975166Z #define __SIZEOF_SIZE_T__ 8 2025-05-07T19:48:00.9975406Z #define cudaArraySparse 0x40 2025-05-07T19:48:00.9975666Z #define STA_PPSFREQ 0x0002 2025-05-07T19:48:00.9975901Z #define __GLIBCXX__ 20230528 2025-05-07T19:48:00.9976181Z #define _IO_stdin ((_IO_FILE*)(&_IO_2_1_stdin_)) 2025-05-07T19:48:00.9976466Z #define _WCHAR_T 2025-05-07T19:48:00.9976704Z #define __FDS_BITS(set) ((set)->fds_bits) 2025-05-07T19:48:00.9977385Z #define __FD_ZERO(fdsp) do { int __d0, __d1; __asm__ __volatile__ ("cld; rep; " __FD_ZERO_STOS : "=c" (__d0), "=D" (__d1) : "a" (0), "0" (sizeof (fd_set) / sizeof (__fd_mask)), "1" (&__FDS_BITS (fdsp)[0]) : "memory"); } while (0) 2025-05-07T19:48:00.9978076Z #define __cpp_nsdmi 200809L 2025-05-07T19:48:00.9978484Z #define __glibcxx_min_b(T,B) (__glibcxx_signed_b (T,B) ? -__glibcxx_max_b (T,B) - 1 : (T)0) 2025-05-07T19:48:00.9978910Z #define __FLT64X_MIN_EXP__ (-16381) 2025-05-07T19:48:00.9979186Z #define __SIZEOF_WINT_T__ 4 2025-05-07T19:48:00.9979428Z #define cudaArrayCubemap 0x04 2025-05-07T19:48:00.9979753Z #define _PSTL_MONOTONIC_PRESENT (__INTEL_COMPILER >= 1800) 2025-05-07T19:48:00.9980085Z #define _GLIBCXX_UTILITY 1 2025-05-07T19:48:00.9980381Z #define __NO_CTYPE 1 2025-05-07T19:48:00.9980594Z #define __stub_bdflush 2025-05-07T19:48:00.9980956Z #define _GLIBCXX_MAKE_MOVE_ITERATOR(_Iter) std::make_move_iterator(_Iter) 2025-05-07T19:48:00.9981377Z #define __CORRECT_ISO_CPP_STRING_H_PROTO 2025-05-07T19:48:00.9981665Z #define _GLIBCXX_STDC_HEADERS 1 2025-05-07T19:48:00.9981933Z #define __LONG_LONG_WIDTH__ 64 2025-05-07T19:48:00.9982190Z #define __cpp_initializer_lists 200806L 2025-05-07T19:48:00.9982489Z #define _GLIBCXX_HAVE_NETINET_TCP_H 1 2025-05-07T19:48:00.9982764Z #define __U16_TYPE unsigned short int 2025-05-07T19:48:00.9983094Z #define __glibcxx_requires_can_increment(_First,_Size) 2025-05-07T19:48:00.9983422Z #define _GLIBCXX_HAVE_SYS_PARAM_H 1 2025-05-07T19:48:00.9983697Z #define __FLT32_MAX_EXP__ 128 2025-05-07T19:48:00.9983957Z #define cudaHostRegisterIoMemory 0x04 2025-05-07T19:48:00.9984287Z #define __FD_MASK(d) ((__fd_mask) 1 << ((d) % __NFDBITS)) 2025-05-07T19:48:00.9984622Z #define __cpp_lib_is_invocable 201703 2025-05-07T19:48:00.9984888Z #define _IO_STDIO 040000 2025-05-07T19:48:00.9985202Z #define _SIGSET_NWORDS (1024 / (8 * sizeof (unsigned long int))) 2025-05-07T19:48:00.9985567Z #define cudaSurfaceType1DLayered 0xF1 2025-05-07T19:48:00.9985874Z #define cudaArraySurfaceLoadStore 0x02 2025-05-07T19:48:00.9986144Z #define _PTRDIFF_T 2025-05-07T19:48:00.9986358Z #define _MOVE_H 1 2025-05-07T19:48:00.9986564Z #define __cpp_hex_float 201603L 2025-05-07T19:48:00.9986821Z #define ADJ_TAI 0x0080 2025-05-07T19:48:00.9987045Z #define __ptrvalue 2025-05-07T19:48:00.9987250Z #define _GLIBCXX_HOSTED 1 2025-05-07T19:48:00.9987497Z #define __GXX_ABI_VERSION 1016 2025-05-07T19:48:00.9987760Z #define __WTERMSIG(status) ((status) & 0x7f) 2025-05-07T19:48:00.9988058Z #define MATH_ERREXCEPT 2 2025-05-07T19:48:00.9988321Z #define __glibcxx_class_requires4(_a,_b,_c,_d,_e) 2025-05-07T19:48:00.9988653Z #define cudaTextureType2DLayered 0xF2 2025-05-07T19:48:00.9989022Z #define __isleap(year) ((year) % 4 == 0 && ((year) % 100 != 0 || (year) % 400 == 0)) 2025-05-07T19:48:00.9989404Z #define __USE_GNU 1 2025-05-07T19:48:00.9989615Z #define __FLT128_HAS_INFINITY__ 1 2025-05-07T19:48:00.9989887Z #define __FLT_MIN_EXP__ (-125) 2025-05-07T19:48:00.9990150Z #define __GCC_HAVE_DWARF2_CFI_ASM 1 2025-05-07T19:48:00.9990652Z #define __FD_CLR(d,set) ((void) (__FDS_BITS (set)[__FD_ELT (d)] &= ~__FD_MASK (d))) 2025-05-07T19:48:00.9991277Z #define WEXITED 4 2025-05-07T19:48:00.9991494Z #define _IO_NO_READS 4 2025-05-07T19:48:00.9991857Z #define M_LOG2E 1.4426950408889634074 2025-05-07T19:48:00.9992242Z #define _GLIBCXX_HAVE_BUILTIN_HAS_UNIQ_OBJ_REP 1 2025-05-07T19:48:00.9992576Z #define __uid_t_defined 2025-05-07T19:48:00.9992931Z #define _GLIBCXX_USE_SCHED_YIELD 1 2025-05-07T19:48:00.9993239Z #define _GLIBCXX_USE_STD_SPEC_FUNCS 1 2025-05-07T19:48:00.9993530Z #define WNOHANG 1 2025-05-07T19:48:00.9993775Z #define alloca(size) __builtin_alloca (size) 2025-05-07T19:48:00.9994102Z #define _GLIBCXX_HAVE_HYPOTF 1 2025-05-07T19:48:00.9994374Z #define cudaEventDefault 0x00 2025-05-07T19:48:00.9994662Z #define NL_SETMAX INT_MAX 2025-05-07T19:48:00.9994898Z #define __x86_64 1 2025-05-07T19:48:00.9995139Z #define NAN (__builtin_nanf ("")) 2025-05-07T19:48:00.9995538Z #define __REDIRECT(name,proto,alias) name proto __asm__ (__ASMNAME (#alias)) 2025-05-07T19:48:00.9996048Z #define _GLIBCXX_BEGIN_NAMESPACE_CXX11 namespace __cxx11 { 2025-05-07T19:48:00.9996567Z #define __extern_always_inline extern __always_inline __attribute__ ((__gnu_inline__)) 2025-05-07T19:48:00.9997034Z #define __PTRDIFF_T 2025-05-07T19:48:00.9997275Z #define _ASSERT_H 1 2025-05-07T19:48:00.9997509Z #define _GLIBCXX_HAVE_FINITEL 1 2025-05-07T19:48:00.9997806Z #define __SM_35_ATOMIC_FUNCTIONS_H__ 2025-05-07T19:48:00.9998106Z #define _Mlong_double_ long double 2025-05-07T19:48:00.9998405Z #define __cpp_lambdas 200907L 2025-05-07T19:48:00.9998662Z #define _IO_DEC 020 2025-05-07T19:48:00.9998910Z #define STA_PPSSIGNAL 0x0100 2025-05-07T19:48:00.9999181Z #define _POSIX_CLOCKRES_MIN 20000000 2025-05-07T19:48:00.9999561Z #define __INT_FAST64_TYPE__ long int 2025-05-07T19:48:00.9999847Z #define ADJ_TIMECONST 0x0020 2025-05-07T19:48:01.0000130Z #define _GLIBCXX_HAVE_SQRTL 1 2025-05-07T19:48:01.0000417Z #define _GLIBCXX_HAVE_STDALIGN_H 1 2025-05-07T19:48:01.0000694Z #define _ANSI_STDDEF_H 2025-05-07T19:48:01.0000985Z #define _GLIBCXX_MOVE(__val) std::move(__val) 2025-05-07T19:48:01.0001396Z #define __FLT64_DENORM_MIN__ 4.94065645841246544176568792868221372e-324F64 2025-05-07T19:48:01.0001812Z #define _GLIBCXX_USE_DEV_RANDOM 1 2025-05-07T19:48:01.0002098Z #define _STL_ITERATOR_BASE_TYPES_H 1 2025-05-07T19:48:01.0002407Z #define __cpp_template_auto 201606L 2025-05-07T19:48:01.0002778Z #define __DBL_MIN__ double(2.22507385850720138309023271733240406e-308L) 2025-05-07T19:48:01.0003170Z #define _GLIBCXX_HAVE_SYS_SEM_H 1 2025-05-07T19:48:01.0003575Z #define _IO_MAGIC_MASK 0xFFFF0000 2025-05-07T19:48:01.0003922Z #define __cluster_dims__(...) __attribute__((cluster_dims(__VA_ARGS__))) 2025-05-07T19:48:01.0004386Z #define __FLT128_EPSILON__ 1.92592994438723585305597794258492732e-34F128 2025-05-07T19:48:01.0004741Z #define __GNUC_VA_LIST 2025-05-07T19:48:01.0005069Z #define __FLT64X_NORM_MAX__ 1.18973149535723176502126385303097021e+4932F64x 2025-05-07T19:48:01.0005436Z #define __SIZEOF_POINTER__ 8 2025-05-07T19:48:01.0022864Z #define CLOCK_REALTIME_COARSE 5 2025-05-07T19:48:01.0023159Z #define _GLIBCXX14_CONSTEXPR constexpr 2025-05-07T19:48:01.0023470Z #define __USE_XOPEN2KXSI 1 2025-05-07T19:48:01.0023716Z #define __WCOREFLAG 0x80 2025-05-07T19:48:01.0023992Z #define M_2_SQRTPI 1.12837916709551257390 2025-05-07T19:48:01.0024284Z #define cudaEventDisableTiming 0x02 2025-05-07T19:48:01.0024601Z #define htobe32(x) __bswap_32 (x) 2025-05-07T19:48:01.0024892Z #define __LP64__ 1 2025-05-07T19:48:01.0025135Z #define cudaStreamNonBlocking 0x01 2025-05-07T19:48:01.0025436Z #define _IO_off64_t __off64_t 2025-05-07T19:48:01.0025682Z #define __DBL_HAS_QUIET_NAN__ 1 2025-05-07T19:48:01.0025943Z #define __time_t_defined 1 2025-05-07T19:48:01.0026189Z #define _POSIX_SYMLOOP_MAX 8 2025-05-07T19:48:01.0026569Z #define __FLT32X_EPSILON__ 2.22044604925031308084726333618164062e-16F32x 2025-05-07T19:48:01.0026929Z #define __USE_UNIX98 1 2025-05-07T19:48:01.0027174Z #define _GLIBCXX_HAVE_STRINGS_H 1 2025-05-07T19:48:01.0027471Z #define __LEAF_ATTR __attribute__ ((__leaf__)) 2025-05-07T19:48:01.0027768Z #define __DECIMAL_BID_FORMAT__ 1 2025-05-07T19:48:01.0028030Z #define SEEK_CUR 1 2025-05-07T19:48:01.0028249Z #define __RLIM64_T_TYPE __UQUAD_TYPE 2025-05-07T19:48:01.0028869Z #define _PSTL_PRAGMA_DECLARE_REDUCTION(NAME,OP) _PSTL_PRAGMA(omp declare reduction(NAME:OP : omp_out(omp_in)) initializer(omp_priv = omp_orig)) 2025-05-07T19:48:01.0029601Z #define _GLIBCXX_USE_DEPRECATED 1 2025-05-07T19:48:01.0029880Z #define CHAR_MAX SCHAR_MAX 2025-05-07T19:48:01.0030127Z #define _GLIBCXX_HAVE_INT64_T_LONG 1 2025-05-07T19:48:01.0030408Z #define _GLIBCXX_HAVE_SETENV 1 2025-05-07T19:48:01.0030673Z #define NL_ARGMAX _POSIX_ARG_MAX 2025-05-07T19:48:01.0030927Z #define _GLIBCXX_USE_UTIMENSAT 1 2025-05-07T19:48:01.0031398Z #define __extern_inline extern __inline __attribute__ ((__gnu_inline__)) 2025-05-07T19:48:01.0031989Z #define _GLIBCXX_DEBUG_ONLY(_Statement) 2025-05-07T19:48:01.0032757Z #define _IO_putc_unlocked(_ch,_fp) (_IO_BE ((_fp)->_IO_write_ptr >= (_fp)->_IO_write_end, 0) ? __overflow (_fp, (unsigned char) (_ch)) : (unsigned char) (*(_fp)->_IO_write_ptr++ = (_ch))) 2025-05-07T19:48:01.0033452Z #define _GLIBCXX_HAVE_BUILTIN_LAUNDER 1 2025-05-07T19:48:01.0033765Z #define _IO_BOOLALPHA 0200000 2025-05-07T19:48:01.0034123Z #define _PSTL_CPP17_EXECUTION_POLICIES_PRESENT (_MSC_VER >= 1912) 2025-05-07T19:48:01.0034530Z #define _GLIBCXX_PACKAGE_URL "" 2025-05-07T19:48:01.0034823Z #define __FLT64_MIN_10_EXP__ (-307) 2025-05-07T19:48:01.0035108Z #define cudaArrayDefault 0x00 2025-05-07T19:48:01.0035382Z #define TLOSS 5 2025-05-07T19:48:01.0035601Z #define __ssize_t_defined 2025-05-07T19:48:01.0035888Z #define __ULONGWORD_TYPE unsigned long int 2025-05-07T19:48:01.0036195Z #define __CUDACC_VER_BUILD__ 89 2025-05-07T19:48:01.0036550Z #define _GLIBCXX_HAVE_SYS_SOCKET_H 1 2025-05-07T19:48:01.0036858Z #define ULONG_MAX (LONG_MAX * 2UL + 1UL) 2025-05-07T19:48:01.0037224Z #define __SURFACE_FUNCTIONS_H__ 2025-05-07T19:48:01.0037551Z #define __FLT64X_DECIMAL_DIG__ 21 2025-05-07T19:48:01.0037946Z #define _GLIBCXX_NAMESPACE_LDBL_OR_CXX11 _GLIBCXX_NAMESPACE_CXX11 2025-05-07T19:48:01.0038411Z #define _POSIX_HIWAT _POSIX_PIPE_BUF 2025-05-07T19:48:01.0038732Z #define __DEC128_MIN__ 1E-6143DL 2025-05-07T19:48:01.0039069Z #define _GLIBCXX_ATOMIC_BUILTINS 1 2025-05-07T19:48:01.0039385Z #define cudaPeerAccessDefault 0x00 2025-05-07T19:48:01.0039711Z #define __REGISTER_PREFIX__ 2025-05-07T19:48:01.0039976Z #define __UINT16_MAX__ 0xffff 2025-05-07T19:48:01.0040326Z #define __glibcxx_requires_sorted_set(_First1,_Last1,_First2) 2025-05-07T19:48:01.0040696Z #define _IOS_NOREPLACE 64 2025-05-07T19:48:01.0040950Z #define __cdecl 2025-05-07T19:48:01.0041211Z #define cudaEventInterprocess 0x04 2025-05-07T19:48:01.0041569Z #define __WCOREDUMP(status) ((status) & __WCOREFLAG) 2025-05-07T19:48:01.0041992Z #define M_SQRT1_2l 0.707106781186547524400844362104849039L 2025-05-07T19:48:01.0042337Z #define LOGIN_NAME_MAX 256 2025-05-07T19:48:01.0042605Z #define _IO_TIED_PUT_GET 0x400 2025-05-07T19:48:01.0042876Z #define X_TLOSS 1.41484755040568800000e+16 2025-05-07T19:48:01.0043188Z #define CUDA_IPC_HANDLE_SIZE 64 2025-05-07T19:48:01.0043457Z #define __LDBL_HAS_INFINITY__ 1 2025-05-07T19:48:01.0043881Z #define __attribute_pure__ __attribute__ ((__pure__)) 2025-05-07T19:48:01.0044209Z #define __TEXTURE_TYPES_H__ 2025-05-07T19:48:01.0044448Z #define ADJ_NANO 0x2000 2025-05-07T19:48:01.0044752Z #define __FLT32_MIN__ 1.17549435082228750796873653722224568e-38F32 2025-05-07T19:48:01.0045098Z #define __UINT8_TYPE__ unsigned char 2025-05-07T19:48:01.0045385Z #define _GLIBCXX_HAVE_ISWBLANK 1 2025-05-07T19:48:01.0045631Z #define __FLT_DIG__ 6 2025-05-07T19:48:01.0045984Z #define __REDIRECT_LDBL(name,proto,alias) __REDIRECT (name, proto, alias) 2025-05-07T19:48:01.0046373Z #define __NO_INLINE__ 1 2025-05-07T19:48:01.0046676Z #define _PSTL_EARLYEXIT_PRESENT (__INTEL_COMPILER >= 1800) 2025-05-07T19:48:01.0047007Z #define ADJ_STATUS 0x0010 2025-05-07T19:48:01.0047262Z #define CLOCK_BOOTTIME_ALARM 9 2025-05-07T19:48:01.0047537Z #define LONG_LONG_MAX __LONG_LONG_MAX__ 2025-05-07T19:48:01.0047819Z #define _GLIBCXX_HAVE_OBSOLETE_ISNAN 1 2025-05-07T19:48:01.0048103Z #define __DEC_EVAL_METHOD__ 2 2025-05-07T19:48:01.0048355Z #define _GLIBCXX_HAVE_ALIGNED_ALLOC 1 2025-05-07T19:48:01.0048693Z #define __DEC128_MAX__ 9.999999999999999999999999999999999E6144DL 2025-05-07T19:48:01.0049024Z #define CHAR_MIN SCHAR_MIN 2025-05-07T19:48:01.0050279Z #define MAX_CANON 255 2025-05-07T19:48:01.0050506Z #define __FLT_MANT_DIG__ 24 2025-05-07T19:48:01.0050760Z #define __LDBL_DECIMAL_DIG__ 21 2025-05-07T19:48:01.0051040Z #define _POSIX_FD_SETSIZE _POSIX_OPEN_MAX 2025-05-07T19:48:01.0051325Z #define _GLIBCXX_HAVE_HYPOT 1 2025-05-07T19:48:01.0051617Z #define _GLIBCXX_TR1_MODIFIED_BESSEL_FUNC_TCC 1 2025-05-07T19:48:01.0051919Z #define __VERSION__ "11.4.0" 2025-05-07T19:48:01.0052169Z #define _GLIBCXX11_USE_C99_STDLIB 1 2025-05-07T19:48:01.0052431Z #define cudaHostRegisterMapped 0x02 2025-05-07T19:48:01.0052699Z #define _GLIBCXX_HAVE_INT64_T 1 2025-05-07T19:48:01.0052950Z #define _GLIBCXX_USE_CONSTEXPR constexpr 2025-05-07T19:48:01.0053239Z #define FD_ZERO(fdsetp) __FD_ZERO (fdsetp) 2025-05-07T19:48:01.0053516Z #define __UINT64_C(c) c ## UL 2025-05-07T19:48:01.0053774Z #define MOD_OFFSET ADJ_OFFSET 2025-05-07T19:48:01.0054016Z #define _SYS_TYPES_H 1 2025-05-07T19:48:01.0054237Z #define AIO_PRIO_DELTA_MAX 20 2025-05-07T19:48:01.0054498Z #define _GLIBCXX_HAVE_TANHF 1 2025-05-07T19:48:01.0054727Z #define _SYS_CDEFS_H 1 2025-05-07T19:48:01.0054969Z #define _GLIBCXX_HAVE_TANHL 1 2025-05-07T19:48:01.0055213Z #define __cpp_unicode_characters 201411L 2025-05-07T19:48:01.0055501Z #define _IO_ERR_SEEN 0x20 2025-05-07T19:48:01.0055725Z #define _GLIBCXX_USE_DECIMAL_FLOAT 1 2025-05-07T19:48:01.0056049Z #define FP_SUBNORMAL 3 2025-05-07T19:48:01.0056269Z #define cudaOccupancyDefault 0x00 2025-05-07T19:48:01.0056533Z #define _INITIALIZER_LIST 2025-05-07T19:48:01.0056772Z #define _STDC_PREDEF_H 1 2025-05-07T19:48:01.0056997Z #define __CUDA_RUNTIME_API_H__ 2025-05-07T19:48:01.0057270Z #define _GLIBCXX_PACKAGE_BUGREPORT "" 2025-05-07T19:48:01.0057536Z #define _GLIBCXX_HAVE_MODF 1 2025-05-07T19:48:01.0057785Z #define _IO_file_flags _flags 2025-05-07T19:48:01.0058022Z #define __USE_XOPEN2K8 1 2025-05-07T19:48:01.0058262Z #define _GLIBCXX_HAVE_ATANF 1 2025-05-07T19:48:01.0058485Z #define HUGE 3.40282347e+38F 2025-05-07T19:48:01.0058779Z #define __WIFSTOPPED(status) (((status) & 0xff) == 0x7f) 2025-05-07T19:48:01.0059092Z #define _GLIBCXX_USE_CXX11_ABI 1 2025-05-07T19:48:01.0059341Z #define _GLIBCXX_HAVE_SYMLINK 1 2025-05-07T19:48:01.0059580Z #define _BSD_SOURCE 1 2025-05-07T19:48:01.0059790Z #define _GLIBCXX_THROW(_EXC) 2025-05-07T19:48:01.0060624Z #define _GLIBCXX_HAS_NESTED_TYPE(_NTYPE) template> struct __has_ ##_NTYPE : false_type { }; template struct __has_ ##_NTYPE<_Tp, __void_t> : true_type { }; 2025-05-07T19:48:01.0061464Z #define __catch(X) catch(X) 2025-05-07T19:48:01.0061704Z #define __ULONG32_TYPE unsigned int 2025-05-07T19:48:01.0061958Z #define __INT_LEAST32_MAX__ 0x7fffffff 2025-05-07T19:48:01.0062235Z #define LINE_MAX _POSIX2_LINE_MAX 2025-05-07T19:48:01.0062475Z #define __TIMER_T_TYPE void * 2025-05-07T19:48:01.0062702Z #define __STRING(x) #x 2025-05-07T19:48:01.0062923Z #define __GCC_ATOMIC_INT_LOCK_FREE 2 2025-05-07T19:48:01.0063162Z #define _T_PTRDIFF_ 2025-05-07T19:48:01.0063387Z #define _GLIBCXX_USE_NOEXCEPT noexcept 2025-05-07T19:48:01.0063655Z #define cudaEventWaitExternal 0x01 2025-05-07T19:48:01.0063917Z #define __unbounded 2025-05-07T19:48:01.0064128Z #define __DEVICE_ATOMIC_FUNCTIONS_H__ 2025-05-07T19:48:01.0064392Z #define __FLT128_MAX_EXP__ 16384 2025-05-07T19:48:01.0064651Z #define __INO_T_TYPE __SYSCALL_ULONG_TYPE 2025-05-07T19:48:01.0064945Z #define be16toh(x) __bswap_16 (x) 2025-05-07T19:48:01.0065196Z #define __cpp_lib_is_final 201402L 2025-05-07T19:48:01.0065477Z #define _GLIBCXX_BEGIN_NAMESPACE_CONTAINER 2025-05-07T19:48:01.0065792Z #define LONG_LONG_MIN (-LONG_LONG_MAX - 1LL) 2025-05-07T19:48:01.0066536Z #define cudaDevicePropDontCare { {'\0'}, {{0}}, {'\0'}, 0, 0, 0, 0, 0, 0, 0, {0, 0, 0}, {0, 0, 0}, 0, 0, -1, -1, 0, 0, -1, 0, 0, 0, 0, 0, 0, 0, 0, {0, 0}, {0, 0}, {0, 0, 0}, {0, 0}, {0, 0, 0}, {0, 0, 0}, 0, {0, 0}, {0, 0, 0}, {0, 0}, 0, {0, 0}, {0, 0, 0}, {0, 0}, {0, 0, 0}, 0, {0, 0}, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, } 2025-05-07T19:48:01.0067194Z #define __MATH_DECLARE_LDOUBLE 1 2025-05-07T19:48:01.0067455Z #define __managed__ __location__(managed) 2025-05-07T19:48:01.0067750Z #define _POSIX2_EXPR_NEST_MAX 32 2025-05-07T19:48:01.0068135Z #define __GNUC_PREREQ(maj,min) ((__GNUC__ << 16) + __GNUC_MINOR__ >= ((maj) << 16) + (min)) 2025-05-07T19:48:01.0068548Z #define _POSIX_STREAM_MAX 8 2025-05-07T19:48:01.0068800Z #define __LIBRARY_TYPES_H__ 2025-05-07T19:48:01.0069151Z #define _GLIBCXX_END_NAMESPACE_LDBL_OR_CXX11 _GLIBCXX_END_NAMESPACE_CXX11 2025-05-07T19:48:01.0069542Z #define __FLT32_MANT_DIG__ 24 2025-05-07T19:48:01.0069778Z #define _SYS_SIZE_T_H 2025-05-07T19:48:01.0070054Z #define _PSTL_VERSION_MINOR ((_PSTL_VERSION % 1000) / 10) 2025-05-07T19:48:01.0070372Z #define _GLIBCXX_STDLIB_H 1 2025-05-07T19:48:01.0070610Z #define _CRTIMP 2025-05-07T19:48:01.0070823Z #define _GLIBCXX_CXX_CONFIG_H 1 2025-05-07T19:48:01.0071188Z #define __FLOAT_WORD_ORDER__ __ORDER_LITTLE_ENDIAN__ 2025-05-07T19:48:01.0071693Z #define STA_PPSJITTER 0x0200 2025-05-07T19:48:01.0072056Z #define _IO_feof_unlocked(__fp) (((__fp)->_flags & _IO_EOF_SEEN) != 0) 2025-05-07T19:48:01.0072492Z #define __SUSECONDS_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:48:01.0072839Z #define MB_CUR_MAX (__ctype_get_mb_cur_max ()) 2025-05-07T19:48:01.0073150Z #define __SIZE_T__ 2025-05-07T19:48:01.0073427Z #define __stub_gtty 2025-05-07T19:48:01.0073657Z #define __pid_t_defined 2025-05-07T19:48:01.0073930Z #define __glibcxx_function_requires(...) 2025-05-07T19:48:01.0074224Z #define __SM_80_RT_HPP__ 2025-05-07T19:48:01.0074476Z #define __need_clockid_t 2025-05-07T19:48:01.0074719Z #define SSIZE_MAX LONG_MAX 2025-05-07T19:48:01.0074991Z #define _GLIBCXX_HAVE_USELOCALE 1 2025-05-07T19:48:01.0075310Z #define __glibcxx_requires_string_len(_String,_Len) 2025-05-07T19:48:01.0075649Z #define __mode_t_defined 2025-05-07T19:48:01.0075930Z #define cudaExternalMemoryDedicated 0x1 2025-05-07T19:48:01.0076247Z #define _GLIBCXX_HAVE_TGMATH_H 1 2025-05-07T19:48:01.0076529Z #define _GLIBCXX11_USE_C99_COMPLEX 1 2025-05-07T19:48:01.0076951Z #define _GLIBCXX17_DEPRECATED_SUGGEST(ALT) _GLIBCXX_DEPRECATED_SUGGEST(ALT) 2025-05-07T19:48:01.0077398Z #define __cpp_aggregate_bases 201603L 2025-05-07T19:48:01.0077692Z #define __cudaGet_blockDim() blockDim 2025-05-07T19:48:01.0077972Z #define _G_config_h 1 2025-05-07T19:48:01.0078211Z #define __stub_sstk 2025-05-07T19:48:01.0078432Z #define _IO_IN_BACKUP 0x100 2025-05-07T19:48:01.0078755Z #define _GLIBCXX_USE_C99_STDLIB _GLIBCXX11_USE_C99_STDLIB 2025-05-07T19:48:01.0079110Z #define __wur 2025-05-07T19:48:01.0079320Z #define NL_TEXTMAX INT_MAX 2025-05-07T19:48:01.0079572Z #define _G_HAVE_MMAP 1 2025-05-07T19:48:01.0079794Z #define _IO_OCT 040 2025-05-07T19:48:01.0080032Z #define __FLT128_HAS_DENORM__ 1 2025-05-07T19:48:01.0080290Z #define NL_MSGMAX INT_MAX 2025-05-07T19:48:01.0080528Z #define _GLIBCXX_USE_LFS 1 2025-05-07T19:48:01.0080807Z #define cudaDeviceScheduleBlockingSync 0x04 2025-05-07T19:48:01.0081126Z #define _POSIX_RTSIG_MAX 8 2025-05-07T19:48:01.0081386Z #define _GLIBCXX_NOEXCEPT noexcept 2025-05-07T19:48:01.0081772Z #define __glibcxx_requires_partitioned_lower(_First,_Last,_Value) 2025-05-07T19:48:01.0082170Z #define __FLT32_DECIMAL_DIG__ 9 2025-05-07T19:48:01.0082424Z #define _STL_ALGOBASE_H 1 2025-05-07T19:48:01.0082743Z #define __WIFCONTINUED(status) ((status) == __W_CONTINUED) 2025-05-07T19:48:01.0083089Z #define __off64_t_defined 2025-05-07T19:48:01.0083351Z #define _GLIBCXX_WEAK_DEFINITION 2025-05-07T19:48:01.0083724Z #define __FLT128_DIG__ 33 2025-05-07T19:48:01.0083964Z #define _GLIBCXX_USE_C99_INTTYPES_TR1 1 2025-05-07T19:48:01.0084226Z #define __INT32_C(c) c 2025-05-07T19:48:01.0084457Z #define __DEC64_EPSILON__ 1E-15DD 2025-05-07T19:48:01.0084725Z #define __ORDER_PDP_ENDIAN__ 3412 2025-05-07T19:48:01.0084972Z #define __DEC128_MIN_EXP__ (-6142) 2025-05-07T19:48:01.0085233Z #define __PDP_ENDIAN 3412 2025-05-07T19:48:01.0085454Z #define _ISOC95_SOURCE 1 2025-05-07T19:48:01.0085693Z #define _IO_fpos64_t _G_fpos64_t 2025-05-07T19:48:01.0086014Z #define _GLIBCXX_HAVE_TANL 1 2025-05-07T19:48:01.0086271Z #define _GLIBCXX_TXN_SAFE_DYN 2025-05-07T19:48:01.0086547Z #define M_PI_2l 1.570796326794896619231321691639751442L 2025-05-07T19:48:01.0086865Z #define BYTE_ORDER __BYTE_ORDER 2025-05-07T19:48:01.0087104Z #define __SM_90_RT_HPP__ 2025-05-07T19:48:01.0087357Z #define __INT_FAST32_TYPE__ long int 2025-05-07T19:48:01.0087635Z #define _GLIBCXX_HAVE_LIMIT_DATA 1 2025-05-07T19:48:01.0088012Z #define _GLIBCXX_BEGIN_NAMESPACE_LDBL_OR_CXX11 _GLIBCXX_BEGIN_NAMESPACE_CXX11 2025-05-07T19:48:01.0088422Z #define _BITS_TYPESIZES_H 1 2025-05-07T19:48:01.0088652Z #define htole32(x) (x) 2025-05-07T19:48:01.0088906Z #define __SYSCALL_SLONG_TYPE __SLONGWORD_TYPE 2025-05-07T19:48:01.0089192Z #define _GLIBCXX_USE_C99_MATH_TR1 1 2025-05-07T19:48:01.0089514Z #define WSTOPSIG(status) __WSTOPSIG (__WAIT_INT (status)) 2025-05-07T19:48:01.0089875Z #define _GLIBCXX_USE_C99_MATH _GLIBCXX11_USE_C99_MATH 2025-05-07T19:48:01.0090222Z #define __UINT_LEAST16_TYPE__ short unsigned int 2025-05-07T19:48:01.0090712Z #define __WIFEXITED(status) (__WTERMSIG(status) == 0) 2025-05-07T19:48:01.0091205Z #define ADJ_OFFSET 0x0001 2025-05-07T19:48:01.0091541Z #define cudaArrayLayered 0x01 2025-05-07T19:48:01.0091877Z #define _PSTL_ICC_18_OMP_SIMD_BROKEN (__INTEL_COMPILER == 1800) 2025-05-07T19:48:01.0092391Z #define cudaEventRecordDefault 0x00 2025-05-07T19:48:01.0092680Z #define USHRT_MAX (SHRT_MAX * 2 + 1) 2025-05-07T19:48:01.0092978Z #define _GLIBCXX_HAVE_FMODF 1 2025-05-07T19:48:01.0093241Z #define _PSTL_PRAGMA_MESSAGE(x) 2025-05-07T19:48:01.0093507Z #define unix 1 2025-05-07T19:48:01.0093719Z #define __DBL_HAS_DENORM__ 1 2025-05-07T19:48:01.0093988Z #define _POSIX_CHILD_MAX 25 2025-05-07T19:48:01.0094257Z #define _POSIX_MAX_INPUT 255 2025-05-07T19:48:01.0094518Z #define _OLD_STDIO_MAGIC 0xFABC0000 2025-05-07T19:48:01.0094800Z #define __USE_POSIX 1 2025-05-07T19:48:01.0095932Z #define __bswap_constant_64(x) (__extension__ ((((x) & 0xff00000000000000ull) >> 56) | (((x) & 0x00ff000000000000ull) >> 40) | (((x) & 0x0000ff0000000000ull) >> 24) | (((x) & 0x000000ff00000000ull) >> 8) | (((x) & 0x00000000ff000000ull) << 8) | (((x) & 0x0000000000ff0000ull) << 24) | (((x) & 0x000000000000ff00ull) << 40) | (((x) & 0x00000000000000ffull) << 56))) 2025-05-07T19:48:01.0097126Z #define __FD_ZERO_STOS "stosq" 2025-05-07T19:48:01.0097430Z #define _PSTL_VERSION_MAJOR (_PSTL_VERSION / 1000) 2025-05-07T19:48:01.0097770Z #define __THROWNL throw () 2025-05-07T19:48:01.0098108Z #define _GLIBCXX_MOVE3(_Tp,_Up,_Vp) std::move(_Tp, _Up, _Vp) 2025-05-07T19:48:01.0098463Z #define __cpp_rtti 199711L 2025-05-07T19:48:01.0098732Z #define __SIZE_TYPE__ long unsigned int 2025-05-07T19:48:01.0099018Z #define __PMT(args) args 2025-05-07T19:48:01.0099147Z #define __UINT64_MAX__ 0xffffffffffffffffUL 2025-05-07T19:48:01.0099295Z #define __va_arg_pack_len() __builtin_va_arg_pack_len () 2025-05-07T19:48:01.0099409Z #define BC_STRING_MAX _POSIX2_BC_STRING_MAX 2025-05-07T19:48:01.0099502Z #define _SIZE_T_DECLARED 2025-05-07T19:48:01.0099613Z #define _PSTL_STRING_AUX(x) #x 2025-05-07T19:48:01.0099704Z #define __FLT_IS_IEC_60559__ 2 2025-05-07T19:48:01.0100122Z #define _PSTL_CPP14_MAKE_REVERSE_ITERATOR_PRESENT (_MSC_VER >= 1900 || __cplusplus >= 201402L || __cpp_lib_make_reverse_iterator == 201402) 2025-05-07T19:48:01.0100239Z #define _GLIBCXX_HAVE_LIMIT_AS 1 2025-05-07T19:48:01.0100331Z #define XATTR_LIST_MAX 65536 2025-05-07T19:48:01.0100423Z #define __CUDACC_VER_MAJOR__ 11 2025-05-07T19:48:01.0100566Z #define __GNUC_WIDE_EXECUTION_CHARSET_NAME "UTF-32LE" 2025-05-07T19:48:01.0100664Z #define _WCHAR_T_H 2025-05-07T19:48:01.0100752Z #define __FLT64X_DIG__ 18 2025-05-07T19:48:01.0100843Z #define _IO_SHOWBASE 0200 2025-05-07T19:48:01.0100947Z #define _POSIX_QLIMIT 1 2025-05-07T19:48:01.0101042Z #define _GLIBCXX_HAS_GTHREADS 1 2025-05-07T19:48:01.0101140Z #define __INT8_TYPE__ signed char 2025-05-07T19:48:01.0101233Z #define __SURFACE_TYPES_H__ 2025-05-07T19:48:01.0101334Z #define __CUDA_ARCH__ 520 2025-05-07T19:48:01.0101513Z #define __cpp_digit_separators 201309L 2025-05-07T19:48:01.0101596Z #define __ELF__ 1 2025-05-07T19:48:01.0101708Z #define CLOCK_THREAD_CPUTIME_ID 3 2025-05-07T19:48:01.0101803Z #define __GCC_ASM_FLAG_OUTPUTS__ 1 2025-05-07T19:48:01.0101891Z #define STA_INS 0x0010 2025-05-07T19:48:01.0101990Z #define __UINT32_TYPE__ unsigned int 2025-05-07T19:48:01.0102106Z #define __ID_T_TYPE __U32_TYPE 2025-05-07T19:48:01.0102215Z #define __TIME_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:48:01.0102321Z #define __builtin_align__(a) __align__(a) 2025-05-07T19:48:01.0102442Z #define __DEVICE_DOUBLE_FUNCTIONS_HPP__ 2025-05-07T19:48:01.0102541Z #define _GLIBCXX_HAVE_MBSTATE_T 1 2025-05-07T19:48:01.0102645Z #define __cpp_lib_logical_traits 201510 2025-05-07T19:48:01.0102754Z #define ADJ_OFFSET_SS_READ 0xa001 2025-05-07T19:48:01.0102910Z #define __warnattr(msg) __attribute__((__warning__ (msg))) 2025-05-07T19:48:01.0103067Z #define _PSTL_PRAGMA_LOCATION " [Parallel STL message]: " 2025-05-07T19:48:01.0103169Z #define _IO_funlockfile(_fp) 2025-05-07T19:48:01.0103613Z #define cudaKernelNodeAttributeAccessPolicyWindow cudaLaunchAttributeAccessPolicyWindow 2025-05-07T19:48:01.0103740Z #define M_2_PIl 0.636619772367581343075535053490057448L 2025-05-07T19:48:01.0103831Z #define __DRIVER_TYPES_H__ 2025-05-07T19:48:01.0103927Z #define __FLT_RADIX__ 2 2025-05-07T19:48:01.0104081Z #define __INT_LEAST16_TYPE__ short int 2025-05-07T19:48:01.0104350Z #define __LDBL_EPSILON__ 1.08420217248550443400745280086994171e-19L 2025-05-07T19:48:01.0104437Z #define __UINTMAX_C(c) c ## UL 2025-05-07T19:48:01.0104536Z #define _GLIBCXX_USE_LSTAT 1 2025-05-07T19:48:01.0104630Z #define minor(dev) gnu_dev_minor (dev) 2025-05-07T19:48:01.0104716Z #define _POSIX_C_SOURCE 200809L 2025-05-07T19:48:01.0104817Z #define _GLIBCXX_HAVE_DIRENT_H 1 2025-05-07T19:48:01.0104911Z #define __GLIBCXX_BITSIZE_INT_N_0 128 2025-05-07T19:48:01.0104992Z #define WORD_BIT 32 2025-05-07T19:48:01.0105069Z #define _IO_USER_BUF 1 2025-05-07T19:48:01.0105165Z #define __VECTOR_TYPES_H__ 2025-05-07T19:48:01.0105264Z #define __SM_20_ATOMIC_FUNCTIONS_HPP__ 2025-05-07T19:48:01.0105363Z #define cudaHostAllocPortable 0x01 2025-05-07T19:48:01.0105467Z #define PTHREAD_STACK_MIN 16384 2025-05-07T19:48:01.0105559Z #define __long_double_t long double 2025-05-07T19:48:01.0105651Z #define _GLIBCXX_HAVE_LC_MESSAGES 1 2025-05-07T19:48:01.0105741Z #define _GLIBCXX_HAVE_ISINF 1 2025-05-07T19:48:01.0105839Z #define _POSIX_ARG_MAX 4096 2025-05-07T19:48:01.0105938Z #define _GLIBCXX17_CONSTEXPR constexpr 2025-05-07T19:48:01.0106032Z #define __TEXTURE_FETCH_FUNCTIONS_H__ 2025-05-07T19:48:01.0106119Z #define __k8 1 2025-05-07T19:48:01.0106301Z #define _GLIBCXX_NO_OBSOLETE_ISINF_ISNAN_DYNAMIC __GLIBC_PREREQ(2,23) 2025-05-07T19:48:01.0106463Z #define __FLT32X_MIN__ 2.22507385850720138309023271733240406e-308F32x 2025-05-07T19:48:01.0106584Z #define __LDBL_REDIR(name,proto) name proto 2025-05-07T19:48:01.0106673Z #define __SIG_ATOMIC_MAX__ 0x7fffffff 2025-05-07T19:48:01.0106765Z #define __SM_30_INTRINSICS_HPP__ 2025-05-07T19:48:01.0106882Z #define _PSTL_PRAGMA_SIMD_EXCLUSIVE_SCAN(PRM) 2025-05-07T19:48:01.0106982Z #define _GLIBCXX_EXTERN_TEMPLATE 1 2025-05-07T19:48:01.0107065Z #define __blksize_t_defined 2025-05-07T19:48:01.0107151Z #define _IO_SHOWPOINT 0400 2025-05-07T19:48:01.0107253Z #define _GLIBCXX_HAVE_LIMIT_RSS 1 2025-05-07T19:48:01.0107381Z #define _PSTL_PRAGMA_SIMD_ORDERED_MONOTONIC(PRM) 2025-05-07T19:48:01.0107484Z #define cudaDeviceLmemResizeToMax 0x10 2025-05-07T19:48:01.0107572Z #define _GLIBCXX_X86_RDRAND 1 2025-05-07T19:48:01.0107837Z #define _PSTL_UDS_PRESENT (__INTEL_COMPILER >= 1900 && __INTEL_COMPILER_BUILD_DATE >= 20180626) 2025-05-07T19:48:01.0107937Z #define __GCC_ATOMIC_WCHAR_T_LOCK_FREE 2 2025-05-07T19:48:01.0108023Z #define _IO_IS_FILEBUF 0x2000 2025-05-07T19:48:01.0108126Z #define _GLIBCXX_USE_DUAL_ABI 1 2025-05-07T19:48:01.0108373Z #define __bswap_constant_16(x) ((unsigned short int) ((((x) >> 8) & 0xff) | (((x) & 0xff) << 8))) 2025-05-07T19:48:01.0108539Z #define UCHAR_MAX (SCHAR_MAX * 2 + 1) 2025-05-07T19:48:01.0108643Z #define __SIZEOF_PTRDIFF_T__ 8 2025-05-07T19:48:01.0108749Z #define __W_STOPCODE(sig) ((sig) << 8 | 0x7f) 2025-05-07T19:48:01.0108826Z #define SEEK_SET 0 2025-05-07T19:48:01.0108917Z #define _GLIBCXX_TR1_GAMMA_TCC 1 2025-05-07T19:48:01.0109021Z #define __CUDA_API_VER_MINOR__ 8 2025-05-07T19:48:01.0109202Z #define _GLIBCXX_VISIBILITY(V) __attribute__ ((__visibility__ (#V))) 2025-05-07T19:48:01.0109289Z #define _MATH_H_MATHDEF 1 2025-05-07T19:48:01.0109385Z #define WSTOPPED 2 2025-05-07T19:48:01.0109697Z #define __bswap_constant_32(x) ((((x) & 0xff000000) >> 24) | (((x) & 0x00ff0000) >> 8) | (((x) & 0x0000ff00) << 8) | (((x) & 0x000000ff) << 24)) 2025-05-07T19:48:01.0109785Z #define _GLIBCXX_USE_FLOAT128 1 2025-05-07T19:48:01.0109874Z #define _IO_FLAGS2_NOTCANCEL 2 2025-05-07T19:48:01.0109984Z #define _PSTL_PRAGMA_VECTOR_UNALIGNED 2025-05-07T19:48:01.0110064Z #define __stub_sigreturn 2025-05-07T19:48:01.0110296Z #define __errordecl(name,msg) extern void name (void) __attribute__((__error__ (msg))) 2025-05-07T19:48:01.0110399Z #define _GLIBCXX_HAVE_UTIME_H 1 2025-05-07T19:48:01.0110481Z #define __HOST_CONFIG_H__ 2025-05-07T19:48:01.0110568Z #define _XOPEN_SOURCE_EXTENDED 1 2025-05-07T19:48:01.0110646Z #define CLOCK_TAI 11 2025-05-07T19:48:01.0110759Z #define _GLIBCXX_END_NAMESPACE_VERSION 2025-05-07T19:48:01.0110887Z #define __restrict_arr 2025-05-07T19:48:01.0110988Z #define _PSTL_PRAGMA_MESSAGE_POLICIES(x) 2025-05-07T19:48:01.0111768Z #define strndupa(s,n) (__extension__ ({ const char *__old = (s); size_t __len = strnlen (__old, (n)); char *__new = (char *) __builtin_alloca (__len + 1); __new[__len] = '\0'; (char *) memcpy (__new, __old, __len); })) 2025-05-07T19:48:01.0111957Z #define __attribute_artificial__ __attribute__ ((__artificial__)) 2025-05-07T19:48:01.0112040Z #define __USE_MISC 1 2025-05-07T19:48:01.0112159Z #define __UWORD_TYPE unsigned long int 2025-05-07T19:48:01.0112259Z #define _EXCEPTION_DEFINES_H 1 2025-05-07T19:48:01.0112347Z #define _GCC_LIMITS_H_ 2025-05-07T19:48:01.0112438Z #define __LDBL_DIG__ 18 2025-05-07T19:48:01.0112548Z #define __BIT_TYPES_DEFINED__ 1 2025-05-07T19:48:01.0112651Z #define __malloc_and_calloc_defined 2025-05-07T19:48:01.0112742Z #define __FLT64_IS_IEC_60559__ 2 2025-05-07T19:48:01.0112859Z #define _GLIBCXX_HAVE_SYS_SYSINFO_H 1 2025-05-07T19:48:01.0112947Z #define __x86_64__ 1 2025-05-07T19:48:01.0113030Z #define _SIZE_T_ 2025-05-07T19:48:01.0113130Z #define _POSIX2_COLL_WEIGHTS_MAX 2 2025-05-07T19:48:01.0113238Z #define __FLT32X_MIN_EXP__ (-1021) 2025-05-07T19:48:01.0113352Z #define __PTHREAD_RWLOCK_INT_FLAGS_SHARED 1 2025-05-07T19:48:01.0113468Z #define __DEC32_SUBNORMAL_MIN__ 0.000001E-95DF 2025-05-07T19:48:01.0113574Z #define _IO_iconv_t _G_iconv_t 2025-05-07T19:48:01.0113681Z #define _GLIBCXX_FLOAT_IS_IEEE_BINARY32 1 2025-05-07T19:48:01.0113815Z #define _GLIBCXX_SYNCHRONIZATION_HAPPENS_BEFORE(A) 2025-05-07T19:48:01.0113909Z #define _GLIBCXX_HAVE_DLFCN_H 1 2025-05-07T19:48:01.0114415Z #define strdupa(s) (__extension__ ({ const char *__old = (s); size_t __len = strlen (__old) + 1; char *__new = (char *) __builtin_alloca (__len); (char *) memcpy (__new, __old, __len); })) 2025-05-07T19:48:01.0114540Z #define __no_return__ __attribute__((noreturn)) 2025-05-07T19:48:01.0114682Z #define __device_builtin__ __location__(device_builtin) 2025-05-07T19:48:01.0114795Z #define _PSTL_HIDE_FROM_ABI_POP 2025-05-07T19:48:01.0114889Z #define _GLIBCXX_HAVE_ACOSF 1 2025-05-07T19:48:01.0114977Z #define STA_FLL 0x0008 2025-05-07T19:48:01.0115128Z #define _GLIBCXX_HAVE_BUILTIN_IS_CONSTANT_EVALUATED 1 2025-05-07T19:48:01.0115221Z #define _GLIBCXX_END_EXTERN_C } 2025-05-07T19:48:01.0115341Z #define __INT_FAST16_MAX__ 0x7fffffffffffffffL 2025-05-07T19:48:01.0115450Z #define __cpp_lib_integer_sequence 201304 2025-05-07T19:48:01.0115546Z #define __stub_revoke 2025-05-07T19:48:01.0115636Z #define __timer_t_defined 1 2025-05-07T19:48:01.0115767Z #define _GLIBCXX11_DEPRECATED _GLIBCXX_DEPRECATED 2025-05-07T19:48:01.0115867Z #define INT_MAX __INT_MAX__ 2025-05-07T19:48:01.0116028Z #define ULLONG_MAX (LLONG_MAX * 2ULL + 1) 2025-05-07T19:48:01.0116133Z #define _GLIBCXX_END_NAMESPACE_CXX11 } 2025-05-07T19:48:01.0116242Z #define _GLIBCXX_ICONV_CONST 2025-05-07T19:48:01.0116341Z #define major(dev) gnu_dev_major (dev) 2025-05-07T19:48:01.0116447Z #define cudaArrayTextureGather 0x08 2025-05-07T19:48:01.0116547Z #define _GLIBCXX_LT_OBJDIR ".libs/" 2025-05-07T19:48:01.0116645Z #define _IO_off_t __off_t 2025-05-07T19:48:01.0116731Z #define __FLT64_DIG__ 15 2025-05-07T19:48:01.0116958Z #define PTHREAD_DESTRUCTOR_ITERATIONS _POSIX_THREAD_DESTRUCTOR_ITERATIONS 2025-05-07T19:48:01.0117065Z #define _POSIX2_LINE_MAX 2048 2025-05-07T19:48:01.0117192Z #define __UINT_FAST32_MAX__ 0xffffffffffffffffUL 2025-05-07T19:48:01.0117313Z #define __UINT_LEAST64_TYPE__ long unsigned int 2025-05-07T19:48:01.0117409Z #define ADJ_FREQUENCY 0x0002 2025-05-07T19:48:01.0117525Z #define __CUDART_API_PTDS(api) api 2025-05-07T19:48:01.0117608Z #define NULL __null 2025-05-07T19:48:01.0117745Z #define cudaStreamPerThread ((cudaStream_t)0x2) 2025-05-07T19:48:01.0117858Z #define _GLIBCXX_CONSTEXPR constexpr 2025-05-07T19:48:01.0117951Z #define __FLT_HAS_QUIET_NAN__ 1 2025-05-07T19:48:01.0118041Z #define __FLT_MAX_10_EXP__ 38 2025-05-07T19:48:01.0118125Z #define FP_ZERO 2 2025-05-07T19:48:01.0118229Z #define _GLIBCXX_HAVE_FLOORL 1 2025-05-07T19:48:01.0118384Z #define __LONG_MAX__ 0x7fffffffffffffffL 2025-05-07T19:48:01.0118466Z #define __WCHAR_T__ 2025-05-07T19:48:01.0118573Z #define __FLT64X_HAS_DENORM__ 1 2025-05-07T19:48:01.0118773Z #define __DEC128_SUBNORMAL_MIN__ 0.000000000000000000000000000000001E-6143DL 2025-05-07T19:48:01.0118923Z #define _GLIBCXX_NORETURN __attribute__ ((__noreturn__)) 2025-05-07T19:48:01.0119018Z #define __FLT_HAS_INFINITY__ 1 2025-05-07T19:48:01.0119150Z #define __GNUC_EXECUTION_CHARSET_NAME "UTF-8" 2025-05-07T19:48:01.0119264Z #define _GLIBCXX20_DEPRECATED_SUGGEST(ALT) 2025-05-07T19:48:01.0119387Z #define __WSTOPSIG(status) __WEXITSTATUS(status) 2025-05-07T19:48:01.0119526Z #define cudaSurfaceTypeCubemapLayered 0xFC 2025-05-07T19:48:01.0119616Z #define _BSD_PTRDIFF_T_ 2025-05-07T19:48:01.0119708Z #define _SIGSET_H_types 1 2025-05-07T19:48:01.0119818Z #define cudaTextureType1DLayered 0xF1 2025-05-07T19:48:01.0119933Z #define __cpp_unicode_literals 200710L 2025-05-07T19:48:01.0120033Z #define __LONG_LONG_PAIR(HI,LO) LO, HI 2025-05-07T19:48:01.0120155Z #define __UINT_FAST16_TYPE__ long unsigned int 2025-05-07T19:48:01.0120301Z #define __bos0(ptr) __builtin_object_size (ptr, 0) 2025-05-07T19:48:01.0120406Z #define __DEC64_MAX__ 9.999999999999999E384DD 2025-05-07T19:48:01.0120535Z #define M_1_PIl 0.318309886183790671537767526745028724L 2025-05-07T19:48:01.0120724Z #define WIFSTOPPED(status) __WIFSTOPPED (__WAIT_INT (status)) 2025-05-07T19:48:01.0120816Z #define __INT_FAST32_WIDTH__ 64 2025-05-07T19:48:01.0120919Z #define _POSIX2_CHARCLASS_NAME_MAX 14 2025-05-07T19:48:01.0121030Z #define _GLIBCXX_BITS_STD_ABS_H 2025-05-07T19:48:01.0121119Z #define STA_MODE 0x4000 2025-05-07T19:48:01.0121229Z #define __CHAR16_TYPE__ short unsigned int 2025-05-07T19:48:01.0121342Z #define __PRAGMA_REDEFINE_EXTNAME 1 2025-05-07T19:48:01.0121458Z #define __glibcxx_signed_b(T,B) ((T)(-1) < 0) 2025-05-07T19:48:01.0121557Z #define __USING_NAMESPACE_C99(name) 2025-05-07T19:48:01.0121666Z #define BIG_ENDIAN __BIG_ENDIAN 2025-05-07T19:48:01.0121763Z #define _GLIBCXX_HAVE_SINL 1 2025-05-07T19:48:01.0121874Z #define EXPR_NEST_MAX _POSIX2_EXPR_NEST_MAX 2025-05-07T19:48:01.0121963Z #define __SIZE_WIDTH__ 64 2025-05-07T19:48:01.0122088Z #define __BLKSIZE_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:48:01.0122169Z #define __SEG_FS 1 2025-05-07T19:48:01.0122260Z #define _IO_size_t size_t 2025-05-07T19:48:01.0122366Z #define __INT_LEAST16_MAX__ 0x7fff 2025-05-07T19:48:01.0122461Z #define INT_MIN (-INT_MAX - 1) 2025-05-07T19:48:01.0122546Z #define __stub_lchmod 2025-05-07T19:48:01.0122634Z #define __DEC64_MANT_DIG__ 16 2025-05-07T19:48:01.0122736Z #define __LITTLE_ENDIAN 1234 2025-05-07T19:48:01.0122899Z #define __INT64_MAX__ 0x7fffffffffffffffL 2025-05-07T19:48:01.0122998Z #define _GLIBCXX_MANGLE_SIZE_T m 2025-05-07T19:48:01.0123088Z #define __SEG_GS 1 2025-05-07T19:48:01.0123273Z #define __FLT32_DENORM_MIN__ 1.40129846432481707092372958328991613e-45F32 2025-05-07T19:48:01.0123361Z #define _IOS_APPEND 8 2025-05-07T19:48:01.0123456Z #define __SIG_ATOMIC_WIDTH__ 32 2025-05-07T19:48:01.0123574Z #define _GLIBCXX98_USE_C99_WCHAR 1 2025-05-07T19:48:01.0123783Z #define _IO_IS_APPENDING 0x1000 2025-05-07T19:48:01.0123876Z #define __INT_LEAST64_TYPE__ long int 2025-05-07T19:48:01.0123968Z #define htole16(x) (x) 2025-05-07T19:48:01.0124069Z #define __TEXTURE_INDIRECT_FUNCTIONS_H__ 2025-05-07T19:48:01.0124155Z #define _GLIBCXX_HAVE_FCNTL_H 1 2025-05-07T19:48:01.0124240Z #define __INT16_TYPE__ short int 2025-05-07T19:48:01.0124347Z #define __INT_LEAST8_TYPE__ signed char 2025-05-07T19:48:01.0124445Z #define __glibcxx_class_requires(_a,_b) 2025-05-07T19:48:01.0124541Z #define _GLIBCXX_USE_CLOCK_MONOTONIC 1 2025-05-07T19:48:01.0124660Z #define __cpp_structured_bindings 201606L 2025-05-07T19:48:01.0124774Z #define __align__(n) __attribute__((aligned(n))) 2025-05-07T19:48:01.0124856Z #define __SIZEOF_INT__ 4 2025-05-07T19:48:01.0124942Z #define __DEC32_MAX_EXP__ 97 2025-05-07T19:48:01.0125055Z #define __NLINK_T_TYPE __SYSCALL_ULONG_TYPE 2025-05-07T19:48:01.0125187Z #define SEEK_HOLE 4 2025-05-07T19:48:01.0125269Z #define TIMER_ABSTIME 1 2025-05-07T19:48:01.0125367Z #define __INT_FAST8_MAX__ 0x7f 2025-05-07T19:48:01.0125450Z #define __CUDA_MATH_CRTIMP 2025-05-07T19:48:01.0125613Z #define __FLT128_MAX__ 1.18973149535723176508575932662800702e+4932F128 2025-05-07T19:48:01.0125718Z #define __INTPTR_MAX__ 0x7fffffffffffffffL 2025-05-07T19:48:01.0125830Z #define __cpp_sized_deallocation 201309L 2025-05-07T19:48:01.0125918Z #define __MATH_FUNCTIONS_HPP__ 2025-05-07T19:48:01.0126029Z #define __cpp_guaranteed_copy_elision 201606L 2025-05-07T19:48:01.0126126Z #define _LINUX_LIMITS_H 2025-05-07T19:48:01.0126200Z #define linux 1 2025-05-07T19:48:01.0126284Z #define MOD_MICRO ADJ_MICRO 2025-05-07T19:48:01.0126397Z #define _GLIBCXX_DEBUG_ASSERT(_Condition) 2025-05-07T19:48:01.0126485Z #define _GLIBCXX_HAVE_VSWSCANF 1 2025-05-07T19:48:01.0126571Z #define _GLIBCXX_HAVE_ISNAN 1 2025-05-07T19:48:01.0126682Z #define _XOPEN_IOV_MAX _POSIX_UIO_MAXIOV 2025-05-07T19:48:01.0126816Z #define __cudart_builtin__ __location__(cudart_builtin) 2025-05-07T19:48:01.0126903Z #define __cpp_lib_hypot 201603 2025-05-07T19:48:01.0127004Z #define __FLT64_HAS_QUIET_NAN__ 1 2025-05-07T19:48:01.0127092Z #define _GLIBCXX_HAVE_WCTYPE_H 1 2025-05-07T19:48:01.0127172Z #define MOD_NANO ADJ_NANO 2025-05-07T19:48:01.0127248Z #define htole64(x) (x) 2025-05-07T19:48:01.0127353Z #define FP_ILOGBNAN (-2147483647 - 1) 2025-05-07T19:48:01.0127438Z #define _GLIBCXX_HAVE_SINCOS 1 2025-05-07T19:48:01.0127522Z #define _IO_UPPERCASE 01000 2025-05-07T19:48:01.0127994Z #define cudaKernelNodeAttributeClusterSchedulingPolicyPreference cudaLaunchAttributeClusterSchedulingPolicyPreference 2025-05-07T19:48:01.0128079Z #define __USE_POSIX2 1 2025-05-07T19:48:01.0128166Z #define MOD_ESTERROR ADJ_ESTERROR 2025-05-07T19:48:01.0128251Z #define _GLIBCXX_HAVE_LDEXPF 1 2025-05-07T19:48:01.0128459Z #define _PSTL_ASSERT_MSG(_Condition,_Message) __glibcxx_assert(_Condition) 2025-05-07T19:48:01.0128535Z #define _XLOCALE_H 1 2025-05-07T19:48:01.0128622Z #define _GLIBCXX_USE_TMPNAM 1 2025-05-07T19:48:01.0128720Z #define __FLT32_MIN_10_EXP__ (-37) 2025-05-07T19:48:01.0128808Z #define __KEY_T_TYPE __S32_TYPE 2025-05-07T19:48:01.0128902Z #define __cudaGet_threadIdx() threadIdx 2025-05-07T19:48:01.0128980Z #define __EXCEPTIONS 1 2025-05-07T19:48:01.0129079Z #define __CUDART_API_PTSZ(api) api 2025-05-07T19:48:01.0129259Z #define __launch_bounds__(...) __annotate__(launch_bounds(__VA_ARGS__)) 2025-05-07T19:48:01.0129335Z #define __WORDSIZE 64 2025-05-07T19:48:01.0129429Z #define CLOCK_MONOTONIC 1 2025-05-07T19:48:01.0129508Z #define _STL_RELOPS_H 1 2025-05-07T19:48:01.0129594Z #define __PTRDIFF_WIDTH__ 64 2025-05-07T19:48:01.0129728Z #define __BEGIN_DECLS extern "C" { 2025-05-07T19:48:01.0129821Z #define __LDBL_MANT_DIG__ 64 2025-05-07T19:48:01.0129911Z #define _GLIBCXX_HAVE_TRUNCATE 1 2025-05-07T19:48:01.0130192Z #define cudaKernelNodeAttributeClusterDimension cudaLaunchAttributeClusterDimension 2025-05-07T19:48:01.0130421Z #define _PSTL_GCC_VERSION (__GNUC__ * 10000 + __GNUC_MINOR__ * 100 + __GNUC_PATCHLEVEL__) 2025-05-07T19:48:01.0130533Z #define _GLIBCXX_NAMESPACE_CXX11 __cxx11:: 2025-05-07T19:48:01.0130624Z #define _GLIBCXX_NUMERIC_LIMITS 1 2025-05-07T19:48:01.0130724Z #define __cpp_range_based_for 201603L 2025-05-07T19:48:01.0130827Z #define __cpp_lib_exchange_function 201304 2025-05-07T19:48:01.0130920Z #define _GLIBCXX_HAVE_INTTYPES_H 1 2025-05-07T19:48:01.0131018Z #define _GLIBCXX_DARWIN_USE_64_BIT_INODE 1 2025-05-07T19:48:01.0131188Z #define cudaCooperativeLaunchMultiDeviceNoPostSync 0x02 2025-05-07T19:48:01.0131274Z #define __FLT64_HAS_INFINITY__ 1 2025-05-07T19:48:01.0131357Z #define _GLIBCXX_CSTDLIB 1 2025-05-07T19:48:01.0131460Z #define _GLIBCXX_DEBUG_MACRO_SWITCH_H 1 2025-05-07T19:48:01.0131624Z #define __FLT64X_MAX__ 1.18973149535723176502126385303097021e+4932F64x 2025-05-07T19:48:01.0131708Z #define _BITS_BYTESWAP_H 1 2025-05-07T19:48:01.0131806Z #define _GLIBCXX_FWDREF(_Tp) _Tp&& 2025-05-07T19:48:01.0131915Z #define __STDCPP_DEFAULT_NEW_ALIGNMENT__ 16 2025-05-07T19:48:01.0132052Z #define _STRING_H 1 2025-05-07T19:48:01.0132134Z #define _GCC_MAX_ALIGN_T 2025-05-07T19:48:01.0132232Z #define __SM_32_INTRINSICS_HPP__ 2025-05-07T19:48:01.0132357Z #define __SIG_ATOMIC_MIN__ (-__SIG_ATOMIC_MAX__ - 1) 2025-05-07T19:48:01.0132444Z #define __code_model_small__ 1 2025-05-07T19:48:01.0132525Z #define _PSTL_CONFIG_H 2025-05-07T19:48:01.0132620Z #define __GCC_ATOMIC_LONG_LOCK_FREE 2 2025-05-07T19:48:01.0132722Z #define __cpp_nontype_template_args 201411L 2025-05-07T19:48:01.0132814Z #define cudaCpuDeviceId ((int)-1) 2025-05-07T19:48:01.0133154Z #define assert(expr) ((expr) ? __ASSERT_VOID_CAST (0) : __assert_fail (__STRING(expr), __FILE__, __LINE__, __ASSERT_FUNCTION)) 2025-05-07T19:48:01.0133238Z #define __DEC32_MANT_DIG__ 7 2025-05-07T19:48:01.0133321Z #define FILENAME_MAX 4096 2025-05-07T19:48:01.0133435Z #define __cpp_return_type_deduction 201304L 2025-05-07T19:48:01.0133506Z #define L_cuserid 9 2025-05-07T19:48:01.0133584Z #define __ino_t_defined 2025-05-07T19:48:01.0133658Z #define __k8__ 1 2025-05-07T19:48:01.0133753Z #define __INTPTR_TYPE__ long int 2025-05-07T19:48:01.0133850Z #define __UINT16_TYPE__ short unsigned int 2025-05-07T19:48:01.0133926Z #define __int8_t_defined 2025-05-07T19:48:01.0134023Z #define __WCHAR_TYPE__ int 2025-05-07T19:48:01.0134111Z #define __CLOCKID_T_TYPE __S32_TYPE 2025-05-07T19:48:01.0134211Z #define cudaHostRegisterPortable 0x01 2025-05-07T19:48:01.0134286Z #define _IOS_TRUNC 16 2025-05-07T19:48:01.0134405Z #define _GLIBCXX_PACKAGE_TARNAME "libstdc++" 2025-05-07T19:48:01.0134479Z #define __HAVE_COLUMN 2025-05-07T19:48:01.0134552Z #define __stub_fdetach 2025-05-07T19:48:01.0134966Z #define __CUDACC_VER__ "__CUDACC_VER__ is no longer supported. Use __CUDACC_VER_MAJOR__, __CUDACC_VER_MINOR__, and __CUDACC_VER_BUILD__ instead." 2025-05-07T19:48:01.0135043Z #define __need_time_t 2025-05-07T19:48:01.0135114Z #define __pic__ 2 2025-05-07T19:48:01.0135222Z #define __UINTPTR_MAX__ 0xffffffffffffffffUL 2025-05-07T19:48:01.0135320Z #define CLOCKS_PER_SEC 1000000l 2025-05-07T19:48:01.0135403Z #define __INT_FAST64_WIDTH__ 64 2025-05-07T19:48:01.0135491Z #define _GLIBCXX_HAVE_SOCKATMARK 1 2025-05-07T19:48:01.0135579Z #define __stub_chflags 2025-05-07T19:48:01.0135656Z #define CLOCK_BOOTTIME 7 2025-05-07T19:48:01.0135731Z #define __need_IOV_MAX 2025-05-07T19:48:01.0135832Z #define putc(_ch,_fp) _IO_putc (_ch, _fp) 2025-05-07T19:48:01.0135938Z #define __UQUAD_TYPE unsigned long int 2025-05-07T19:48:01.0136023Z #define __cpp_decltype 200707L 2025-05-07T19:48:01.0136112Z #define __BYTE_ORDER __LITTLE_ENDIAN 2025-05-07T19:48:01.0136206Z #define _GLIBCXX_USE_C99 1 2025-05-07T19:48:01.0136349Z #define _GLIBCXX_TR1_BETA_FUNCTION_TCC 1 2025-05-07T19:48:01.0136427Z #define TTY_NAME_MAX 32 2025-05-07T19:48:01.0136583Z #define _GLIBCXX_FORWARD(_Tp,__val) std::forward<_Tp>(__val) 2025-05-07T19:48:01.0136676Z #define le64toh(x) (x) 2025-05-07T19:48:01.0136782Z #define __INT_FAST64_MAX__ 0x7fffffffffffffffL 2025-05-07T19:48:01.0136934Z #define _PSTL_ASSERT(_Condition) __glibcxx_assert(_Condition) 2025-05-07T19:48:01.0137049Z #define __GCC_ATOMIC_TEST_AND_SET_TRUEVAL 1 2025-05-07T19:48:01.0137134Z #define STA_PPSTIME 0x0004 2025-05-07T19:48:01.0137208Z #define __import__ 2025-05-07T19:48:01.0137298Z #define BUFSIZ _IO_BUFSIZ 2025-05-07T19:48:01.0137423Z #define M_SQRT2l 1.414213562373095048801688724209698079L 2025-05-07T19:48:01.0137496Z #define __export__ 2025-05-07T19:48:01.0137603Z #define __FSID_T_TYPE struct { int __val[2]; } 2025-05-07T19:48:01.0137706Z #define cudaMemAttachHost 0x02 2025-05-07T19:48:01.0137856Z #define __FLT_NORM_MAX__ 3.40282346638528859811704183484516925e+38F 2025-05-07T19:48:01.0137945Z #define _GLIBCXX_HAVE_ICONV 1 2025-05-07T19:48:01.0138039Z #define _GLIBCXX_SYMVER 1 2025-05-07T19:48:01.0138127Z #define __FLT64X_MAX_EXP__ 16384 2025-05-07T19:48:01.0138208Z #define _WCHAR_T_DECLARED 2025-05-07T19:48:01.0138313Z #define __UINT_FAST64_TYPE__ long unsigned int 2025-05-07T19:48:01.0138421Z #define __cpp_inline_variables 201606L 2025-05-07T19:48:01.0138556Z #define WNOWAIT 0x01000000 2025-05-07T19:48:01.0138632Z #define PLOSS 6 2025-05-07T19:48:01.0138728Z #define M_LN10 2.30258509299404568402 2025-05-07T19:48:01.0138806Z #define EXIT_SUCCESS 0 2025-05-07T19:48:01.0138889Z #define __LDBL_REDIR_DECL(name) 2025-05-07T19:48:01.0138973Z #define _GLIBCXX_HAVE_STRTOF 1 2025-05-07T19:48:01.0139075Z #define MOD_FREQUENCY ADJ_FREQUENCY 2025-05-07T19:48:01.0139155Z #define __thread__ __thread 2025-05-07T19:48:01.0139236Z #define _GLIBCXX_HAVE_MEMORY_H 1 2025-05-07T19:48:01.0139326Z #define __INT_MAX__ 0x7fffffff 2025-05-07T19:48:01.0139541Z #define __glibcxx_requires_partitioned_upper_pred(_First,_Last,_Value,_Pred) 2025-05-07T19:48:01.0139632Z #define _GLIBCXX_HAVE_SINF 1 2025-05-07T19:48:01.0139703Z #define __linux__ 1 2025-05-07T19:48:01.0139798Z #define _GLIBCXX_HAVE_SINHL 1 2025-05-07T19:48:01.0139916Z #define M_LN2l 0.693147180559945309417232121458176568L 2025-05-07T19:48:01.0139999Z #define __S16_TYPE short int 2025-05-07T19:48:01.0140349Z #define __glibcxx_constexpr_assert(cond) if (__builtin_is_constant_evaluated() && !bool(cond)) __builtin_unreachable() 2025-05-07T19:48:01.0140444Z #define __NVCC_DIAG_PRAGMA_SUPPORT__ 1 2025-05-07T19:48:01.0140618Z #define __bos(ptr) __builtin_object_size (ptr, __USE_FORTIFY_LEVEL > 1) 2025-05-07T19:48:01.0140714Z #define __COMMON_FUNCTIONS_H__ 2025-05-07T19:48:01.0140802Z #define UINT_MAX (INT_MAX * 2U + 1U) 2025-05-07T19:48:01.0140877Z #define _T_SIZE_ 2025-05-07T19:48:01.0140966Z #define LLONG_MAX __LONG_LONG_MAX__ 2025-05-07T19:48:01.0141057Z #define _PSTL_VERSION 12000 2025-05-07T19:48:01.0141167Z #define __noinline__ __attribute__((noinline)) 2025-05-07T19:48:01.0141254Z #define __WNOTHREAD 0x20000000 2025-05-07T19:48:01.0141350Z #define _G_va_list __gnuc_va_list 2025-05-07T19:48:01.0141427Z #define _IOS_INPUT 1 2025-05-07T19:48:01.0141513Z #define __USE_LARGEFILE64 1 2025-05-07T19:48:01.0141608Z #define _GLIBCXX_TR1_EXP_INTEGRAL_TCC 1 2025-05-07T19:48:01.0141698Z #define __INT64_TYPE__ long int 2025-05-07T19:48:01.0141790Z #define _POSIX_SSIZE_MAX 32767 2025-05-07T19:48:01.0141879Z #define __shared__ __location__(shared) 2025-05-07T19:48:01.0141971Z #define __FLT_MAX_EXP__ 128 2025-05-07T19:48:01.0142062Z #define CLOCK_PROCESS_CPUTIME_ID 2 2025-05-07T19:48:01.0142203Z #define __glibc_unlikely(cond) __builtin_expect((cond), 0) 2025-05-07T19:48:01.0142283Z #define __gid_t_defined 2025-05-07T19:48:01.0142395Z #define _GLIBCXX_USE_SC_NPROCESSORS_ONLN 1 2025-05-07T19:48:01.0142481Z #define __ORDER_BIG_ENDIAN__ 4321 2025-05-07T19:48:01.0142666Z #define __glibcxx_requires_can_increment_range(_First1,_Last1,_First2) 2025-05-07T19:48:01.0142764Z #define _GLIBCXX17_INLINE inline 2025-05-07T19:48:01.0142901Z #define __DBL_MANT_DIG__ 53 2025-05-07T19:48:01.0142988Z #define ___int_size_t_h 2025-05-07T19:48:01.0143101Z #define __cpp_inheriting_constructors 201511L 2025-05-07T19:48:01.0143201Z #define CUDA_DOUBLE_MATH_FUNCTIONS 1 2025-05-07T19:48:01.0143287Z #define _GLIBCXX_HAVE_FENV_H 1 2025-05-07T19:48:01.0143378Z #define _GLIBCXX_HAVE_STDBOOL_H 1 2025-05-07T19:48:01.0143470Z #define __SIZEOF_FLOAT128__ 16 2025-05-07T19:48:01.0143584Z #define __INT_LEAST64_MAX__ 0x7fffffffffffffffL 2025-05-07T19:48:01.0143688Z #define _GLIBCXX_TR1_HYPERGEOMETRIC_TCC 1 2025-05-07T19:48:01.0143798Z #define _GLIBCXX_DEBUG_PEDASSERT(_Condition) 2025-05-07T19:48:01.0143889Z #define __clock_t_defined 1 2025-05-07T19:48:01.0143977Z #define _POSIX_SEM_VALUE_MAX 32767 2025-05-07T19:48:01.0144055Z #define __GLIBC_MINOR__ 17 2025-05-07T19:48:01.0144147Z #define __DEC64_MIN__ 1E-383DD 2025-05-07T19:48:01.0144233Z #define __WINT_TYPE__ unsigned int 2025-05-07T19:48:01.0144327Z #define __UINT_LEAST32_TYPE__ unsigned int 2025-05-07T19:48:01.0144412Z #define __SIZEOF_SHORT__ 2 2025-05-07T19:48:01.0144577Z #define __FLT32_NORM_MAX__ 3.40282346638528859811704183484516925e+38F32 2025-05-07T19:48:01.0144647Z #define __SSE__ 1 2025-05-07T19:48:01.0144734Z #define SEM_VALUE_MAX (2147483647) 2025-05-07T19:48:01.0144832Z #define M_SQRT1_2 0.70710678118654752440 2025-05-07T19:48:01.0144961Z #define __sigset_t_defined 2025-05-07T19:48:01.0145044Z #define __LDBL_MIN_EXP__ (-16381) 2025-05-07T19:48:01.0145132Z #define MOD_TAI ADJ_TAI 2025-05-07T19:48:01.0145216Z #define _IO_va_list __gnuc_va_list 2025-05-07T19:48:01.0145298Z #define _GLIBCXX_HAVE_LOGL 1 2025-05-07T19:48:01.0145371Z #define __SM_70_RT_H__ 2025-05-07T19:48:01.0145465Z #define _GLIBCXX_HAVE_WRITEV 1 2025-05-07T19:48:01.0145547Z #define _GLIBCXX_HAVE_EXPL 1 2025-05-07T19:48:01.0145694Z #define __FLT64_MAX__ 1.79769313486231570814527423731704357e+308F64 2025-05-07T19:48:01.0145784Z #define _POSIX_MAX_CANON 255 2025-05-07T19:48:01.0145889Z #define _GLIBCXX_NOEXCEPT_PARM , bool _NE 2025-05-07T19:48:01.0145971Z #define FD_SETSIZE __FD_SETSIZE 2025-05-07T19:48:01.0146057Z #define _BITS_PTHREADTYPES_H 1 2025-05-07T19:48:01.0146149Z #define _GLIBCXX_TXN_SAFE 2025-05-07T19:48:01.0146222Z #define __amd64__ 1 2025-05-07T19:48:01.0146300Z #define __WINT_WIDTH__ 32 2025-05-07T19:48:01.0146404Z #define __CUDA_DEVICE_RUNTIME_API_H__ 2025-05-07T19:48:01.0146659Z #define __REDIRECT_NTHNL(name,proto,alias) name proto __THROWNL __asm__ (__ASMNAME (#alias)) 2025-05-07T19:48:01.0146747Z #define _GLIBCXX_STDIO_SEEK_CUR 1 2025-05-07T19:48:01.0146820Z #define EOF (-1) 2025-05-07T19:48:01.0146922Z #define __WAIT_STATUS_DEFN void * 2025-05-07T19:48:01.0147007Z #define __USE_POSIX199309 1 2025-05-07T19:48:01.0147093Z #define __INT_LEAST64_WIDTH__ 64 2025-05-07T19:48:01.0147186Z #define __LDBL_MAX_EXP__ 16384 2025-05-07T19:48:01.0147272Z #define __FLT32X_MAX_10_EXP__ 308 2025-05-07T19:48:01.0147357Z #define LLONG_MIN (-LLONG_MAX-1) 2025-05-07T19:48:01.0147458Z #define cudaSurfaceType2DLayered 0xF2 2025-05-07T19:48:01.0147560Z #define ____mbstate_t_defined 1 2025-05-07T19:48:01.0147640Z #define STA_NANO 0x2000 2025-05-07T19:48:01.0147722Z #define _GLIBCXX_RELEASE 11 2025-05-07T19:48:01.0147816Z #define _GLIBCXX_HAVE_LOG10F 1 2025-05-07T19:48:01.0147902Z #define _GLIBCXX_HAVE_LOG10L 1 2025-05-07T19:48:01.0147986Z #define _IO_LINKED 0x80 2025-05-07T19:48:01.0148073Z #define __cpp_lib_launder 201606 2025-05-07T19:48:01.0148168Z #define __SIZEOF_INT128__ 16 2025-05-07T19:48:01.0148261Z #define __PTHREAD_MUTEX_HAVE_PREV 1 2025-05-07T19:48:01.0148344Z #define __FLT64X_IS_IEC_60559__ 2 2025-05-07T19:48:01.0148442Z #define _GLIBCXX_TYPE_TRAITS 1 2025-05-07T19:48:01.0148540Z #define __DEVICE_ATOMIC_FUNCTIONS_HPP__ 2025-05-07T19:48:01.0148634Z #define __BLKCNT64_T_TYPE __SQUAD_TYPE 2025-05-07T19:48:01.0148718Z #define __LDBL_MAX_10_EXP__ 4932 2025-05-07T19:48:01.0148813Z #define __W_CONTINUED 0xffff 2025-05-07T19:48:01.0148897Z #define __ATOMIC_RELAXED 0 2025-05-07T19:48:01.0149067Z #define w_coredump __wait_terminated.__w_coredump 2025-05-07T19:48:01.0149189Z #define __FSBLKCNT_T_TYPE __SYSCALL_ULONG_TYPE 2025-05-07T19:48:01.0149367Z #define __DBL_EPSILON__ double(2.22044604925031308084726333618164062e-16L) 2025-05-07T19:48:01.0149442Z #define __stub_stty 2025-05-07T19:48:01.0149520Z #define le16toh(x) (x) 2025-05-07T19:48:01.0149628Z #define BC_SCALE_MAX _POSIX2_BC_SCALE_MAX 2025-05-07T19:48:01.0149789Z #define __FLT128_MIN__ 3.36210314311209350626267781732175260e-4932F128 2025-05-07T19:48:01.0149861Z #define _SIZET_ 2025-05-07T19:48:01.0149949Z #define XATTR_NAME_MAX 255 2025-05-07T19:48:01.0150055Z #define _GLIBCXX_HAVE_ATTRIBUTE_VISIBILITY 1 2025-05-07T19:48:01.0150135Z #define _SVID_SOURCE 1 2025-05-07T19:48:01.0150218Z #define _LP64 1 2025-05-07T19:48:01.0150297Z #define _LIBC_LIMITS_H_ 1 2025-05-07T19:48:01.0150516Z #define __REDIRECT_NTH_LDBL(name,proto,alias) __REDIRECT_NTH (name, proto, alias) 2025-05-07T19:48:01.0150618Z #define _GLIBCXX_TR1_BESSEL_FUNCTION_TCC 1 2025-05-07T19:48:01.0150705Z #define __UINT8_C(c) c 2025-05-07T19:48:01.0150792Z #define _GLIBCXX_HAVE_CEILF 1 2025-05-07T19:48:01.0150879Z #define _GLIBCXX_HAVE_CEILL 1 2025-05-07T19:48:01.0150976Z #define __CUDA_ARCH_LIST__ 520 2025-05-07T19:48:01.0151190Z #define __FLT64_MAX_EXP__ 1024 2025-05-07T19:48:01.0151281Z #define MOD_MAXERROR ADJ_MAXERROR 2025-05-07T19:48:01.0151587Z #define CUDARTAPI 2025-05-07T19:48:01.0151699Z #define cudaEventWaitDefault 0x00 2025-05-07T19:48:01.0151779Z #define IOV_MAX 1024 2025-05-07T19:48:01.0151926Z #define __glibcxx_requires_irreflexive2(_First,_Last) 2025-05-07T19:48:01.0152030Z #define __INT_LEAST32_TYPE__ int 2025-05-07T19:48:01.0152117Z #define STA_CLK 0x8000 2025-05-07T19:48:01.0152214Z #define cudaMemAttachSingle 0x04 2025-05-07T19:48:01.0152297Z #define __wchar_t__ 2025-05-07T19:48:01.0152404Z #define __cpp_lib_is_aggregate 201703 2025-05-07T19:48:01.0152483Z #define SEEK_END 2 2025-05-07T19:48:01.0152575Z #define __SIZEOF_WCHAR_T__ 4 2025-05-07T19:48:01.0152760Z #define _GLIBCXX_USE_TBB_PAR_BACKEND __has_include() 2025-05-07T19:48:01.0152853Z #define _IO_ftrylockfile(_fp) 2025-05-07T19:48:01.0152996Z #define _GLIBCXX_USE_C99_WCHAR _GLIBCXX11_USE_C99_WCHAR 2025-05-07T19:48:01.0153088Z #define ____FILE_defined 1 2025-05-07T19:48:01.0153220Z #define _GLIBCXX_HAVE_BUILTIN_IS_AGGREGATE 1 2025-05-07T19:48:01.0153321Z #define __GNUC_PATCHLEVEL__ 0 2025-05-07T19:48:01.0153410Z #define _ISOC99_SOURCE 1 2025-05-07T19:48:01.0153520Z #define __VECTOR_FUNCTIONS_H__ 2025-05-07T19:48:01.0153780Z #define __REDIRECT_NTH(name,proto,alias) name proto __THROW __asm__ (__ASMNAME (#alias)) 2025-05-07T19:48:01.0153915Z #define _PSTL_USE_NONTEMPORAL_STORES_IF_ALLOWED 2025-05-07T19:48:01.0154001Z #define _IO_RIGHT 04 2025-05-07T19:48:01.0154110Z #define __END_NAMESPACE_STD 2025-05-07T19:48:01.0154303Z #define __FLT128_NORM_MAX__ 1.18973149535723176508575932662800702e+4932F128 2025-05-07T19:48:01.0154397Z #define _GLIBCXX_STD_C std 2025-05-07T19:48:01.0154508Z #define _LARGEFILE64_SOURCE 1 2025-05-07T19:48:01.0154613Z #define _GLIBCXX_USE_C99_STDINT_TR1 1 2025-05-07T19:48:01.0154697Z #define _STDDEF_H_ 2025-05-07T19:48:01.0154877Z #define __FLT64_NORM_MAX__ 1.79769313486231570814527423731704357e+308F64 2025-05-07T19:48:01.0154990Z #define __FLT128_HAS_QUIET_NAN__ 1 2025-05-07T19:48:01.0155102Z #define __INTMAX_MAX__ 0x7fffffffffffffffL 2025-05-07T19:48:01.0155247Z #define __glibcxx_requires_irreflexive(_First,_Last) 2025-05-07T19:48:01.0155365Z #define __INT_FAST8_TYPE__ signed char 2025-05-07T19:48:01.0155464Z #define __PID_T_TYPE __S32_TYPE 2025-05-07T19:48:01.0155578Z #define __cpp_namespace_attributes 201411L 2025-05-07T19:48:01.0155696Z #define CHARCLASS_NAME_MAX 2048 2025-05-07T19:48:01.0155793Z #define _GLIBCXX_HAVE_TANF 1 2025-05-07T19:48:01.0155888Z #define _GLIBCXX_USE_ST_MTIM 1 2025-05-07T19:48:01.0156069Z #define __FLT64X_MIN__ 3.36210314311209350626267781732175260e-4932F64x 2025-05-07T19:48:01.0156181Z #define __CUDA_RUNTIME_H__ 2025-05-07T19:48:01.0156419Z #define WIFSIGNALED(status) __WIFSIGNALED (__WAIT_INT (status)) 2025-05-07T19:48:01.0156521Z #define _GLIBCXX_HAVE_STDLIB_H 1 2025-05-07T19:48:01.0156631Z #define __STDCPP_THREADS__ 1 2025-05-07T19:48:01.0156738Z #define __cpp_lib_is_null_pointer 201309 2025-05-07T19:48:01.0156885Z #define M_2_SQRTPIl 1.128379167095512573896158903121545172L 2025-05-07T19:48:01.0156990Z #define __GNUC_STDC_INLINE__ 1 2025-05-07T19:48:01.0157098Z #define _POSIX_UIO_MAXIOV 16 2025-05-07T19:48:01.0157198Z #define _PSTL_PAR_BACKEND_SERIAL 2025-05-07T19:48:01.0157296Z #define P_tmpdir "/tmp" 2025-05-07T19:48:01.0157433Z #define __ASSERT_FUNCTION __PRETTY_FUNCTION__ 2025-05-07T19:48:01.0157528Z #define __FLT64_HAS_DENORM__ 1 2025-05-07T19:48:01.0157630Z #define __WORDSIZE_TIME64_COMPAT32 1 2025-05-07T19:48:01.0157799Z #define _GLIBCXX_DEPRECATED __attribute__ ((__deprecated__)) 2025-05-07T19:48:01.0157989Z #define __FLT32_EPSILON__ 1.19209289550781250000000000000000000e-7F32 2025-05-07T19:48:01.0158090Z #define _PSTL_HIDE_FROM_ABI_PUSH 2025-05-07T19:48:01.0158219Z #define cudaStreamLegacy ((cudaStream_t)0x1) 2025-05-07T19:48:01.0158348Z #define _IO_cleanup_region_start(_fct,_fp) 2025-05-07T19:48:01.0158453Z #define __location__(a) __annotate__(a) 2025-05-07T19:48:01.0158696Z #define __device_builtin_surface_type__ __location__(device_builtin_surface_type) 2025-05-07T19:48:01.0158876Z #define _POSIX2_BC_BASE_MAX 99 2025-05-07T19:48:01.0158972Z #define __DBL_DECIMAL_DIG__ 17 2025-05-07T19:48:01.0159064Z #define __STDC_UTF_32__ 1 2025-05-07T19:48:01.0159159Z #define __INT_FAST8_WIDTH__ 8 2025-05-07T19:48:01.0159271Z #define _POSIX_MQ_PRIO_MAX 32 2025-05-07T19:48:01.0159358Z #define __FXSR__ 1 2025-05-07T19:48:01.0159440Z #define _SIZE_T 2025-05-07T19:48:01.0159560Z #define _GLIBCXX_USE_GETTIMEOFDAY 1 2025-05-07T19:48:01.0159672Z #define cudaHostRegisterReadOnly 0x08 2025-05-07T19:48:01.0159848Z #define __FLT32X_MAX__ 1.79769313486231570814527423731704357e+308F32x 2025-05-07T19:48:01.0159942Z #define _IO_ssize_t __ssize_t 2025-05-07T19:48:01.0160157Z #define __DBL_NORM_MAX__ double(1.79769313486231570814527423731704357e+308L) 2025-05-07T19:48:01.0160250Z #define _GXX_NULLPTR_T 2025-05-07T19:48:01.0160376Z #define __glibcxx_class_requires3(_a,_b,_c,_d) 2025-05-07T19:48:01.0160480Z #define FOPEN_MAX 16 2025-05-07T19:48:01.0160569Z #define __BIG_ENDIAN 4321 2025-05-07T19:48:01.0160689Z #define __BYTE_ORDER__ __ORDER_LITTLE_ENDIAN__ 2025-05-07T19:48:01.0160791Z #define __suseconds_t_defined 2025-05-07T19:48:01.0160893Z #define WCONTINUED 8 2025-05-07T19:48:01.0160982Z #define __off_t_defined 2025-05-07T19:48:01.0161069Z #define stderr stderr 2025-05-07T19:48:01.0161179Z #define M_LOG10E 0.43429448190325182765 2025-05-07T19:48:01.0161285Z #define __FSBLKCNT64_T_TYPE __UQUAD_TYPE 2025-05-07T19:48:01.0161398Z #define __glibcxx_requires_string(_String) 2025-05-07T19:48:01.0161495Z #define _GLIBCXX_HAVE_LDEXPL 1 2025-05-07T19:48:01.0161602Z #define __INTMAX_WIDTH__ 64 2025-05-07T19:48:01.0162045Z #define _PSTL_CPP14_2RANGE_MISMATCH_EQUAL_PRESENT (_MSC_VER >= 1900 || __cplusplus >= 201300L || __cpp_lib_robust_nonmodifying_seq_ops == 201304) 2025-05-07T19:48:01.0162134Z #define _GCC_SIZE_T 2025-05-07T19:48:01.0162246Z #define __INO64_T_TYPE __UQUAD_TYPE 2025-05-07T19:48:01.0162350Z #define __cpp_runtime_arrays 198712L 2025-05-07T19:48:01.0162457Z #define __UINT64_TYPE__ long unsigned int 2025-05-07T19:48:01.0162571Z #define __USE_XOPEN2K8XSI 1 2025-05-07T19:48:01.0162664Z #define __UINT32_C(c) c ## U 2025-05-07T19:48:01.0162763Z #define __FD_ELT(d) ((d) / __NFDBITS) 2025-05-07T19:48:01.0162866Z #define __cpp_alias_templates 200704L 2025-05-07T19:48:01.0162988Z #define cudaHostAllocMapped 0x02 2025-05-07T19:48:01.0163094Z #define __DEVICE_LAUNCH_PARAMETERS_H__ 2025-05-07T19:48:01.0163185Z #define _STL_ITERATOR_H 1 2025-05-07T19:48:01.0163285Z #define __size_t__ 2025-05-07T19:48:01.0163417Z #define cudaStreamAttrID cudaLaunchAttributeID 2025-05-07T19:48:01.0163514Z #define htobe64(x) __bswap_64 (x) 2025-05-07T19:48:01.0163729Z #define cudaEventRecordExternal 0x01 2025-05-07T19:48:01.0163886Z #define _IO_BUFSIZ _G_BUFSIZ 2025-05-07T19:48:01.0164049Z #define __FLT_DENORM_MIN__ 1.40129846432481707092372958328991613e-45F 2025-05-07T19:48:01.0164131Z #define __SM_80_RT_H__ 2025-05-07T19:48:01.0164223Z #define _ENDIAN_H 1 2025-05-07T19:48:01.0164313Z #define _GLIBCXX20_CONSTEXPR 2025-05-07T19:48:01.0164412Z #define __NV_NO_HOST_COMPILER_CHECK 1 2025-05-07T19:48:01.0164503Z #define __BEGIN_NAMESPACE_C99 2025-05-07T19:48:01.0164593Z #define __try try 2025-05-07T19:48:01.0164682Z #define _GLIBCXX_HAVE_FINITE 1 2025-05-07T19:48:01.0164770Z #define __FLT128_IS_IEC_60559__ 2 2025-05-07T19:48:01.0164868Z #define __INT8_MAX__ 0x7f 2025-05-07T19:48:01.0164951Z #define __LONG_WIDTH__ 64 2025-05-07T19:48:01.0165029Z #define __PIC__ 2 2025-05-07T19:48:01.0165116Z #define _GLIBCXX_HAVE_SQRTF 1 2025-05-07T19:48:01.0165222Z #define _GLIBCXX_HAVE_EXECINFO_H 1 2025-05-07T19:48:01.0165335Z #define __UINT_FAST32_TYPE__ long unsigned int 2025-05-07T19:48:01.0165460Z #define FD_ISSET(fd,fdsetp) __FD_ISSET (fd, fdsetp) 2025-05-07T19:48:01.0165600Z #define _GLIBCXX_HAVE_EXCEPTION_PTR_SINCE_GCC46 1 2025-05-07T19:48:01.0165689Z #define _GLIBCXX_HAVE_FLOAT_H 1 2025-05-07T19:48:01.0165778Z #define _GLIBCXX_HAVE_ATANL 1 2025-05-07T19:48:01.0165955Z #define __FLT32X_NORM_MAX__ 1.79769313486231570814527423731704357e+308F32x 2025-05-07T19:48:01.0166431Z #define __DEVICE_FUNCTIONS_HPP__ 2025-05-07T19:48:01.0166523Z #define __CHAR32_TYPE__ unsigned int 2025-05-07T19:48:01.0166633Z #define _GLIBCXX_BEGIN_EXTERN_C extern "C" { 2025-05-07T19:48:01.0166740Z #define _GLIBCXX_HAVE_READLINK 1 2025-05-07T19:48:01.0166819Z #define _IO_STDIO_H 2025-05-07T19:48:01.0166906Z #define _CONCEPT_CHECK_H 1 2025-05-07T19:48:01.0167061Z #define __FLT_MAX__ 3.40282346638528859811704183484516925e+38F 2025-05-07T19:48:01.0167155Z #define _GLIBCXX_HAVE_NETINET_IN_H 1 2025-05-07T19:48:01.0167269Z #define _GLIBCXX_TR1_SPECIAL_FUNCTION_UTIL_H 1 2025-05-07T19:48:01.0167348Z #define LONG_BIT 64 2025-05-07T19:48:01.0167466Z #define __SIZEOF_PTHREAD_BARRIERATTR_T 4 2025-05-07T19:48:01.0167570Z #define _GLIBCXX_USE_ALLOCATOR_NEW 1 2025-05-07T19:48:01.0167691Z #define __cpp_lib_math_special_functions 201603L 2025-05-07T19:48:01.0167799Z #define __fsfilcnt_t_defined 2025-05-07T19:48:01.0167888Z #define __blkcnt_t_defined 2025-05-07T19:48:01.0167975Z #define __USE_LARGEFILE 1 2025-05-07T19:48:01.0168073Z #define __cpp_constexpr 201603L 2025-05-07T19:48:01.0168182Z #define CUDART_VERSION 11080 2025-05-07T19:48:01.0168277Z #define cudaDeviceMapHost 0x08 2025-05-07T19:48:01.0168364Z #define _GLIBCXX_CMATH 1 2025-05-07T19:48:01.0168567Z #define __attribute_format_arg__(x) __attribute__ ((__format_arg__ (x))) 2025-05-07T19:48:01.0168652Z #define __lldiv_t_defined 1 2025-05-07T19:48:01.0168731Z #define __SSE2__ 1 2025-05-07T19:48:01.0168810Z #define _IOLBF 1 2025-05-07T19:48:01.0168919Z #define _GLIBCXX_HAVE_SYS_TYPES_H 1 2025-05-07T19:48:01.0169010Z #define _GLIBCXX_HAVE_FLOORF 1 2025-05-07T19:48:01.0169108Z #define __cpp_deduction_guides 201703L 2025-05-07T19:48:01.0169208Z #define ADJ_TICK 0x4000 2025-05-07T19:48:01.0169298Z #define _GLIBCXX_HAVE_EXPF 1 2025-05-07T19:48:01.0169402Z #define __annotate__(a) __attribute__((a)) 2025-05-07T19:48:01.0169489Z #define __INT32_TYPE__ int 2025-05-07T19:48:01.0169590Z #define __SIZEOF_DOUBLE__ 8 2025-05-07T19:48:01.0169683Z #define __cpp_exceptions 199711L 2025-05-07T19:48:01.0169774Z #define __FLT_MIN_10_EXP__ (-37) 2025-05-07T19:48:01.0169890Z #define cudaDeviceScheduleYield 0x02 2025-05-07T19:48:01.0169976Z #define _SYS_SYSMACROS_H 1 2025-05-07T19:48:01.0170086Z #define _GLIBCXX_TR1_LEGENDRE_FUNCTION_TCC 1 2025-05-07T19:48:01.0170237Z #define __FLT64_MIN__ 2.22507385850720138309023271733240406e-308F64 2025-05-07T19:48:01.0170337Z #define __INT_LEAST32_WIDTH__ 32 2025-05-07T19:48:01.0170430Z #define __SWORD_TYPE long int 2025-05-07T19:48:01.0170519Z #define __INTMAX_TYPE__ long int 2025-05-07T19:48:01.0170619Z #define _GLIBCXX11_USE_C99_MATH 1 2025-05-07T19:48:01.0170708Z #define __PTHREAD_SPINS 0, 0 2025-05-07T19:48:01.0170844Z #define _BITS_POSIX1_LIM_H 1 2025-05-07T19:48:01.0170931Z #define __DEC128_MAX_EXP__ 6145 2025-05-07T19:48:01.0171080Z #define math_errhandling (MATH_ERRNO | MATH_ERREXCEPT) 2025-05-07T19:48:01.0171157Z #define _T_SIZE 2025-05-07T19:48:01.0171252Z #define cudaHostAllocDefault 0x00 2025-05-07T19:48:01.0171385Z #define __va_arg_pack() __builtin_va_arg_pack () 2025-05-07T19:48:01.0171478Z #define _POSIX_TIMER_MAX 32 2025-05-07T19:48:01.0171567Z #define _GLIBCXX_HAVE_TLS 1 2025-05-07T19:48:01.0171654Z #define _GLIBCXX_HAVE_ACOSL 1 2025-05-07T19:48:01.0171755Z #define __FLT32X_HAS_QUIET_NAN__ 1 2025-05-07T19:48:01.0171842Z #define __ATOMIC_CONSUME 1 2025-05-07T19:48:01.0171960Z #define _IO_stdout ((_IO_FILE*)(&_IO_2_1_stdout_)) 2025-05-07T19:48:01.0172058Z #define __GNUC_MINOR__ 4 2025-05-07T19:48:01.0172155Z #define __GLIBCXX_TYPE_INT_N_0 __int128 2025-05-07T19:48:01.0172242Z #define __INT_FAST16_WIDTH__ 64 2025-05-07T19:48:01.0172352Z #define __UINTMAX_MAX__ 0xffffffffffffffffUL 2025-05-07T19:48:01.0172445Z #define __PIE__ 2 2025-05-07T19:48:01.0172542Z #define LITTLE_ENDIAN __LITTLE_ENDIAN 2025-05-07T19:48:01.0172728Z #define __FLT32X_DENORM_MIN__ 4.94065645841246544176568792868221372e-324F32x 2025-05-07T19:48:01.0172835Z #define _GLIBCXX_HAVE_COMPLEX_H 1 2025-05-07T19:48:01.0172936Z #define __glibcxx_requires_subscript(_N) 2025-05-07T19:48:01.0173099Z #define __NFDBITS (8 * (int) sizeof (__fd_mask)) 2025-05-07T19:48:01.0173193Z #define __nlink_t_defined 2025-05-07T19:48:01.0173311Z #define _GLIBCXX17_DEPRECATED [[__deprecated__]] 2025-05-07T19:48:01.0173416Z #define _PSTL_STRING(x) _PSTL_STRING_AUX(x) 2025-05-07T19:48:01.0173502Z #define _XOPEN_LIM_H 1 2025-05-07T19:48:01.0173768Z #define __u_intN_t(N,MODE) typedef unsigned int u_int ##N ##_t __attribute__ ((__mode__ (MODE))) 2025-05-07T19:48:01.0173879Z #define __cpp_template_template_args 201611L 2025-05-07T19:48:01.0173977Z #define _GTHREAD_USE_MUTEX_TIMEDLOCK 1 2025-05-07T19:48:01.0174086Z #define BC_DIM_MAX _POSIX2_BC_DIM_MAX 2025-05-07T19:48:01.0174174Z #define __DBL_MAX_10_EXP__ 308 2025-05-07T19:48:01.0174262Z #define __FILE_defined 1 2025-05-07T19:48:01.0174431Z #define __LDBL_DENORM_MIN__ 3.64519953188247460252840593361941982e-4951L 2025-05-07T19:48:01.0174535Z #define __USE_XOPEN_EXTENDED 1 2025-05-07T19:48:01.0174635Z #define __cpp_lib_tuple_element_t 201402L 2025-05-07T19:48:01.0174738Z #define cudaInvalidDeviceId ((int)-2) 2025-05-07T19:48:01.0174847Z #define _GLIBCXX_HAVE_SYS_RESOURCE_H 1 2025-05-07T19:48:01.0174920Z #define __INT16_C(c) c 2025-05-07T19:48:01.0175009Z #define __U32_TYPE unsigned int 2025-05-07T19:48:01.0175098Z #define __SIZEOF_PTHREAD_RWLOCK_T 56 2025-05-07T19:48:01.0175201Z #define _GLIBCXX_HAVE_SYS_IOCTL_H 1 2025-05-07T19:48:01.0175313Z #define FD_CLR(fd,fdsetp) __FD_CLR (fd, fdsetp) 2025-05-07T19:48:01.0175387Z #define __STDC__ 1 2025-05-07T19:48:01.0175488Z #define _GLIBCXX_HAVE_VWSCANF 1 2025-05-07T19:48:01.0175575Z #define _GLIBCXX_USE_REALPATH 1 2025-05-07T19:48:01.0175714Z #define __attribute_malloc__ __attribute__ ((__malloc__)) 2025-05-07T19:48:01.0175818Z #define __FLT32X_DIG__ 15 2025-05-07T19:48:01.0175909Z #define _GLIBCXX_USE_C99_CTYPE_TR1 1 2025-05-07T19:48:01.0175997Z #define __PTRDIFF_TYPE__ long int 2025-05-07T19:48:01.0176097Z #define cudaArrayDeferredMapping 0x80 2025-05-07T19:48:01.0176212Z #define _GLIBCXX_END_NAMESPACE_CONTAINER 2025-05-07T19:48:01.0176306Z #define __cpp_lib_is_swappable 201603 2025-05-07T19:48:01.0176384Z #define stdin stdin 2025-05-07T19:48:01.0176483Z #define __ino64_t_defined 2025-05-07T19:48:01.0176566Z #define STA_UNSYNC 0x0040 2025-05-07T19:48:01.0176648Z #define __clockid_t_defined 1 2025-05-07T19:48:01.0176784Z #define _GLIBCXX_NOEXCEPT_IF(...) noexcept(__VA_ARGS__) 2025-05-07T19:48:01.0176951Z #define __attribute_noinline__ __attribute__ ((__noinline__)) 2025-05-07T19:48:01.0177047Z #define _PSTL_PRAGMA_SIMD_SCAN(PRM) 2025-05-07T19:48:01.0177140Z #define _GLIBCXX_BEGIN_NAMESPACE_LDBL 2025-05-07T19:48:01.0177250Z #define _GLIBCXX_TR1_POLY_HERMITE_TCC 1 2025-05-07T19:48:01.0177493Z #define __FD_SET(d,set) ((void) (__FDS_BITS (set)[__FD_ELT (d)] |= __FD_MASK (d))) 2025-05-07T19:48:01.0177571Z #define _IO_HEX 0100 2025-05-07T19:48:01.0177653Z #define __ATOMIC_SEQ_CST 5 2025-05-07T19:48:01.0177742Z #define DOMAIN 1 2025-05-07T19:48:01.0177827Z #define M_LN2 0.69314718055994530942 2025-05-07T19:48:01.0177906Z #define __NVCC__ 1 2025-05-07T19:48:01.0178025Z #define __CLOCK_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:48:01.0178119Z #define _PSTL_PRAGMA_SIMD_EARLYEXIT 2025-05-07T19:48:01.0178214Z #define __throw_exception_again throw 2025-05-07T19:48:01.0178299Z #define M_SQRT2 1.41421356237309504880 2025-05-07T19:48:01.0178395Z #define __EXCEPTION_H 1 2025-05-07T19:48:01.0178486Z #define __FLT32X_MIN_10_EXP__ (-307) 2025-05-07T19:48:01.0178575Z #define HUGE_VAL (__builtin_huge_val()) 2025-05-07T19:48:01.0178868Z #define cudaStreamAttributeAccessPolicyWindow cudaLaunchAttributeAccessPolicyWindow 2025-05-07T19:48:01.0178971Z #define __UINTPTR_TYPE__ long unsigned int 2025-05-07T19:48:01.0179063Z #define _GLIBCXX_INLINE_VERSION 0 2025-05-07T19:48:01.0179160Z #define _GLIBCXX_USE_INT128 1 2025-05-07T19:48:01.0179253Z #define __cpp_lib_bool_constant 201505 2025-05-07T19:48:01.0179342Z #define PTHREAD_KEYS_MAX 1024 2025-05-07T19:48:01.0179472Z #define __DEC64_SUBNORMAL_MIN__ 0.000000000000001E-383DD 2025-05-07T19:48:01.0179625Z #define __FSFILCNT64_T_TYPE __UQUAD_TYPE 2025-05-07T19:48:01.0179726Z #define _GLIBCXX_DOUBLE_IS_IEEE_BINARY64 1 2025-05-07T19:48:01.0179812Z #define __DEC128_MANT_DIG__ 34 2025-05-07T19:48:01.0179917Z #define __cpp_lib_tuples_by_type 201304 2025-05-07T19:48:01.0180006Z #define __LDBL_MIN_10_EXP__ (-4931) 2025-05-07T19:48:01.0180102Z #define __cpp_generic_lambdas 201304L 2025-05-07T19:48:01.0180227Z #define _GLIBCXX_THROW_OR_ABORT(_EXC) (throw (_EXC)) 2025-05-07T19:48:01.0180321Z #define __useconds_t_defined 2025-05-07T19:48:01.0180490Z #define __attribute_deprecated__ __attribute__ ((__deprecated__)) 2025-05-07T19:48:01.0180628Z #define __cpp_lib_type_trait_variable_templates 201510L 2025-05-07T19:48:01.0180721Z #define __SSE_MATH__ 1 2025-05-07T19:48:01.0180802Z #define _IO_wint_t wint_t 2025-05-07T19:48:01.0180893Z #define __OFF_T_MATCHES_OFF64_T 1 2025-05-07T19:48:01.0180982Z #define __SIZEOF_LONG_LONG__ 8 2025-05-07T19:48:01.0181077Z #define _GLIBCXX_VERBOSE 1 2025-05-07T19:48:01.0181191Z #define _GLIBCXX_NOTHROW _GLIBCXX_USE_NOEXCEPT 2025-05-07T19:48:01.0181283Z #define _GLIBCXX_HAVE_ISINFF 1 2025-05-07T19:48:01.0181378Z #define _GLIBCXX_HAVE_ASINF 1 2025-05-07T19:48:01.0181484Z #define __cpp_user_defined_literals 200809L 2025-05-07T19:48:01.0181571Z #define _GLIBCXX_HAVE_ISINFL 1 2025-05-07T19:48:01.0181656Z #define _GLIBCXX_HAVE_ASINL 1 2025-05-07T19:48:01.0181745Z #define __USE_ATFILE 1 2025-05-07T19:48:01.0181833Z #define _POSIX_OPEN_MAX 20 2025-05-07T19:48:01.0181923Z #define _POSIX_LOGIN_NAME_MAX 9 2025-05-07T19:48:01.0182017Z #define _GCC_PTRDIFF_T 2025-05-07T19:48:01.0182224Z #define cudaKernelNodeAttributePriority cudaLaunchAttributePriority 2025-05-07T19:48:01.0182316Z #define __FLT128_DECIMAL_DIG__ 36 2025-05-07T19:48:01.0182410Z #define _POSIX_THREAD_KEYS_MAX 128 2025-05-07T19:48:01.0182518Z #define __GCC_ATOMIC_LLONG_LOCK_FREE 2 2025-05-07T19:48:01.0182616Z #define __cpp_lib_array_constexpr 201803L 2025-05-07T19:48:01.0182692Z #define _STDLIB_H 1 2025-05-07T19:48:01.0182789Z #define __FLT32_HAS_QUIET_NAN__ 1 2025-05-07T19:48:01.0182875Z #define __FLT_DECIMAL_DIG__ 9 2025-05-07T19:48:01.0182991Z #define __UINT_FAST16_MAX__ 0xffffffffffffffffUL 2025-05-07T19:48:01.0183104Z #define __SURFACE_INDIRECT_FUNCTIONS_H__ 2025-05-07T19:48:01.0183189Z #define __SM_61_INTRINSICS_H__ 2025-05-07T19:48:01.0183358Z #define _GLIBCXX_PACKAGE_STRING "package-unused version-unused" 2025-05-07T19:48:01.0183440Z #define __WALL 0x40000000 2025-05-07T19:48:01.0183548Z #define __glibcxx_requires_nonempty() 2025-05-07T19:48:01.0183653Z #define w_stopsig __wait_stopped.__w_stopsig 2025-05-07T19:48:01.0183733Z #define __ldiv_t_defined 1 2025-05-07T19:48:01.0183980Z #define __glibcxx_requires_irreflexive_pred(_First,_Last,_Pred) 2025-05-07T19:48:01.0184063Z #define ___int_ptrdiff_t_h 2025-05-07T19:48:01.0184220Z #define __LDBL_NORM_MAX__ 1.18973149535723176502126385303097021e+4932L 2025-05-07T19:48:01.0184304Z #define __HOST_DEFINES_H__ 2025-05-07T19:48:01.0184412Z #define __GCC_ATOMIC_SHORT_LOCK_FREE 2 2025-05-07T19:48:01.0184507Z #define __SM_20_ATOMIC_FUNCTIONS_H__ 2025-05-07T19:48:01.0184598Z #define _GLIBCXX_USE_NANOSLEEP 1 2025-05-07T19:48:01.0184688Z #define CUDART_CB 2025-05-07T19:48:01.0184780Z #define BC_BASE_MAX _POSIX2_BC_BASE_MAX 2025-05-07T19:48:01.0184861Z #define MB_LEN_MAX 16 2025-05-07T19:48:01.0185074Z #define __glibcxx_requires_partitioned_lower_pred(_First,_Last,_Value,_Pred) 2025-05-07T19:48:01.0185183Z #define _GLIBCXX11_USE_C99_WCHAR 1 2025-05-07T19:48:01.0185303Z #define _IO_peekc(_fp) _IO_peekc_unlocked (_fp) 2025-05-07T19:48:01.0185393Z #define _GLIBCXX_HAVE_XLOCALE_H 1 2025-05-07T19:48:01.0185512Z #define _GLIBCXX_HAVE_AS_SYMVER_DIRECTIVE 1 2025-05-07T19:48:01.0185603Z #define _GLIBCXX_HAVE_UNISTD_H 1 2025-05-07T19:48:01.0185742Z #define __glibc_likely(cond) __builtin_expect((cond), 1) 2025-05-07T19:48:01.0185943Z #define __FD_ISSET(d,set) ((__FDS_BITS (set)[__FD_ELT (d)] & __FD_MASK (d)) != 0) 2025-05-07T19:48:01.0186041Z #define __UINT_FAST8_TYPE__ unsigned char 2025-05-07T19:48:01.0186170Z #define _GNU_SOURCE 1 2025-05-07T19:48:01.0186248Z #define __stub_putmsg 2025-05-07T19:48:01.0186337Z #define __CUDACC__ 1 2025-05-07T19:48:01.0186418Z #define __N(msgid) (msgid) 2025-05-07T19:48:01.0186495Z #define __P(args) args 2025-05-07T19:48:01.0186744Z #define cudaKernelNodeAttributeCooperative cudaLaunchAttributeCooperative 2025-05-07T19:48:01.0186839Z #define __cpp_init_captures 201304L 2025-05-07T19:48:01.0186925Z #define __SLONGWORD_TYPE long int 2025-05-07T19:48:01.0187009Z #define __ATOMIC_ACQ_REL 4 2025-05-07T19:48:01.0187109Z #define __cpp_lib_as_const 201510 2025-05-07T19:48:01.0187185Z #define __WCHAR_T 2025-05-07T19:48:01.0187265Z #define __ATOMIC_RELEASE 3 2025-05-07T19:48:01.0187371Z #define __CUDA_SURFACE_TYPES_H__ 2025-05-07T19:48:01.0187459Z #define __fsblkcnt_t_defined 2025-05-07T19:48:01.0187558Z #define __DEVICE_DOUBLE_FUNCTIONS_H__ 2025-05-07T19:48:01.0187646Z #define _GLIBCXX_NAMESPACE_LDBL 2025-05-07T19:48:01.0187653Z 2025-05-07T19:48:01.0553465Z 2025-05-07T19:48:01.0553827Z + conda run -n build_binary nvcc --version 2025-05-07T19:48:01.0553836Z 2025-05-07T19:48:02.8344404Z nvcc: NVIDIA (R) Cuda compiler driver 2025-05-07T19:48:02.8346351Z Copyright (c) 2005-2022 NVIDIA Corporation 2025-05-07T19:48:02.8347414Z Built on Wed_Sep_21_10:33:58_PDT_2022 2025-05-07T19:48:02.8348335Z Cuda compilation tools, release 11.8, V11.8.89 2025-05-07T19:48:02.8349286Z Build cuda_11.8.r11.8/compiler.31833905_0 2025-05-07T19:48:02.8349922Z 2025-05-07T19:48:02.9086547Z 2025-05-07T19:48:02.9099172Z which: no nvidia-smi in (CONDA=/github/home/miniconda:/github/home/miniconda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin) 2025-05-07T19:48:02.9100639Z [CHECK] nvidia-smi not found 2025-05-07T19:48:02.9100967Z [INSTALL] Successfully installed CUDA 11.8.0 2025-05-07T19:48:02.9199186Z ##[group]Run . $PRELUDE; install_pytorch_pip $BUILD_ENV nightly cuda/11.8.0 2025-05-07T19:48:02.9199849Z . $PRELUDE; install_pytorch_pip $BUILD_ENV nightly cuda/11.8.0 2025-05-07T19:48:02.9200536Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:48:02.9200919Z env: 2025-05-07T19:48:02.9201196Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:48:02.9201535Z BUILD_ENV: build_binary 2025-05-07T19:48:02.9201842Z BUILD_TARGET: default 2025-05-07T19:48:02.9202100Z BUILD_VARIANT: cuda 2025-05-07T19:48:02.9202385Z BUILD_CUDA_VERSION: 11.8.0 2025-05-07T19:48:02.9202660Z ##[endgroup] 2025-05-07T19:48:03.3956207Z ################################################################################ 2025-05-07T19:48:03.3956578Z # Install PyTorch (PIP) 2025-05-07T19:48:03.3956831Z # 2025-05-07T19:48:03.3975225Z # [2025-05-07T19:48:03.396Z] + install_pytorch_pip build_binary nightly cuda/11.8.0 2025-05-07T19:48:03.3975790Z ################################################################################ 2025-05-07T19:48:03.3976031Z 2025-05-07T19:48:03.4004803Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y numpy 2025-05-07T19:48:04.2765984Z Channels: 2025-05-07T19:48:04.2766527Z - conda-forge 2025-05-07T19:48:04.2766825Z Platform: linux-64 2025-05-07T19:48:13.8628633Z Collecting package metadata (repodata.json): - \ | / - \ | / - \ | / - \ | / - done 2025-05-07T19:48:15.3216140Z Solving environment: | / - done 2025-05-07T19:48:15.4871780Z 2025-05-07T19:48:15.4872650Z ## Package Plan ## 2025-05-07T19:48:15.4873421Z 2025-05-07T19:48:15.4874467Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:48:15.4875283Z 2025-05-07T19:48:15.4875397Z added / updated specs: 2025-05-07T19:48:15.4875693Z - numpy 2025-05-07T19:48:15.4875822Z 2025-05-07T19:48:15.4875827Z 2025-05-07T19:48:15.4875965Z The following packages will be downloaded: 2025-05-07T19:48:15.4876223Z 2025-05-07T19:48:15.4876376Z package | build 2025-05-07T19:48:15.4876723Z ---------------------------|----------------- 2025-05-07T19:48:15.4877160Z libblas-3.9.0 |31_h59b9bed_openblas 16 KB conda-forge 2025-05-07T19:48:15.4878071Z libcblas-3.9.0 |31_he106b2a_openblas 16 KB conda-forge 2025-05-07T19:48:15.4878697Z liblapack-3.9.0 |31_h7ac8fdf_openblas 16 KB conda-forge 2025-05-07T19:48:15.4879167Z numpy-2.2.5 | py313h17eae1a_0 8.1 MB conda-forge 2025-05-07T19:48:15.4879565Z ------------------------------------------------------------ 2025-05-07T19:48:15.4879942Z Total: 8.2 MB 2025-05-07T19:48:15.4880190Z 2025-05-07T19:48:15.4880390Z The following NEW packages will be INSTALLED: 2025-05-07T19:48:15.4880999Z 2025-05-07T19:48:15.4881442Z libblas conda-forge/linux-64::libblas-3.9.0-31_h59b9bed_openblas 2025-05-07T19:48:15.4882111Z libcblas conda-forge/linux-64::libcblas-3.9.0-31_he106b2a_openblas 2025-05-07T19:48:15.4882645Z liblapack conda-forge/linux-64::liblapack-3.9.0-31_h7ac8fdf_openblas 2025-05-07T19:48:15.4883183Z numpy conda-forge/linux-64::numpy-2.2.5-py313h17eae1a_0 2025-05-07T19:48:15.4883462Z 2025-05-07T19:48:15.4883466Z 2025-05-07T19:48:15.4883470Z 2025-05-07T19:48:15.4883620Z Downloading and Extracting Packages: ...working... 2025-05-07T19:48:15.4884028Z numpy-2.2.5 | 8.1 MB | | 0% 2025-05-07T19:48:15.4884273Z 2025-05-07T19:48:15.4884605Z libblas-3.9.0 | 16 KB | | 0%  2025-05-07T19:48:15.4884848Z 2025-05-07T19:48:15.4884852Z 2025-05-07T19:48:15.4894844Z libcblas-3.9.0 | 16 KB | | 0%  2025-05-07T19:48:15.4896235Z 2025-05-07T19:48:15.4896247Z 2025-05-07T19:48:15.4896258Z 2025-05-07T19:48:15.5495740Z liblapack-3.9.0 | 16 KB | | 0%  2025-05-07T19:48:15.5496141Z 2025-05-07T19:48:15.5562915Z libblas-3.9.0 | 16 KB | ########## | 100%  2025-05-07T19:48:15.5563216Z 2025-05-07T19:48:15.5563222Z 2025-05-07T19:48:15.5564305Z 2025-05-07T19:48:15.5720921Z liblapack-3.9.0 | 16 KB | ########## | 100%  2025-05-07T19:48:15.5721243Z 2025-05-07T19:48:15.5775169Z libblas-3.9.0 | 16 KB | ########## | 100%  2025-05-07T19:48:15.5776584Z 2025-05-07T19:48:15.5776610Z 2025-05-07T19:48:15.5776628Z 2025-05-07T19:48:15.5813344Z liblapack-3.9.0 | 16 KB | ########## | 100%  2025-05-07T19:48:15.5813834Z 2025-05-07T19:48:15.5813839Z 2025-05-07T19:48:15.5872509Z libcblas-3.9.0 | 16 KB | ########## | 100%  2025-05-07T19:48:15.5925708Z numpy-2.2.5 | 8.1 MB | #######3 | 73% 2025-05-07T19:48:15.5926120Z 2025-05-07T19:48:15.5926128Z 2025-05-07T19:48:15.5926444Z libcblas-3.9.0 | 16 KB | ########## | 100%  2025-05-07T19:48:15.5926714Z 2025-05-07T19:48:15.5926733Z 2025-05-07T19:48:15.6286299Z libcblas-3.9.0 | 16 KB | ########## | 100%  2025-05-07T19:48:15.9415794Z numpy-2.2.5 | 8.1 MB | ########## | 100% 2025-05-07T19:48:15.9418147Z numpy-2.2.5 | 8.1 MB | ########## | 100% 2025-05-07T19:48:15.9419133Z 2025-05-07T19:48:15.9419758Z 2025-05-07T19:48:15.9420407Z  2025-05-07T19:48:15.9420633Z 2025-05-07T19:48:15.9420637Z 2025-05-07T19:48:15.9420814Z  2025-05-07T19:48:15.9421069Z 2025-05-07T19:48:15.9421073Z 2025-05-07T19:48:15.9421077Z 2025-05-07T19:48:15.9422131Z  done 2025-05-07T19:48:16.0431572Z Preparing transaction: | done 2025-05-07T19:48:16.1434440Z Verifying transaction: - done 2025-05-07T19:48:16.2444470Z Executing transaction: | done 2025-05-07T19:48:16.3449276Z ################################################################################ 2025-05-07T19:48:16.3450218Z # Install Package From PyTorch PIP: torch 2025-05-07T19:48:16.3450647Z # 2025-05-07T19:48:16.3485977Z # [2025-05-07T19:48:16.346Z] + install_from_pytorch_pip build_binary torch nightly cuda/11.8.0 2025-05-07T19:48:16.3487515Z ################################################################################ 2025-05-07T19:48:16.3487861Z 2025-05-07T19:48:16.3488092Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:48:16.4329407Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:48:16.4330390Z ################################################################################ 2025-05-07T19:48:16.4330847Z # Prepare PIP Arguments (PyTorch PIP) 2025-05-07T19:48:16.4331168Z # 2025-05-07T19:48:16.4347876Z # [2025-05-07T19:48:16.434Z] + __prepare_pip_arguments torch nightly cuda/11.8.0 2025-05-07T19:48:16.4348409Z ################################################################################ 2025-05-07T19:48:16.4348649Z 2025-05-07T19:48:16.4374848Z [INSTALL] Extracted package (channel, version): (nightly, LATEST) 2025-05-07T19:48:16.4401160Z [INSTALL] Extracted package variant: cu118 2025-05-07T19:48:16.4419613Z [INSTALL] Using a non-RELEASE channel: nightly ... 2025-05-07T19:48:16.4420261Z [INSTALL] Extracted the full PIP channel: https://download.pytorch.org/whl/nightly/cu118/ 2025-05-07T19:48:16.4425643Z [INSTALL] Extracted the full PIP package: --pre torch 2025-05-07T19:48:16.4434265Z [INSTALL] Attempting to install [torch, LATEST] from PyTorch PIP using channel https://download.pytorch.org/whl/nightly/cu118/ ... 2025-05-07T19:48:16.4460850Z [EXEC] [ATTEMPT 0/3] + conda run -n build_binary pip install --pre torch --index-url https://download.pytorch.org/whl/nightly/cu118/ 2025-05-07T19:49:38.1544039Z WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager, possibly rendering your system unusable. It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv. Use the --root-user-action option if you know what you are doing and want to suppress this warning. 2025-05-07T19:49:38.1545692Z 2025-05-07T19:49:38.1545918Z Looking in indexes: https://download.pytorch.org/whl/nightly/cu118/ 2025-05-07T19:49:38.1546344Z Collecting torch 2025-05-07T19:49:38.1547126Z Downloading https://download.pytorch.org/whl/nightly/cu118/torch-2.8.0.dev20250507%2Bcu118-cp313-cp313-manylinux_2_28_x86_64.whl.metadata (29 kB) 2025-05-07T19:49:38.1547864Z Collecting filelock (from torch) 2025-05-07T19:49:38.1548364Z Downloading https://download.pytorch.org/whl/nightly/filelock-3.16.1-py3-none-any.whl (16 kB) 2025-05-07T19:49:38.1549340Z Requirement already satisfied: typing-extensions>=4.10.0 in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from torch) (4.13.2) 2025-05-07T19:49:38.1550468Z Requirement already satisfied: setuptools in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from torch) (78.1.1) 2025-05-07T19:49:38.1551420Z Collecting sympy>=1.13.3 (from torch) 2025-05-07T19:49:38.1551968Z Downloading https://download.pytorch.org/whl/nightly/sympy-1.13.3-py3-none-any.whl (6.2 MB) 2025-05-07T19:49:38.1552848Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 6.2/6.2 MB 260.3 MB/s eta 0:00:00 2025-05-07T19:49:38.1553223Z Collecting networkx (from torch) 2025-05-07T19:49:38.1553747Z Downloading https://download.pytorch.org/whl/nightly/networkx-3.4.2-py3-none-any.whl (1.7 MB) 2025-05-07T19:49:38.1554417Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 1.7/1.7 MB 167.4 MB/s eta 0:00:00 2025-05-07T19:49:38.1555148Z Requirement already satisfied: jinja2 in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from torch) (3.1.6) 2025-05-07T19:49:38.1555823Z Collecting fsspec (from torch) 2025-05-07T19:49:38.1556348Z Downloading https://download.pytorch.org/whl/nightly/fsspec-2024.10.0-py3-none-any.whl (179 kB) 2025-05-07T19:49:38.1556973Z Collecting nvidia-cuda-nvrtc-cu11==11.8.89 (from torch) 2025-05-07T19:49:38.1557706Z Downloading https://download.pytorch.org/whl/nightly/cu118/nvidia_cuda_nvrtc_cu11-11.8.89-py3-none-manylinux1_x86_64.whl (23.2 MB) 2025-05-07T19:49:38.1558732Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 23.2/23.2 MB 223.3 MB/s eta 0:00:00 2025-05-07T19:49:38.1559149Z Collecting nvidia-cuda-runtime-cu11==11.8.89 (from torch) 2025-05-07T19:49:38.1559912Z Downloading https://download.pytorch.org/whl/nightly/cu118/nvidia_cuda_runtime_cu11-11.8.89-py3-none-manylinux1_x86_64.whl (875 kB) 2025-05-07T19:49:38.1560733Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 875.6/875.6 kB 108.7 MB/s eta 0:00:00 2025-05-07T19:49:38.1561158Z Collecting nvidia-cuda-cupti-cu11==11.8.87 (from torch) 2025-05-07T19:49:38.1561901Z Downloading https://download.pytorch.org/whl/nightly/cu118/nvidia_cuda_cupti_cu11-11.8.87-py3-none-manylinux1_x86_64.whl (13.1 MB) 2025-05-07T19:49:38.1562725Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 13.1/13.1 MB 225.4 MB/s eta 0:00:00 2025-05-07T19:49:38.1563127Z Collecting nvidia-cudnn-cu11==9.1.0.70 (from torch) 2025-05-07T19:49:38.1563931Z Downloading https://download.pytorch.org/whl/nightly/cu118/nvidia_cudnn_cu11-9.1.0.70-py3-none-manylinux2014_x86_64.whl (663.9 MB) 2025-05-07T19:49:38.1564828Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 663.9/663.9 MB 50.9 MB/s eta 0:00:00 2025-05-07T19:49:38.1565206Z Collecting nvidia-cublas-cu11==11.11.3.6 (from torch) 2025-05-07T19:49:38.1565860Z Downloading https://download.pytorch.org/whl/nightly/cu118/nvidia_cublas_cu11-11.11.3.6-py3-none-manylinux1_x86_64.whl (417.9 MB) 2025-05-07T19:49:38.1566621Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 417.9/417.9 MB 78.7 MB/s eta 0:00:00 2025-05-07T19:49:38.1566981Z Collecting nvidia-cufft-cu11==10.9.0.58 (from torch) 2025-05-07T19:49:38.1567664Z Downloading https://download.pytorch.org/whl/nightly/cu118/nvidia_cufft_cu11-10.9.0.58-py3-none-manylinux1_x86_64.whl (168.4 MB) 2025-05-07T19:49:38.1568494Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 168.4/168.4 MB 193.1 MB/s eta 0:00:00 2025-05-07T19:49:38.1568863Z Collecting nvidia-curand-cu11==10.3.0.86 (from torch) 2025-05-07T19:49:38.1569534Z Downloading https://download.pytorch.org/whl/nightly/cu118/nvidia_curand_cu11-10.3.0.86-py3-none-manylinux1_x86_64.whl (58.1 MB) 2025-05-07T19:49:38.1570286Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 58.1/58.1 MB 175.5 MB/s eta 0:00:00 2025-05-07T19:49:38.1570673Z Collecting nvidia-cusolver-cu11==11.4.1.48 (from torch) 2025-05-07T19:49:38.1571366Z Downloading https://download.pytorch.org/whl/nightly/cu118/nvidia_cusolver_cu11-11.4.1.48-py3-none-manylinux1_x86_64.whl (128.2 MB) 2025-05-07T19:49:38.1572117Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 128.2/128.2 MB 179.8 MB/s eta 0:00:00 2025-05-07T19:49:38.1572503Z Collecting nvidia-cusparse-cu11==11.7.5.86 (from torch) 2025-05-07T19:49:38.1573178Z Downloading https://download.pytorch.org/whl/nightly/cu118/nvidia_cusparse_cu11-11.7.5.86-py3-none-manylinux1_x86_64.whl (204.1 MB) 2025-05-07T19:49:38.1573956Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 204.1/204.1 MB 158.6 MB/s eta 0:00:00 2025-05-07T19:49:38.1574307Z Collecting nvidia-nccl-cu11==2.21.5 (from torch) 2025-05-07T19:49:38.1574965Z Downloading https://download.pytorch.org/whl/nightly/cu118/nvidia_nccl_cu11-2.21.5-py3-none-manylinux2014_x86_64.whl (147.8 MB) 2025-05-07T19:49:38.1575732Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 147.8/147.8 MB 135.1 MB/s eta 0:00:00 2025-05-07T19:49:38.1576083Z Collecting nvidia-nvtx-cu11==11.8.86 (from torch) 2025-05-07T19:49:38.1576719Z Downloading https://download.pytorch.org/whl/nightly/cu118/nvidia_nvtx_cu11-11.8.86-py3-none-manylinux1_x86_64.whl (99 kB) 2025-05-07T19:49:38.1577376Z Collecting pytorch-triton==3.3.0+git96316ce5 (from torch) 2025-05-07T19:49:38.1578207Z Downloading https://download.pytorch.org/whl/nightly/pytorch_triton-3.3.0%2Bgit96316ce5-cp313-cp313-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl.metadata (1.6 kB) 2025-05-07T19:49:38.1579039Z Collecting mpmath<1.4,>=1.1.0 (from sympy>=1.13.3->torch) 2025-05-07T19:49:38.1579568Z Downloading https://download.pytorch.org/whl/nightly/mpmath-1.3.0-py3-none-any.whl (536 kB) 2025-05-07T19:49:38.1580195Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 536.2/536.2 kB 48.5 MB/s eta 0:00:00 2025-05-07T19:49:38.1580990Z Requirement already satisfied: MarkupSafe>=2.0 in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from jinja2->torch) (3.0.2) 2025-05-07T19:49:38.1582061Z Downloading https://download.pytorch.org/whl/nightly/cu118/torch-2.8.0.dev20250507%2Bcu118-cp313-cp313-manylinux_2_28_x86_64.whl (916.2 MB) 2025-05-07T19:49:38.1582871Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 916.2/916.2 MB 28.4 MB/s eta 0:00:00 2025-05-07T19:49:38.1583642Z Downloading https://download.pytorch.org/whl/nightly/pytorch_triton-3.3.0%2Bgit96316ce5-cp313-cp313-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl (153.5 MB) 2025-05-07T19:49:38.1584513Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 153.5/153.5 MB 62.6 MB/s eta 0:00:00 2025-05-07T19:49:38.1585975Z Installing collected packages: mpmath, sympy, pytorch-triton, nvidia-nvtx-cu11, nvidia-nccl-cu11, nvidia-cusparse-cu11, nvidia-curand-cu11, nvidia-cufft-cu11, nvidia-cuda-runtime-cu11, nvidia-cuda-nvrtc-cu11, nvidia-cuda-cupti-cu11, nvidia-cublas-cu11, networkx, fsspec, filelock, nvidia-cusolver-cu11, nvidia-cudnn-cu11, torch 2025-05-07T19:49:38.1587246Z 2025-05-07T19:49:38.1588820Z Successfully installed filelock-3.16.1 fsspec-2024.10.0 mpmath-1.3.0 networkx-3.4.2 nvidia-cublas-cu11-11.11.3.6 nvidia-cuda-cupti-cu11-11.8.87 nvidia-cuda-nvrtc-cu11-11.8.89 nvidia-cuda-runtime-cu11-11.8.89 nvidia-cudnn-cu11-9.1.0.70 nvidia-cufft-cu11-10.9.0.58 nvidia-curand-cu11-10.3.0.86 nvidia-cusolver-cu11-11.4.1.48 nvidia-cusparse-cu11-11.7.5.86 nvidia-nccl-cu11-2.21.5 nvidia-nvtx-cu11-11.8.86 pytorch-triton-3.3.0+git96316ce5 sympy-1.13.3 torch-2.8.0.dev20250507+cu118 2025-05-07T19:49:38.1590673Z 2025-05-07T19:49:40.3534204Z torch 2.8.0.dev20250507+cu118 2025-05-07T19:49:40.3535440Z [CHECK] The installed package [torch, nightly/LATEST] is the correct variant (cu118) 2025-05-07T19:49:43.6411100Z [CHECK] Python (sub-)package 'torch.distributed' found ... 2025-05-07T19:49:46.9298470Z [CHECK] NOTE: The installed version is: 2.8.0.dev20250507+cu118 2025-05-07T19:49:46.9299039Z [CHECK] NOTE: Checking _GLIBCXX_USE_CXX11_ABI ... 2025-05-07T19:49:50.1479542Z True 2025-05-07T19:49:50.1480198Z True 2025-05-07T19:49:50.1480509Z 2025-05-07T19:49:50.2410545Z [INSTALL] Successfully installed PyTorch through PyTorch PIP 2025-05-07T19:49:50.2486763Z ##[group]Run if . $PRELUDE && which conda; then collect_pytorch_env_info $BUILD_ENV; fi 2025-05-07T19:49:50.2487409Z if . $PRELUDE && which conda; then collect_pytorch_env_info $BUILD_ENV; fi 2025-05-07T19:49:50.2488085Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:49:50.2488394Z env: 2025-05-07T19:49:50.2488638Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:49:50.2488934Z BUILD_ENV: build_binary 2025-05-07T19:49:50.2489197Z BUILD_TARGET: default 2025-05-07T19:49:50.2489422Z BUILD_VARIANT: cuda 2025-05-07T19:49:50.2489674Z BUILD_CUDA_VERSION: 11.8.0 2025-05-07T19:49:50.2489917Z ##[endgroup] 2025-05-07T19:49:50.7451277Z /github/home/miniconda/bin/conda 2025-05-07T19:49:50.7452003Z ################################################################################ 2025-05-07T19:49:50.7452465Z # Collect PyTorch Environment Information (for Reporting Issues) 2025-05-07T19:49:50.7452900Z # 2025-05-07T19:49:50.7467752Z # [2025-05-07T19:49:50.746Z] + collect_pytorch_env_info build_binary 2025-05-07T19:49:50.7468981Z ################################################################################ 2025-05-07T19:49:50.7469704Z 2025-05-07T19:49:50.7490411Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:49:50.8359324Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:49:50.8371964Z [INFO] Downloading the PyTorch environment info collection script ... 2025-05-07T19:49:50.8372716Z + wget -q https://raw.githubusercontent.com/pytorch/pytorch/main/torch/utils/collect_env.py 2025-05-07T19:49:50.8373155Z 2025-05-07T19:49:50.9246927Z 2025-05-07T19:49:50.9248247Z [INFO] Collecting PyTorch environment info (will be needed for reporting issues to PyTorch) ... 2025-05-07T19:49:50.9273059Z [EXEC] [ATTEMPT 0/3] + conda run -n build_binary python collect_env.py 2025-05-07T19:49:56.2376074Z Collecting environment information... 2025-05-07T19:49:56.2377096Z PyTorch version: 2.8.0.dev20250507+cu118 2025-05-07T19:49:56.2378047Z Is debug build: False 2025-05-07T19:49:56.2378764Z CUDA used to build PyTorch: 11.8 2025-05-07T19:49:56.2379594Z ROCM used to build PyTorch: N/A 2025-05-07T19:49:56.2380115Z 2025-05-07T19:49:56.2380432Z OS: Amazon Linux 2023.7.20250428 (x86_64) 2025-05-07T19:49:56.2381361Z GCC version: (conda-forge gcc 11.4.0-13) 11.4.0 2025-05-07T19:49:56.2382315Z Clang version: Could not collect 2025-05-07T19:49:56.2382802Z CMake version: version 4.0.2 2025-05-07T19:49:56.2383085Z Libc version: glibc-2.34 2025-05-07T19:49:56.2383245Z 2025-05-07T19:49:56.2383681Z Python version: 3.13.2 | packaged by conda-forge | (main, Feb 17 2025, 14:10:22) [GCC 13.3.0] (64-bit runtime) 2025-05-07T19:49:56.2384352Z Python platform: Linux-6.1.130-139.222.amzn2023.x86_64-x86_64-with-glibc2.34 2025-05-07T19:49:56.2384800Z Is CUDA available: False 2025-05-07T19:49:56.2385057Z CUDA runtime version: 11.8.89 2025-05-07T19:49:56.2385356Z CUDA_MODULE_LOADING set to: N/A 2025-05-07T19:49:56.2385677Z GPU models and configuration: Could not collect 2025-05-07T19:49:56.2386037Z Nvidia driver version: Could not collect 2025-05-07T19:49:56.2386356Z cuDNN version: Could not collect 2025-05-07T19:49:56.2386625Z HIP runtime version: N/A 2025-05-07T19:49:56.2386894Z MIOpen runtime version: N/A 2025-05-07T19:49:56.2387158Z Is XNNPACK available: True 2025-05-07T19:49:56.2387338Z 2025-05-07T19:49:56.2387421Z CPU: 2025-05-07T19:49:56.2387634Z Architecture: x86_64 2025-05-07T19:49:56.2387988Z CPU op-mode(s): 32-bit, 64-bit 2025-05-07T19:49:56.2388383Z Address sizes: 46 bits physical, 48 bits virtual 2025-05-07T19:49:56.2388791Z Byte Order: Little Endian 2025-05-07T19:49:56.2389130Z CPU(s): 96 2025-05-07T19:49:56.2389427Z On-line CPU(s) list: 0-95 2025-05-07T19:49:56.2389769Z Vendor ID: GenuineIntel 2025-05-07T19:49:56.2391018Z Model name: Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:49:56.2391546Z CPU family: 6 2025-05-07T19:49:56.2391837Z Model: 85 2025-05-07T19:49:56.2392148Z Thread(s) per core: 2 2025-05-07T19:49:56.2392443Z Core(s) per socket: 24 2025-05-07T19:49:56.2392752Z Socket(s): 2 2025-05-07T19:49:56.2393053Z Stepping: 7 2025-05-07T19:49:56.2393358Z BogoMIPS: 5999.99 2025-05-07T19:49:56.2395777Z Flags: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:49:56.2398420Z Hypervisor vendor: KVM 2025-05-07T19:49:56.2398716Z Virtualization type: full 2025-05-07T19:49:56.2399058Z L1d cache: 1.5 MiB (48 instances) 2025-05-07T19:49:56.2399428Z L1i cache: 1.5 MiB (48 instances) 2025-05-07T19:49:56.2399774Z L2 cache: 48 MiB (48 instances) 2025-05-07T19:49:56.2400133Z L3 cache: 71.5 MiB (2 instances) 2025-05-07T19:49:56.2400454Z NUMA node(s): 2 2025-05-07T19:49:56.2400774Z NUMA node0 CPU(s): 0-23,48-71 2025-05-07T19:49:56.2401100Z NUMA node1 CPU(s): 24-47,72-95 2025-05-07T19:49:56.2401554Z Vulnerability Gather data sampling: Unknown: Dependent on hypervisor status 2025-05-07T19:49:56.2402101Z Vulnerability Itlb multihit: KVM: Mitigation: VMX unsupported 2025-05-07T19:49:56.2402560Z Vulnerability L1tf: Mitigation; PTE Inversion 2025-05-07T19:49:56.2403139Z Vulnerability Mds: Vulnerable: Clear CPU buffers attempted, no microcode; SMT Host state unknown 2025-05-07T19:49:56.2403681Z Vulnerability Meltdown: Mitigation; PTI 2025-05-07T19:49:56.2404264Z Vulnerability Mmio stale data: Vulnerable: Clear CPU buffers attempted, no microcode; SMT Host state unknown 2025-05-07T19:49:56.2404831Z Vulnerability Reg file data sampling: Not affected 2025-05-07T19:49:56.2405198Z Vulnerability Retbleed: Vulnerable 2025-05-07T19:49:56.2405567Z Vulnerability Spec rstack overflow: Not affected 2025-05-07T19:49:56.2405919Z Vulnerability Spec store bypass: Vulnerable 2025-05-07T19:49:56.2406452Z Vulnerability Spectre v1: Mitigation; usercopy/swapgs barriers and __user pointer sanitization 2025-05-07T19:49:56.2407229Z Vulnerability Spectre v2: Mitigation; Retpolines; STIBP disabled; RSB filling; PBRSB-eIBRS Not affected; BHI Retpoline 2025-05-07T19:49:56.2407845Z Vulnerability Srbds: Not affected 2025-05-07T19:49:56.2408189Z Vulnerability Tsx async abort: Not affected 2025-05-07T19:49:56.2408430Z 2025-05-07T19:49:56.2408530Z Versions of relevant libraries: 2025-05-07T19:49:56.2408802Z [pip3] numpy==2.2.5 2025-05-07T19:49:56.2409030Z [pip3] nvidia-cublas-cu11==11.11.3.6 2025-05-07T19:49:56.2409338Z [pip3] nvidia-cuda-cupti-cu11==11.8.87 2025-05-07T19:49:56.2409632Z [pip3] nvidia-cuda-nvrtc-cu11==11.8.89 2025-05-07T19:49:56.2409945Z [pip3] nvidia-cuda-runtime-cu11==11.8.89 2025-05-07T19:49:56.2410243Z [pip3] nvidia-cudnn-cu11==9.1.0.70 2025-05-07T19:49:56.2410533Z [pip3] nvidia-cufft-cu11==10.9.0.58 2025-05-07T19:49:56.2410807Z [pip3] nvidia-curand-cu11==10.3.0.86 2025-05-07T19:49:56.2411107Z [pip3] nvidia-cusolver-cu11==11.4.1.48 2025-05-07T19:49:56.2411415Z [pip3] nvidia-cusparse-cu11==11.7.5.86 2025-05-07T19:49:56.2411828Z [pip3] nvidia-nccl-cu11==2.21.5 2025-05-07T19:49:56.2412118Z [pip3] nvidia-nvtx-cu11==11.8.86 2025-05-07T19:49:56.2412394Z [pip3] pytorch-triton==3.3.0+git96316ce5 2025-05-07T19:49:56.2412705Z [pip3] torch==2.8.0.dev20250507+cu118 2025-05-07T19:49:56.2413077Z [conda] cuda-cudart 11.8.89 0 nvidia/label/cuda-11.8.0 2025-05-07T19:49:56.2413596Z [conda] cuda-cudart-dev 11.8.89 0 nvidia/label/cuda-11.8.0 2025-05-07T19:49:56.2414081Z [conda] cuda-cupti 11.8.87 0 nvidia/label/cuda-11.8.0 2025-05-07T19:49:56.2414583Z [conda] cuda-libraries 11.8.0 0 nvidia/label/cuda-11.8.0 2025-05-07T19:49:56.2415170Z [conda] cuda-libraries-dev 11.8.0 0 nvidia/label/cuda-11.8.0 2025-05-07T19:49:56.2415665Z [conda] cuda-nvrtc 11.8.89 0 nvidia/label/cuda-11.8.0 2025-05-07T19:49:56.2416154Z [conda] cuda-nvrtc-dev 11.8.89 0 nvidia/label/cuda-11.8.0 2025-05-07T19:49:56.2416621Z [conda] cuda-nvtx 11.8.86 0 nvidia/label/cuda-11.8.0 2025-05-07T19:49:56.2417105Z [conda] cuda-runtime 11.8.0 0 nvidia/label/cuda-11.8.0 2025-05-07T19:49:56.2417593Z [conda] libcublas 11.11.3.6 0 nvidia/label/cuda-11.8.0 2025-05-07T19:49:56.2418071Z [conda] libcublas-dev 11.11.3.6 0 nvidia/label/cuda-11.8.0 2025-05-07T19:49:56.2418562Z [conda] libcufft 10.9.0.58 0 nvidia/label/cuda-11.8.0 2025-05-07T19:49:56.2419034Z [conda] libcufft-dev 10.9.0.58 0 nvidia/label/cuda-11.8.0 2025-05-07T19:49:56.2419526Z [conda] libcurand 10.3.0.86 0 nvidia/label/cuda-11.8.0 2025-05-07T19:49:56.2420022Z [conda] libcurand-dev 10.3.0.86 0 nvidia/label/cuda-11.8.0 2025-05-07T19:49:56.2420509Z [conda] libcusolver 11.4.1.48 0 nvidia/label/cuda-11.8.0 2025-05-07T19:49:56.2421020Z [conda] libcusolver-dev 11.4.1.48 0 nvidia/label/cuda-11.8.0 2025-05-07T19:49:56.2421520Z [conda] libcusparse 11.7.5.86 0 nvidia/label/cuda-11.8.0 2025-05-07T19:49:56.2422036Z [conda] libcusparse-dev 11.7.5.86 0 nvidia/label/cuda-11.8.0 2025-05-07T19:49:56.2422530Z [conda] numpy 2.2.5 py313h17eae1a_0 conda-forge 2025-05-07T19:49:56.2422982Z [conda] nvidia-cublas-cu11 11.11.3.6 pypi_0 pypi 2025-05-07T19:49:56.2423482Z [conda] nvidia-cuda-cupti-cu11 11.8.87 pypi_0 pypi 2025-05-07T19:49:56.2423961Z [conda] nvidia-cuda-nvrtc-cu11 11.8.89 pypi_0 pypi 2025-05-07T19:49:56.2424469Z [conda] nvidia-cuda-runtime-cu11 11.8.89 pypi_0 pypi 2025-05-07T19:49:56.2424943Z [conda] nvidia-cudnn-cu11 9.1.0.70 pypi_0 pypi 2025-05-07T19:49:56.2425414Z [conda] nvidia-cufft-cu11 10.9.0.58 pypi_0 pypi 2025-05-07T19:49:56.2425886Z [conda] nvidia-curand-cu11 10.3.0.86 pypi_0 pypi 2025-05-07T19:49:56.2426354Z [conda] nvidia-cusolver-cu11 11.4.1.48 pypi_0 pypi 2025-05-07T19:49:56.2426845Z [conda] nvidia-cusparse-cu11 11.7.5.86 pypi_0 pypi 2025-05-07T19:49:56.2427309Z [conda] nvidia-nccl-cu11 2.21.5 pypi_0 pypi 2025-05-07T19:49:56.2427797Z [conda] nvidia-nvtx-cu11 11.8.86 pypi_0 pypi 2025-05-07T19:49:56.2428261Z [conda] pytorch-triton 3.3.0+git96316ce5 pypi_0 pypi 2025-05-07T19:49:56.2428727Z [conda] torch 2.8.0.dev20250507+cu118 pypi_0 pypi 2025-05-07T19:49:56.2428996Z 2025-05-07T19:49:56.3251928Z ##[group]Run . $PRELUDE; install_cudnn $BUILD_ENV "$(pwd)/build_only/cudnn" 11.8.0 2025-05-07T19:49:56.3252600Z . $PRELUDE; install_cudnn $BUILD_ENV "$(pwd)/build_only/cudnn" 11.8.0 2025-05-07T19:49:56.3253206Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:49:56.3253565Z env: 2025-05-07T19:49:56.3253798Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:49:56.3254143Z BUILD_ENV: build_binary 2025-05-07T19:49:56.3254399Z BUILD_TARGET: default 2025-05-07T19:49:56.3254671Z BUILD_VARIANT: cuda 2025-05-07T19:49:56.3254939Z BUILD_CUDA_VERSION: 11.8.0 2025-05-07T19:49:56.3255190Z ##[endgroup] 2025-05-07T19:49:56.8200416Z ################################################################################ 2025-05-07T19:49:56.8201116Z # Install cuDNN 2025-05-07T19:49:56.8201355Z # 2025-05-07T19:49:56.8225068Z # [2025-05-07T19:49:56.821Z] + install_cudnn build_binary /__w/FBGEMM/FBGEMM/build_only/cudnn 11.8.0 2025-05-07T19:49:56.8226767Z ################################################################################ 2025-05-07T19:49:56.8227450Z 2025-05-07T19:49:56.8244577Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:49:56.9133974Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:49:56.9135199Z [INSTALL] cuda_concat_version is determined to be: 118 2025-05-07T19:49:56.9136311Z + rm -rf /__w/FBGEMM/FBGEMM/build_only/cudnn 2025-05-07T19:49:56.9136946Z 2025-05-07T19:49:56.9150234Z 2025-05-07T19:49:56.9150395Z + mkdir -p /__w/FBGEMM/FBGEMM/build_only/cudnn 2025-05-07T19:49:56.9167878Z 2025-05-07T19:49:56.9167894Z 2025-05-07T19:49:56.9192741Z [INSTALL] Downloading cuDNN to /tmp/tmp.ng3sw5l5Ky ... 2025-05-07T19:49:56.9213439Z [EXEC] [ATTEMPT 0/3] + wget -q https://developer.download.nvidia.com/compute/redist/cudnn/v8.7.0/local_installers/11.8/cudnn-linux-x86_64-8.7.0.84_cuda11-archive.tar.xz -O cudnn.tar.xz 2025-05-07T19:49:58.8430473Z [INSTALL] Unpacking cuDNN ... 2025-05-07T19:49:58.8430906Z + tar -xvf cudnn.tar.xz 2025-05-07T19:49:58.8431194Z 2025-05-07T19:49:58.8457029Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/ 2025-05-07T19:49:58.8458144Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/ 2025-05-07T19:49:58.8459422Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_adv_infer_static.a 2025-05-07T19:50:01.2344054Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_adv_infer_static_v8.a 2025-05-07T19:50:01.2344846Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_adv_train_static.a 2025-05-07T19:50:03.4973019Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_adv_train_static_v8.a 2025-05-07T19:50:03.4973812Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_cnn_infer_static.a 2025-05-07T19:50:11.7095641Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_cnn_infer_static_v8.a 2025-05-07T19:50:11.7097330Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_cnn_train_static.a 2025-05-07T19:50:13.3154161Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_cnn_train_static_v8.a 2025-05-07T19:50:13.3154762Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_ops_infer_static.a 2025-05-07T19:50:15.0054600Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_ops_infer_static_v8.a 2025-05-07T19:50:15.0055320Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_ops_train_static.a 2025-05-07T19:50:16.5175498Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_ops_train_static_v8.a 2025-05-07T19:50:16.5177072Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn.so.8 2025-05-07T19:50:16.5178389Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn.so 2025-05-07T19:50:16.5179715Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn.so.8.7.0 2025-05-07T19:50:16.5189712Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_adv_infer.so.8 2025-05-07T19:50:16.5191925Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_adv_infer.so 2025-05-07T19:50:16.5193486Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_adv_infer.so.8.7.0 2025-05-07T19:50:18.8949401Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_adv_train.so.8 2025-05-07T19:50:18.8951215Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_adv_train.so 2025-05-07T19:50:18.8952830Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_adv_train.so.8.7.0 2025-05-07T19:50:21.1496568Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_cnn_infer.so 2025-05-07T19:50:21.1498134Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_cnn_infer.so.8 2025-05-07T19:50:21.1499684Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_cnn_infer.so.8.7.0 2025-05-07T19:50:29.6823071Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_cnn_train.so 2025-05-07T19:50:29.6823981Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_cnn_train.so.8.7.0 2025-05-07T19:50:31.3019879Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_cnn_train.so.8 2025-05-07T19:50:31.3021538Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_ops_infer.so.8.7.0 2025-05-07T19:50:32.9875326Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_ops_infer.so 2025-05-07T19:50:32.9876889Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_ops_infer.so.8 2025-05-07T19:50:32.9878456Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_ops_train.so.8.7.0 2025-05-07T19:50:34.5016940Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_ops_train.so 2025-05-07T19:50:34.5018566Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_ops_train.so.8 2025-05-07T19:50:34.5019909Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/ 2025-05-07T19:50:34.5021131Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn_v8.h 2025-05-07T19:50:34.5022569Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn_adv_infer_v8.h 2025-05-07T19:50:34.5023405Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn_adv_train_v8.h 2025-05-07T19:50:34.5023948Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn_backend_v8.h 2025-05-07T19:50:34.5024458Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn_cnn_infer_v8.h 2025-05-07T19:50:34.5024991Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn_cnn_train_v8.h 2025-05-07T19:50:34.5025525Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn_ops_infer_v8.h 2025-05-07T19:50:34.5026044Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn_ops_train_v8.h 2025-05-07T19:50:34.5026571Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn_version_v8.h 2025-05-07T19:50:34.5027037Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn.h 2025-05-07T19:50:34.5027519Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn_adv_infer.h 2025-05-07T19:50:34.5028040Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn_adv_train.h 2025-05-07T19:50:34.5028543Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn_backend.h 2025-05-07T19:50:34.5029236Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn_cnn_infer.h 2025-05-07T19:50:34.5029740Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn_cnn_train.h 2025-05-07T19:50:34.5030280Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn_ops_infer.h 2025-05-07T19:50:34.5030794Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn_ops_train.h 2025-05-07T19:50:34.5031475Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn_version.h 2025-05-07T19:50:34.5031959Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/LICENSE 2025-05-07T19:50:34.5042776Z 2025-05-07T19:50:34.5043797Z [INSTALL] Moving cuDNN files to /__w/FBGEMM/FBGEMM/build_only/cudnn ... 2025-05-07T19:50:34.5044346Z + rm -rf /__w/FBGEMM/FBGEMM/build_only/cudnn/include 2025-05-07T19:50:34.5044597Z 2025-05-07T19:50:34.5060758Z 2025-05-07T19:50:34.5060923Z + rm -rf /__w/FBGEMM/FBGEMM/build_only/cudnn/lib 2025-05-07T19:50:34.5061230Z 2025-05-07T19:50:34.5072816Z 2025-05-07T19:50:34.5073708Z + mv cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include /__w/FBGEMM/FBGEMM/build_only/cudnn 2025-05-07T19:50:34.5074108Z 2025-05-07T19:50:34.5107276Z 2025-05-07T19:50:34.5108979Z + mv cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib /__w/FBGEMM/FBGEMM/build_only/cudnn 2025-05-07T19:50:34.5110174Z 2025-05-07T19:50:36.2544038Z 2025-05-07T19:50:36.2544648Z /__w/FBGEMM/FBGEMM 2025-05-07T19:50:36.2550013Z + rm -rf /tmp/tmp.ng3sw5l5Ky 2025-05-07T19:50:36.2550740Z 2025-05-07T19:50:36.7155174Z 2025-05-07T19:50:36.7172233Z [INSTALL] Set environment variables CUDNN_INCLUDE_DIR and CUDNN_LIBRARY ... 2025-05-07T19:50:36.7173228Z + conda env config vars set -n build_binary CUDNN_INCLUDE_DIR=/__w/FBGEMM/FBGEMM/build_only/cudnn/include CUDNN_LIBRARY=/__w/FBGEMM/FBGEMM/build_only/cudnn/lib 2025-05-07T19:50:36.7173907Z 2025-05-07T19:50:37.1384678Z 2025-05-07T19:50:37.1385029Z [INSTALL] Successfully installed cuDNN (for CUDA 11.8.0) 2025-05-07T19:50:37.1456625Z ##[group]Run . $PRELUDE; cd fbgemm_gpu; prepare_fbgemm_gpu_build $BUILD_ENV 2025-05-07T19:50:37.1457266Z . $PRELUDE; cd fbgemm_gpu; prepare_fbgemm_gpu_build $BUILD_ENV 2025-05-07T19:50:37.1457934Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:50:37.1458311Z env: 2025-05-07T19:50:37.1458555Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:50:37.1458905Z BUILD_ENV: build_binary 2025-05-07T19:50:37.1459176Z BUILD_TARGET: default 2025-05-07T19:50:37.1459450Z BUILD_VARIANT: cuda 2025-05-07T19:50:37.1459741Z BUILD_CUDA_VERSION: 11.8.0 2025-05-07T19:50:37.1460010Z ##[endgroup] 2025-05-07T19:50:37.5394897Z ################################################################################ 2025-05-07T19:50:37.5395979Z # Prepare FBGEMM-GPU Build 2025-05-07T19:50:37.5396731Z # 2025-05-07T19:50:37.5412361Z # [2025-05-07T19:50:37.540Z] + prepare_fbgemm_gpu_build build_binary 2025-05-07T19:50:37.5412960Z ################################################################################ 2025-05-07T19:50:37.5413230Z 2025-05-07T19:50:37.5428533Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:50:37.6335439Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:50:37.6350801Z [BUILD] Running git submodules update ... 2025-05-07T19:50:37.6384343Z [EXEC] [ATTEMPT 0/3] + git submodule sync 2025-05-07T19:50:37.6678085Z Synchronizing submodule url for '../external/asmjit' 2025-05-07T19:50:37.6678647Z Synchronizing submodule url for '../external/composable_kernel' 2025-05-07T19:50:37.6679153Z Synchronizing submodule url for '../external/cpuinfo' 2025-05-07T19:50:37.6679565Z Synchronizing submodule url for '../external/cutlass' 2025-05-07T19:50:37.6680000Z Synchronizing submodule url for '../external/googletest' 2025-05-07T19:50:37.6680439Z Synchronizing submodule url for '../external/hipify_torch' 2025-05-07T19:50:37.6680870Z Synchronizing submodule url for '../external/json' 2025-05-07T19:50:37.6713452Z [EXEC] [ATTEMPT 0/3] + git submodule update --init --recursive 2025-05-07T19:50:37.7219984Z [BUILD] Installing other build dependencies ... 2025-05-07T19:50:37.7249698Z [EXEC] [ATTEMPT 0/3] + conda run --no-capture-output -n build_binary python -m pip install -r requirements.txt 2025-05-07T19:50:39.8208783Z Collecting backports.tarfile (from -r requirements.txt (line 13)) 2025-05-07T19:50:39.8382216Z Downloading backports.tarfile-1.2.0-py3-none-any.whl.metadata (2.0 kB) 2025-05-07T19:50:39.8465016Z Requirement already satisfied: build in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from -r requirements.txt (line 14)) (1.2.2.post1) 2025-05-07T19:50:39.9499615Z Collecting cmake (from -r requirements.txt (line 15)) 2025-05-07T19:50:39.9530894Z Downloading cmake-4.0.0-py3-none-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (6.3 kB) 2025-05-07T19:50:39.9602873Z Requirement already satisfied: click in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from -r requirements.txt (line 16)) (8.1.8) 2025-05-07T19:50:39.9604409Z Requirement already satisfied: hypothesis in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from -r requirements.txt (line 17)) (6.131.14) 2025-05-07T19:50:39.9606194Z Requirement already satisfied: jinja2 in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from -r requirements.txt (line 18)) (3.1.6) 2025-05-07T19:50:39.9609796Z Requirement already satisfied: mpmath==1.3.0 in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from -r requirements.txt (line 19)) (1.3.0) 2025-05-07T19:50:39.9873390Z Collecting ninja (from -r requirements.txt (line 20)) 2025-05-07T19:50:39.9912457Z Downloading ninja-1.11.1.4-py3-none-manylinux_2_12_x86_64.manylinux2010_x86_64.whl.metadata (5.0 kB) 2025-05-07T19:50:39.9983108Z Requirement already satisfied: numpy>=2.0.2 in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from -r requirements.txt (line 21)) (2.2.5) 2025-05-07T19:50:40.0114923Z Collecting pyre-extensions (from -r requirements.txt (line 22)) 2025-05-07T19:50:40.0151404Z Downloading pyre_extensions-0.0.32-py3-none-any.whl.metadata (4.0 kB) 2025-05-07T19:50:40.0215677Z Requirement already satisfied: pyyaml in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from -r requirements.txt (line 23)) (6.0.2) 2025-05-07T19:50:40.0219557Z Requirement already satisfied: scikit-build in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from -r requirements.txt (line 24)) (0.18.1) 2025-05-07T19:50:40.0222384Z Requirement already satisfied: setuptools in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from -r requirements.txt (line 25)) (78.1.1) 2025-05-07T19:50:40.0412469Z Collecting setuptools_git_versioning (from -r requirements.txt (line 26)) 2025-05-07T19:50:40.0453027Z Downloading setuptools_git_versioning-2.1.0-py3-none-any.whl.metadata (6.1 kB) 2025-05-07T19:50:40.0626934Z Collecting tabulate (from -r requirements.txt (line 27)) 2025-05-07T19:50:40.0656300Z Downloading tabulate-0.9.0-py3-none-any.whl.metadata (34 kB) 2025-05-07T19:50:40.0910027Z Collecting patchelf (from -r requirements.txt (line 28)) 2025-05-07T19:50:40.0940115Z Downloading patchelf-0.17.2.2-py3-none-manylinux1_x86_64.manylinux_2_5_x86_64.musllinux_1_1_x86_64.whl.metadata (3.5 kB) 2025-05-07T19:50:40.1028218Z Requirement already satisfied: packaging>=19.1 in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from build->-r requirements.txt (line 14)) (25.0) 2025-05-07T19:50:40.1032536Z Requirement already satisfied: pyproject_hooks in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from build->-r requirements.txt (line 14)) (1.2.0) 2025-05-07T19:50:40.1075529Z Requirement already satisfied: attrs>=22.2.0 in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from hypothesis->-r requirements.txt (line 17)) (25.3.0) 2025-05-07T19:50:40.1079911Z Requirement already satisfied: sortedcontainers<3.0.0,>=2.1.0 in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from hypothesis->-r requirements.txt (line 17)) (2.4.0) 2025-05-07T19:50:40.1130328Z Requirement already satisfied: MarkupSafe>=2.0 in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from jinja2->-r requirements.txt (line 18)) (3.0.2) 2025-05-07T19:50:40.1259487Z Collecting typing-inspect (from pyre-extensions->-r requirements.txt (line 22)) 2025-05-07T19:50:40.1291856Z Downloading typing_inspect-0.9.0-py3-none-any.whl.metadata (1.5 kB) 2025-05-07T19:50:40.1359008Z Requirement already satisfied: typing-extensions in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from pyre-extensions->-r requirements.txt (line 22)) (4.13.2) 2025-05-07T19:50:40.1373487Z Requirement already satisfied: distro in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from scikit-build->-r requirements.txt (line 24)) (1.9.0) 2025-05-07T19:50:40.1384711Z Requirement already satisfied: wheel>=0.32.0 in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from scikit-build->-r requirements.txt (line 24)) (0.45.1) 2025-05-07T19:50:40.1642425Z Collecting mypy-extensions>=0.3.0 (from typing-inspect->pyre-extensions->-r requirements.txt (line 22)) 2025-05-07T19:50:40.1672631Z Downloading mypy_extensions-1.1.0-py3-none-any.whl.metadata (1.1 kB) 2025-05-07T19:50:40.1781055Z Downloading backports.tarfile-1.2.0-py3-none-any.whl (30 kB) 2025-05-07T19:50:40.1871108Z Downloading cmake-4.0.0-py3-none-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (27.9 MB) 2025-05-07T19:50:40.2851458Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 27.9/27.9 MB 291.5 MB/s eta 0:00:00 2025-05-07T19:50:40.2935520Z Downloading ninja-1.11.1.4-py3-none-manylinux_2_12_x86_64.manylinux2010_x86_64.whl (422 kB) 2025-05-07T19:50:40.3029408Z Downloading pyre_extensions-0.0.32-py3-none-any.whl (12 kB) 2025-05-07T19:50:40.3130970Z Downloading setuptools_git_versioning-2.1.0-py3-none-any.whl (10 kB) 2025-05-07T19:50:40.3200397Z Downloading tabulate-0.9.0-py3-none-any.whl (35 kB) 2025-05-07T19:50:40.3272540Z Downloading patchelf-0.17.2.2-py3-none-manylinux1_x86_64.manylinux_2_5_x86_64.musllinux_1_1_x86_64.whl (466 kB) 2025-05-07T19:50:40.3362933Z Downloading typing_inspect-0.9.0-py3-none-any.whl (8.8 kB) 2025-05-07T19:50:40.3431921Z Downloading mypy_extensions-1.1.0-py3-none-any.whl (5.0 kB) 2025-05-07T19:50:40.4942224Z Installing collected packages: tabulate, setuptools_git_versioning, patchelf, ninja, mypy-extensions, cmake, backports.tarfile, typing-inspect, pyre-extensions 2025-05-07T19:50:41.3183477Z 2025-05-07T19:50:41.3231314Z WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager, possibly rendering your system unusable. It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv. Use the --root-user-action option if you know what you are doing and want to suppress this warning. 2025-05-07T19:50:41.3233718Z Successfully installed backports.tarfile-1.2.0 cmake-4.0.0 mypy-extensions-1.1.0 ninja-1.11.1.4 patchelf-0.17.2.2 pyre-extensions-0.0.32 setuptools_git_versioning-2.1.0 tabulate-0.9.0 typing-inspect-0.9.0 2025-05-07T19:50:41.4533840Z ################################################################################ 2025-05-07T19:50:41.4534755Z # Install PyTorch (PyTorch PIP) 2025-05-07T19:50:41.4535141Z # 2025-05-07T19:50:41.4552906Z # [2025-05-07T19:50:41.454Z] + install_triton_pip build_binary 2025-05-07T19:50:41.4553360Z ################################################################################ 2025-05-07T19:50:41.4553596Z 2025-05-07T19:50:41.4553848Z [BUILD] Installing pytorch-triton nightly/3.2.0+git4b3bb1f8 from PIP ... 2025-05-07T19:50:41.4554297Z ################################################################################ 2025-05-07T19:50:41.4554689Z # Install Package From PyTorch PIP: pytorch-triton 2025-05-07T19:50:41.4555021Z # 2025-05-07T19:50:41.4572536Z # [2025-05-07T19:50:41.456Z] + install_from_pytorch_pip build_binary pytorch-triton nightly/3.2.0+git4b3bb1f8 2025-05-07T19:50:41.4573718Z ################################################################################ 2025-05-07T19:50:41.4574026Z 2025-05-07T19:50:41.4592572Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:50:41.5452714Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:50:41.5453178Z ################################################################################ 2025-05-07T19:50:41.5453530Z # Prepare PIP Arguments (PyTorch PIP) 2025-05-07T19:50:41.5453829Z # 2025-05-07T19:50:41.5476595Z # [2025-05-07T19:50:41.546Z] + __prepare_pip_arguments pytorch-triton nightly/3.2.0+git4b3bb1f8 2025-05-07T19:50:41.5478436Z ################################################################################ 2025-05-07T19:50:41.5479111Z 2025-05-07T19:50:41.5527961Z [INSTALL] Extracted package (channel, version): (nightly, 3.2.0+git4b3bb1f8) 2025-05-07T19:50:41.5543763Z [INSTALL] Using a non-RELEASE channel: nightly ... 2025-05-07T19:50:41.5544871Z [INSTALL] Extracted the full PIP channel: https://download.pytorch.org/whl/nightly/ 2025-05-07T19:50:41.5553453Z [INSTALL] Extracted the full PIP package: --pre pytorch-triton==3.2.0+git4b3bb1f8 2025-05-07T19:50:41.5567453Z [INSTALL] Attempting to install [pytorch-triton, 3.2.0+git4b3bb1f8] from PyTorch PIP using channel https://download.pytorch.org/whl/nightly/ ... 2025-05-07T19:50:41.5590656Z [EXEC] [ATTEMPT 0/3] + conda run -n build_binary pip install --pre pytorch-triton==3.2.0+git4b3bb1f8 --index-url https://download.pytorch.org/whl/nightly/ 2025-05-07T19:50:47.1985249Z ERROR: pip's dependency resolver does not currently take into account all the packages that are installed. This behaviour is the source of the following dependency conflicts. 2025-05-07T19:50:47.1986219Z Looking in indexes: https://download.pytorch.org/whl/nightly/ 2025-05-07T19:50:47.1986679Z Collecting pytorch-triton==3.2.0+git4b3bb1f8 2025-05-07T19:50:47.1987781Z Downloading https://download.pytorch.org/whl/nightly/pytorch_triton-3.2.0%2Bgit4b3bb1f8-cp313-cp313-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl.metadata (1.3 kB) 2025-05-07T19:50:47.1989240Z Downloading https://download.pytorch.org/whl/nightly/pytorch_triton-3.2.0%2Bgit4b3bb1f8-cp313-cp313-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl (166.5 MB) 2025-05-07T19:50:47.1990396Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 166.5/166.5 MB 152.6 MB/s eta 0:00:00 2025-05-07T19:50:47.1991024Z Installing collected packages: pytorch-triton 2025-05-07T19:50:47.1991404Z Attempting uninstall: pytorch-triton 2025-05-07T19:50:47.1991798Z Found existing installation: pytorch-triton 3.3.0+git96316ce5 2025-05-07T19:50:47.1992263Z Uninstalling pytorch-triton-3.3.0+git96316ce5: 2025-05-07T19:50:47.1992684Z Successfully uninstalled pytorch-triton-3.3.0+git96316ce5 2025-05-07T19:50:47.1993145Z Successfully installed pytorch-triton-3.2.0+git4b3bb1f8 2025-05-07T19:50:47.1993414Z 2025-05-07T19:50:49.3070261Z torch 2.8.0.dev20250507+cu118 requires pytorch-triton==3.3.0+git96316ce5; platform_system == "Linux" and platform_machine == "x86_64", but you have pytorch-triton 3.2.0+git4b3bb1f8 which is incompatible. 2025-05-07T19:50:49.3075448Z WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager, possibly rendering your system unusable. It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv. Use the --root-user-action option if you know what you are doing and want to suppress this warning. 2025-05-07T19:50:49.3076927Z 2025-05-07T19:50:49.3077073Z [CHECK] Python (sub-)package 'triton' found ... 2025-05-07T19:50:49.3077497Z [CHECK] Printing out the pytorch-triton version ... 2025-05-07T19:50:51.3362305Z ################################################################################ 2025-05-07T19:50:51.3363664Z [CHECK] The installed VERSION of pytorch-triton is: 3.2.0 2025-05-07T19:50:51.3364085Z ################################################################################ 2025-05-07T19:50:51.3364357Z 2025-05-07T19:50:53.3274560Z [CHECK] Python (sub-)package 'numpy' found ... 2025-05-07T19:50:55.4153768Z [CHECK] Python (sub-)package 'skbuild' found ... 2025-05-07T19:50:55.4154217Z [BUILD] Successfully ran git submodules update 2025-05-07T19:50:55.4230481Z ##[group]Run . $PRELUDE; cd fbgemm_gpu; build_fbgemm_gpu_package $BUILD_ENV nightly default/cuda 2025-05-07T19:50:55.4231330Z . $PRELUDE; cd fbgemm_gpu; build_fbgemm_gpu_package $BUILD_ENV nightly default/cuda 2025-05-07T19:50:55.4231952Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:50:55.4232318Z env: 2025-05-07T19:50:55.4232542Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:50:55.4232860Z BUILD_ENV: build_binary 2025-05-07T19:50:55.4233101Z BUILD_TARGET: default 2025-05-07T19:50:55.4233350Z BUILD_VARIANT: cuda 2025-05-07T19:50:55.4233600Z BUILD_CUDA_VERSION: 11.8.0 2025-05-07T19:50:55.4233846Z ##[endgroup] 2025-05-07T19:50:55.9137423Z [BUILD] BUILD_TARGET_VARIANT: default/cuda 2025-05-07T19:50:55.9137891Z [BUILD] Extracted build target: default 2025-05-07T19:50:55.9138528Z [BUILD] Extracted build variant: cuda 2025-05-07T19:50:57.7140790Z /github/home/miniconda/envs/build_binary/bin/cc 2025-05-07T19:50:57.7141093Z 2025-05-07T19:50:57.7958241Z [CHECK] Binary cc found in PATH 2025-05-07T19:50:59.5890070Z /github/home/miniconda/envs/build_binary/bin/gcc 2025-05-07T19:50:59.5891260Z 2025-05-07T19:50:59.6462032Z [CHECK] Binary gcc found in PATH 2025-05-07T19:51:01.4300557Z /github/home/miniconda/envs/build_binary/bin/c++ 2025-05-07T19:51:01.4300860Z 2025-05-07T19:51:01.5043460Z [CHECK] Binary c++ found in PATH 2025-05-07T19:51:03.3175069Z /github/home/miniconda/envs/build_binary/bin/g++ 2025-05-07T19:51:03.3175521Z 2025-05-07T19:51:03.3929357Z [CHECK] Binary g++ found in PATH 2025-05-07T19:51:05.2633420Z [BUILD] Extracted and set Python tag: py313 2025-05-07T19:51:05.2634738Z [BUILD] Extracted and set Python platform name: manylinux_2_28_x86_64 2025-05-07T19:51:05.2860952Z core = 24 2025-05-07T19:51:05.3067587Z sockets = 2 2025-05-07T19:51:05.3068463Z [BUILD] Set multicore run option for setup.py: -j 48 2025-05-07T19:51:05.3069622Z [CHECK] LD_LIBRARY_PATH = 2025-05-07T19:51:05.3069923Z [BUILD] Running pre-build cleanups ... 2025-05-07T19:51:05.3070240Z + rm -rf dist 2025-05-07T19:51:05.3070378Z 2025-05-07T19:51:05.3081711Z 2025-05-07T19:51:05.3082487Z + conda run --no-capture-output -n build_binary python setup.py clean 2025-05-07T19:51:05.3083452Z 2025-05-07T19:51:08.3799069Z INFO:root:running clean 2025-05-07T19:51:08.3799641Z [SETUP.PY] ARGV: ['setup.py', 'clean'] 2025-05-07T19:51:08.3800708Z [SETUP.PY] Parsed setup.py arguments: Namespace(verbose=False, debug=False, dryrun=False, build_target='default', build_variant='cuda', package_channel='nightly', nvml_lib_path=None, nccl_lib_path=None, use_fb_only=False, cxxprefix=None) 2025-05-07T19:51:08.3801890Z [SETUP.PY] Other arguments: ['clean'] 2025-05-07T19:51:08.3802371Z [SETUP.PY] CUDA CUB directory environment variable not set. Using default CUB location. 2025-05-07T19:51:08.3803065Z [SETUP.PY] Using CUDA = /github/home/miniconda/envs/build_binary 2025-05-07T19:51:08.3803615Z [SETUP.PY] Generating version file at: /__w/FBGEMM/FBGEMM/fbgemm_gpu/fbgemm_gpu/docs/version.py 2025-05-07T19:51:08.3804149Z [SETUP.PY] Setting the FBGEMM build target: default ... 2025-05-07T19:51:08.3804542Z [SETUP.PY] Setting the FBGEMM build variant: cuda ... 2025-05-07T19:51:08.3805692Z [SETUP.PY] Passing CMake arguments: ['-DCMAKE_PREFIX_PATH=/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch', '-D_GLIBCXX_USE_CXX11_ABI=1', '-DFBGEMM_BUILD_TARGET=default', '-DFBGEMM_BUILD_VARIANT=cuda', "-DCMAKE_C_FLAGS=''", "-DCMAKE_CXX_FLAGS=''"] 2025-05-07T19:51:08.7428671Z 2025-05-07T19:51:08.7429428Z [BUILD] Printing git status ... 2025-05-07T19:51:08.7430265Z + git status 2025-05-07T19:51:08.7430612Z 2025-05-07T19:51:09.1656458Z HEAD detached at pull/4066/merge 2025-05-07T19:51:09.1656927Z Untracked files: 2025-05-07T19:51:09.1657463Z (use "git add ..." to include in what will be committed) 2025-05-07T19:51:09.1657997Z ../build_only/ 2025-05-07T19:51:09.1658240Z ../collect_env.py 2025-05-07T19:51:09.1658518Z fbgemm_gpu/docs/version.py 2025-05-07T19:51:09.1658700Z 2025-05-07T19:51:09.1659258Z nothing added to commit but untracked files present (use "git add" to track) 2025-05-07T19:51:09.1659648Z 2025-05-07T19:51:09.1659749Z + git diff 2025-05-07T19:51:09.1659872Z 2025-05-07T19:51:09.1936203Z 2025-05-07T19:51:09.1936917Z ################################################################################ 2025-05-07T19:51:09.1937980Z # Configure FBGEMM-GPU Build 2025-05-07T19:51:09.1938754Z # 2025-05-07T19:51:09.1953071Z # [2025-05-07T19:51:09.194Z] + __configure_fbgemm_gpu_build 2025-05-07T19:51:09.1954160Z ################################################################################ 2025-05-07T19:51:09.1954392Z 2025-05-07T19:51:09.1957691Z [BUILD] Setting the build target: default ... 2025-05-07T19:51:09.1958172Z [BUILD] Configuring build as CUDA variant (this is the default behavior) ... 2025-05-07T19:51:11.0315038Z /github/home/miniconda/envs/build_binary/bin/nvcc 2025-05-07T19:51:11.0315383Z 2025-05-07T19:51:11.0913878Z [CHECK] Binary nvcc found in PATH 2025-05-07T19:51:12.9294706Z /__w/FBGEMM/FBGEMM/build_only/cudnn/include 2025-05-07T19:51:12.9295009Z 2025-05-07T19:51:13.0033073Z [CHECK] Environment variable CUDNN_INCLUDE_DIR is defined in the Conda environment 2025-05-07T19:51:14.8567510Z /__w/FBGEMM/FBGEMM/build_only/cudnn/lib 2025-05-07T19:51:14.8567835Z 2025-05-07T19:51:14.9313001Z [CHECK] Environment variable CUDNN_LIBRARY is defined in the Conda environment 2025-05-07T19:51:16.7817489Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:51:16.7817860Z 2025-05-07T19:51:16.8554062Z [CHECK] Environment variable NVML_LIB_PATH is defined in the Conda environment 2025-05-07T19:51:18.7875261Z [BUILD] Using the default architectures for CUDA nvcc: NVIDIA (R) Cuda compiler driver 2025-05-07T19:51:18.7876218Z Copyright (c) 2005-2022 NVIDIA Corporation 2025-05-07T19:51:18.7876546Z Built on Wed_Sep_21_10:33:58_PDT_2022 2025-05-07T19:51:18.7876890Z Cuda compilation tools, release 11.8, V11.8.89 2025-05-07T19:51:18.7877269Z Build cuda_11.8.r11.8/compiler.31833905_0 ... 2025-05-07T19:51:18.7877618Z [BUILD] Setting the following CUDA targets: 7.0;8.0 2025-05-07T19:51:18.7877979Z [BUILD] Looking up NVML filepath ... 2025-05-07T19:51:20.7106757Z [BUILD] Looking up NCCL filepath ... 2025-05-07T19:51:24.5385296Z [BUILD] Setting NVCC verbose mode ... 2025-05-07T19:51:24.5385742Z + conda env config vars set -n build_binary NVCC_VERBOSE=1 2025-05-07T19:51:24.5386038Z 2025-05-07T19:51:24.9664643Z 2025-05-07T19:51:24.9664941Z [BUILD] Setting CUDA build args ... 2025-05-07T19:51:24.9674466Z [BUILD] Looking up CUDA version ... 2025-05-07T19:51:28.6788796Z + conda run -n build_binary c++ --version | grep -i clang 2025-05-07T19:51:28.6789119Z 2025-05-07T19:51:30.5846819Z 2025-05-07T19:51:30.5847227Z [BUILD] Setting NVCC flags ... 2025-05-07T19:51:30.5848122Z + conda env config vars set -n build_binary NVCC_PREPEND_FLAGS="-std=c++17 -Xcompiler -std=c++17 -ccbin /github/home/miniconda/envs/build_binary/bin/c++ -allow-unsupported-compiler" 2025-05-07T19:51:30.5848856Z 2025-05-07T19:51:31.0034366Z 2025-05-07T19:51:31.0034693Z + conda run -n build_binary printenv NVCC_PREPEND_FLAGS 2025-05-07T19:51:31.0034978Z 2025-05-07T19:51:32.8156620Z -std=c++17 -Xcompiler -std=c++17 -ccbin /github/home/miniconda/envs/build_binary/bin/c++ -allow-unsupported-compiler 2025-05-07T19:51:32.8158180Z 2025-05-07T19:51:32.8976182Z 2025-05-07T19:51:32.8976626Z [BUILD] Setting CUDA build args ... 2025-05-07T19:51:32.8980112Z + conda run -n build_binary c++ --version 2025-05-07T19:51:32.8980402Z 2025-05-07T19:51:34.6940199Z c++ (conda-forge gcc 11.4.0-13) 11.4.0 2025-05-07T19:51:34.6940624Z Copyright (C) 2021 Free Software Foundation, Inc. 2025-05-07T19:51:34.6941132Z This is free software; see the source for copying conditions. There is NO 2025-05-07T19:51:34.6941733Z warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. 2025-05-07T19:51:34.6942073Z 2025-05-07T19:51:34.6942078Z 2025-05-07T19:51:34.7680860Z 2025-05-07T19:51:34.7681673Z + conda run -n build_binary c++ --version | grep -i clang 2025-05-07T19:51:34.7682032Z 2025-05-07T19:51:36.6480027Z 2025-05-07T19:51:36.6480459Z [BUILD] Enabling debug features in the build ... 2025-05-07T19:51:36.6481058Z .github/scripts/fbgemm_gpu_build.bash: line 370: [: : integer expression expected 2025-05-07T19:51:36.6482889Z [BUILD] FBGEMM_GPU build arguments have been set: --verbose --build-target=default --build-variant=cuda --nvml_lib_path=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so --nccl_lib_path=/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 -DTORCH_CUDA_ARCH_LIST='7.0;8.0' -DCMAKE_CXX_STANDARD=17 --debug 2025-05-07T19:51:36.6484546Z ################################################################################ 2025-05-07T19:51:36.6484921Z # Build FBGEMM-GPU Package (Wheel) 2025-05-07T19:51:36.6485210Z # 2025-05-07T19:51:36.6500549Z # [2025-05-07T19:51:36.649Z] + build_fbgemm_gpu_package build_binary nightly default/cuda 2025-05-07T19:51:36.6501071Z ################################################################################ 2025-05-07T19:51:36.6501301Z 2025-05-07T19:51:36.6505880Z [BUILD] Building FBGEMM wheel (TARGET=default, VARIANT=cuda) ... 2025-05-07T19:51:36.6509506Z + conda run --no-capture-output -n build_binary python -m build --wheel --no-isolation --config-setting=--build-option=--verbose --config-setting=--build-option=--build-target=default --config-setting=--build-option=--build-variant=cuda --config-setting=--build-option=--nvml_lib_path=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so --config-setting=--build-option=--nccl_lib_path=/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 --config-setting=--build-option=-DTORCH_CUDA_ARCH_LIST='7.0;8.0' --config-setting=--build-option=-DCMAKE_CXX_STANDARD=17 --config-setting=--build-option=--debug --config-setting=--build-option=--package_channel=nightly --config-setting=--build-option=--python-tag=py313 --config-setting=--build-option=--plat-name=manylinux_2_28_x86_64 2025-05-07T19:51:36.6513355Z 2025-05-07T19:51:38.4946695Z * Getting build dependencies for wheel... 2025-05-07T19:51:39.7596677Z INFO:root:running egg_info 2025-05-07T19:51:39.7633781Z INFO:root:creating fbgemm_gpu_nightly.egg-info 2025-05-07T19:51:39.7634239Z INFO:root:writing fbgemm_gpu_nightly.egg-info/PKG-INFO 2025-05-07T19:51:39.7634848Z INFO:root:writing dependency_links to fbgemm_gpu_nightly.egg-info/dependency_links.txt 2025-05-07T19:51:39.7636694Z INFO:root:writing requirements to fbgemm_gpu_nightly.egg-info/requires.txt 2025-05-07T19:51:39.7637573Z INFO:root:writing top-level names to fbgemm_gpu_nightly.egg-info/top_level.txt 2025-05-07T19:51:39.7638501Z INFO:root:writing manifest file 'fbgemm_gpu_nightly.egg-info/SOURCES.txt' 2025-05-07T19:51:39.7702288Z INFO:root:reading manifest file 'fbgemm_gpu_nightly.egg-info/SOURCES.txt' 2025-05-07T19:51:39.7712542Z INFO:root:writing manifest file 'fbgemm_gpu_nightly.egg-info/SOURCES.txt' 2025-05-07T19:51:39.7715506Z [SETUP.PY] ARGV: ['setup.py', 'egg_info'] 2025-05-07T19:51:39.7716536Z [SETUP.PY] Parsed setup.py arguments: Namespace(verbose=False, debug=False, dryrun=False, build_target='default', build_variant='cuda', package_channel='nightly', nvml_lib_path=None, nccl_lib_path=None, use_fb_only=False, cxxprefix=None) 2025-05-07T19:51:39.7717610Z [SETUP.PY] Other arguments: ['egg_info'] 2025-05-07T19:51:39.7718111Z [SETUP.PY] CUDA CUB directory environment variable not set. Using default CUB location. 2025-05-07T19:51:39.7718668Z [SETUP.PY] Using CUDA = /github/home/miniconda/envs/build_binary 2025-05-07T19:51:39.7719261Z [SETUP.PY] Generating version file at: /__w/FBGEMM/FBGEMM/fbgemm_gpu/fbgemm_gpu/docs/version.py 2025-05-07T19:51:39.7719821Z [SETUP.PY] Setting the FBGEMM build target: default ... 2025-05-07T19:51:39.7720238Z [SETUP.PY] Setting the FBGEMM build variant: cuda ... 2025-05-07T19:51:39.7721980Z [SETUP.PY] Passing CMake arguments: ['-DCMAKE_PREFIX_PATH=/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch', '-D_GLIBCXX_USE_CXX11_ABI=1', '-DFBGEMM_BUILD_TARGET=default', '-DFBGEMM_BUILD_VARIANT=cuda', "-DCMAKE_C_FLAGS=''", "-DCMAKE_CXX_FLAGS=''"] 2025-05-07T19:51:40.0442130Z * Building wheel... 2025-05-07T19:51:41.3006532Z [SETUP.PY] ARGV: ['setup.py', 'bdist_wheel', '--dist-dir', '/__w/FBGEMM/FBGEMM/fbgemm_gpu/dist/.tmp-oam94gw2', '--verbose', '--build-target=default', '--build-variant=cuda', '--nvml_lib_path=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so', '--nccl_lib_path=/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2', '-DTORCH_CUDA_ARCH_LIST=7.0;8.0', '-DCMAKE_CXX_STANDARD=17', '--debug', '--package_channel=nightly', '--python-tag=py313', '--plat-name=manylinux_2_28_x86_64'] 2025-05-07T19:51:41.3010194Z [SETUP.PY] Parsed setup.py arguments: Namespace(verbose=True, debug=True, dryrun=False, build_target='default', build_variant='cuda', package_channel='nightly', nvml_lib_path='/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so', nccl_lib_path='/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2', use_fb_only=False, cxxprefix=None) 2025-05-07T19:51:41.3012377Z [SETUP.PY] Other arguments: ['bdist_wheel', '--dist-dir', '/__w/FBGEMM/FBGEMM/fbgemm_gpu/dist/.tmp-oam94gw2', '-DTORCH_CUDA_ARCH_LIST=7.0;8.0', '-DCMAKE_CXX_STANDARD=17', '--python-tag=py313', '--plat-name=manylinux_2_28_x86_64'] 2025-05-07T19:51:41.3013418Z [SETUP.PY] CUDA CUB directory environment variable not set. Using default CUB location. 2025-05-07T19:51:41.3013941Z [SETUP.PY] Using CUDA = /github/home/miniconda/envs/build_binary 2025-05-07T19:51:41.3014492Z [SETUP.PY] Generating version file at: /__w/FBGEMM/FBGEMM/fbgemm_gpu/fbgemm_gpu/docs/version.py 2025-05-07T19:51:41.3015292Z [SETUP.PY] Setting the FBGEMM build target: default ... 2025-05-07T19:51:41.3015685Z [SETUP.PY] Setting the FBGEMM build variant: cuda ... 2025-05-07T19:51:41.3019526Z [SETUP.PY] Passing CMake arguments: ['-DCMAKE_PREFIX_PATH=/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch', '-D_GLIBCXX_USE_CXX11_ABI=1', '-DCMAKE_VERBOSE_MAKEFILE=ON', '-DCMAKE_EXPORT_COMPILE_COMMANDS=TRUE', '-DFBGEMM_BUILD_TARGET=default', '-DFBGEMM_BUILD_VARIANT=cuda', '-DNVML_LIB_PATH=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so', '-DNCCL_INCLUDE_DIRS=/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include', '-DNCCL_LIBRARIES=/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2', "-DCMAKE_C_FLAGS='-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib'", "-DCMAKE_CXX_FLAGS='-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib'", '-DTORCH_CUDA_ARCH_LIST=7.0;8.0', '-DCMAKE_CXX_STANDARD=17'] 2025-05-07T19:51:41.3023140Z 2025-05-07T19:51:41.3023161Z 2025-05-07T19:51:41.3023325Z -------------------------------------------------------------------------------- 2025-05-07T19:51:41.3023682Z -- Trying 'Ninja' generator 2025-05-07T19:51:41.3023946Z -------------------------------- 2025-05-07T19:51:41.3024194Z --------------------------- 2025-05-07T19:51:41.3024438Z ---------------------- 2025-05-07T19:51:41.3024664Z ----------------- 2025-05-07T19:51:41.3024859Z ------------ 2025-05-07T19:51:41.3025060Z ------- 2025-05-07T19:51:41.3025236Z -- 2025-05-07T19:51:41.3419673Z CMake Deprecation Warning at CMakeLists.txt:1 (cmake_minimum_required): 2025-05-07T19:51:41.3421313Z Compatibility with CMake < 3.10 will be removed from a future version of 2025-05-07T19:51:41.3422502Z CMake. 2025-05-07T19:51:41.3422854Z 2025-05-07T19:51:41.3423485Z Update the VERSION argument value. Or, use the ... syntax 2025-05-07T19:51:41.3425071Z to tell CMake that the project requires at least but has been updated 2025-05-07T19:51:41.3426095Z to work with policies introduced by or earlier. 2025-05-07T19:51:41.3426350Z 2025-05-07T19:51:41.3426354Z 2025-05-07T19:51:41.3426529Z Not searching for unused variables given on the command line. 2025-05-07T19:51:41.3851740Z -- The C compiler identification is GNU 11.4.0 2025-05-07T19:51:41.3925038Z -- Detecting C compiler ABI info 2025-05-07T19:51:41.4793410Z -- Detecting C compiler ABI info - done 2025-05-07T19:51:41.4970631Z -- Check for working C compiler: /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-cc - skipped 2025-05-07T19:51:41.4972419Z -- Detecting C compile features 2025-05-07T19:51:41.4974414Z -- Detecting C compile features - done 2025-05-07T19:51:41.5749515Z -- The CXX compiler identification is GNU 11.4.0 2025-05-07T19:51:41.5821725Z -- Detecting CXX compiler ABI info 2025-05-07T19:51:41.6769961Z -- Detecting CXX compiler ABI info - done 2025-05-07T19:51:41.6955179Z -- Check for working CXX compiler: /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ - skipped 2025-05-07T19:51:41.6957071Z -- Detecting CXX compile features 2025-05-07T19:51:41.6962757Z -- Detecting CXX compile features - done 2025-05-07T19:51:41.7028621Z -- Configuring done (0.4s) 2025-05-07T19:51:41.7072111Z -- Generating done (0.0s) 2025-05-07T19:51:41.7087244Z -- Build files have been written to: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_cmake_test_compile/build 2025-05-07T19:51:41.7119696Z -- 2025-05-07T19:51:41.7120887Z ------- 2025-05-07T19:51:41.7121566Z ------------ 2025-05-07T19:51:41.7122154Z ----------------- 2025-05-07T19:51:41.7122742Z ---------------------- 2025-05-07T19:51:41.7123417Z --------------------------- 2025-05-07T19:51:41.7124250Z -------------------------------- 2025-05-07T19:51:41.7124550Z -- Trying 'Ninja' generator - success 2025-05-07T19:51:41.7125189Z -------------------------------------------------------------------------------- 2025-05-07T19:51:41.7125496Z 2025-05-07T19:51:41.7136388Z Configuring Project 2025-05-07T19:51:41.7137163Z Working directory: 2025-05-07T19:51:41.7138228Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build 2025-05-07T19:51:41.7139397Z Command: 2025-05-07T19:51:41.7158341Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/cmake/data/bin/cmake /__w/FBGEMM/FBGEMM/fbgemm_gpu -G Ninja -DCMAKE_MAKE_PROGRAM:FILEPATH=/github/home/miniconda/envs/build_binary/bin/ninja --no-warn-unused-cli -DCMAKE_INSTALL_PREFIX:PATH=/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install -DPYTHON_VERSION_STRING:STRING=3.13.2 -DSKBUILD:INTERNAL=TRUE -DCMAKE_MODULE_PATH:PATH=/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/skbuild/resources/cmake -DPYTHON_EXECUTABLE:PATH=/github/home/miniconda/envs/build_binary/bin/python -DPYTHON_INCLUDE_DIR:PATH=/github/home/miniconda/envs/build_binary/include/python3.13 -DPYTHON_LIBRARY:PATH=/github/home/miniconda/envs/build_binary/lib/libpython3.13.so -DPython_EXECUTABLE:PATH=/github/home/miniconda/envs/build_binary/bin/python -DPython_ROOT_DIR:PATH=/github/home/miniconda/envs/build_binary -DPython_FIND_REGISTRY:STRING=NEVER -DPython_INCLUDE_DIR:PATH=/github/home/miniconda/envs/build_binary/include/python3.13 -DPython_NumPy_INCLUDE_DIRS:PATH=/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/numpy/_core/include -DPython3_EXECUTABLE:PATH=/github/home/miniconda/envs/build_binary/bin/python -DPython3_ROOT_DIR:PATH=/github/home/miniconda/envs/build_binary -DPython3_FIND_REGISTRY:STRING=NEVER -DPython3_INCLUDE_DIR:PATH=/github/home/miniconda/envs/build_binary/include/python3.13 -DPython3_NumPy_INCLUDE_DIRS:PATH=/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/numpy/_core/include -DCMAKE_MAKE_PROGRAM:FILEPATH=/github/home/miniconda/envs/build_binary/bin/ninja -DCMAKE_AR=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-ar -DCMAKE_CXX_COMPILER_AR=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-gcc-ar -DCMAKE_C_COMPILER_AR=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-gcc-ar -DCMAKE_RANLIB=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-ranlib -DCMAKE_CXX_COMPILER_RANLIB=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-gcc-ranlib -DCMAKE_C_COMPILER_RANLIB=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-gcc-ranlib -DCMAKE_LINKER=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-ld -DCMAKE_STRIP=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-strip -DCMAKE_BUILD_TYPE=Release -DCMAKE_PREFIX_PATH=/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch -D_GLIBCXX_USE_CXX11_ABI=1 -DCMAKE_VERBOSE_MAKEFILE=ON -DCMAKE_EXPORT_COMPILE_COMMANDS=TRUE -DFBGEMM_BUILD_TARGET=default -DFBGEMM_BUILD_VARIANT=cuda -DNVML_LIB_PATH=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -DNCCL_INCLUDE_DIRS=/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -DNCCL_LIBRARIES=/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 '-DCMAKE_C_FLAGS='"'"'-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib'"'"'' '-DCMAKE_CXX_FLAGS='"'"'-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib'"'"'' '-DTORCH_CUDA_ARCH_LIST=7.0;8.0' -DCMAKE_CXX_STANDARD=17 '-DTORCH_CUDA_ARCH_LIST=7.0;8.0' -DCMAKE_CXX_STANDARD=17 -DCMAKE_AR=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-ar -DCMAKE_CXX_COMPILER_AR=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-gcc-ar -DCMAKE_C_COMPILER_AR=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-gcc-ar -DCMAKE_RANLIB=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-ranlib -DCMAKE_CXX_COMPILER_RANLIB=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-gcc-ranlib -DCMAKE_C_COMPILER_RANLIB=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-gcc-ranlib -DCMAKE_LINKER=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-ld -DCMAKE_STRIP=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-strip -DCMAKE_BUILD_TYPE=Release 2025-05-07T19:51:41.7180874Z 2025-05-07T19:51:41.7539183Z 2025-05-07T19:51:41.7539201Z 2025-05-07T19:51:41.7539764Z ================================================================================ 2025-05-07T19:51:41.7540843Z Default C compiler flags 2025-05-07T19:51:41.7541846Z (values may be overridden by CMAKE_CXX_STANDARD and CXX_STANDARD): 2025-05-07T19:51:41.7542716Z 2025-05-07T19:51:41.7544133Z -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib 2025-05-07T19:51:41.7544896Z ================================================================================ 2025-05-07T19:51:41.7545130Z 2025-05-07T19:51:41.7545134Z 2025-05-07T19:51:41.7545139Z 2025-05-07T19:51:41.7545255Z ================================================================================ 2025-05-07T19:51:41.7545581Z Default C++ compiler flags 2025-05-07T19:51:41.7545913Z (values may be overridden by CMAKE_CXX_STANDARD and CXX_STANDARD): 2025-05-07T19:51:41.7546207Z 2025-05-07T19:51:41.7546617Z -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib 2025-05-07T19:51:41.7547223Z ================================================================================ 2025-05-07T19:51:41.7547454Z 2025-05-07T19:51:41.7547457Z 2025-05-07T19:51:41.7547461Z 2025-05-07T19:51:41.7547633Z Not searching for unused variables given on the command line. 2025-05-07T19:51:41.7548056Z ================================================================================ 2025-05-07T19:51:41.7548353Z AVX2_FLAGS: 2025-05-07T19:51:41.7548482Z 2025-05-07T19:51:41.7548558Z -mavx2 2025-05-07T19:51:41.7548753Z -mf16c 2025-05-07T19:51:41.7548925Z -mfma 2025-05-07T19:51:41.7549121Z -fopenmp 2025-05-07T19:51:41.7549561Z ================================================================================ 2025-05-07T19:51:41.7549790Z 2025-05-07T19:51:41.7549814Z 2025-05-07T19:51:41.7549818Z 2025-05-07T19:51:41.7549928Z ================================================================================ 2025-05-07T19:51:41.7550220Z AVX512_FLAGS: 2025-05-07T19:51:41.7550360Z 2025-05-07T19:51:41.7550438Z -mavx2 2025-05-07T19:51:41.7550634Z -mf16c 2025-05-07T19:51:41.7550806Z -mfma 2025-05-07T19:51:41.7551001Z -mavx512f 2025-05-07T19:51:41.7551311Z -mavx512bw 2025-05-07T19:51:41.7551526Z -mavx512dq 2025-05-07T19:51:41.7551897Z -mavx512vl 2025-05-07T19:51:41.7552112Z -fopenmp 2025-05-07T19:51:41.7552338Z ================================================================================ 2025-05-07T19:51:41.7552616Z 2025-05-07T19:51:41.7552620Z 2025-05-07T19:51:41.7552624Z 2025-05-07T19:51:41.7552737Z ================================================================================ 2025-05-07T19:51:41.7553097Z The project is built using scikit-build 2025-05-07T19:51:41.7553421Z ================================================================================ 2025-05-07T19:51:41.7553646Z 2025-05-07T19:51:41.7553650Z 2025-05-07T19:51:41.7553654Z 2025-05-07T19:51:41.7553780Z ================================================================================ 2025-05-07T19:51:41.7554093Z Build Settings 2025-05-07T19:51:41.7554241Z 2025-05-07T19:51:41.7554348Z FBGEMM_BUILD_TARGET : default 2025-05-07T19:51:41.7554630Z FBGEMM_BUILD_VARIANT : cuda 2025-05-07T19:51:41.7554825Z 2025-05-07T19:51:41.7554921Z NVCC_VERBOSE : 2025-05-07T19:51:41.7555191Z CUDNN_INCLUDE_DIR : 2025-05-07T19:51:41.7555443Z CUDNN_LIBRARY : 2025-05-07T19:51:41.7555881Z NVML_LIB_PATH : /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:51:41.7556459Z TORCH_CUDA_ARCH_LIST : 7.0 2025-05-07T19:51:41.7556727Z 8.0 2025-05-07T19:51:41.7556834Z 2025-05-07T19:51:41.7556930Z HIP_ROOT_DIR : 2025-05-07T19:51:41.7557198Z HIPCC_VERBOSE : 2025-05-07T19:51:41.7557444Z AMDGPU_TARGETS : 2025-05-07T19:51:41.7557713Z PYTORCH_ROCM_ARCH : 2025-05-07T19:51:41.7558103Z ================================================================================ 2025-05-07T19:51:41.7558312Z 2025-05-07T19:51:41.8297966Z -- The CXX compiler identification is GNU 11.4.0 2025-05-07T19:51:41.8680536Z -- The C compiler identification is GNU 11.4.0 2025-05-07T19:51:42.6979847Z -- The CUDA compiler identification is NVIDIA 11.8.89 with host compiler GNU 11.4.0 2025-05-07T19:51:42.7072173Z -- Detecting CXX compiler ABI info 2025-05-07T19:51:42.8005914Z -- Detecting CXX compiler ABI info - done 2025-05-07T19:51:42.8193385Z -- Check for working CXX compiler: /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ - skipped 2025-05-07T19:51:42.8194064Z -- Detecting CXX compile features 2025-05-07T19:51:42.8202243Z -- Detecting CXX compile features - done 2025-05-07T19:51:42.8319578Z -- Detecting C compiler ABI info 2025-05-07T19:51:42.9178366Z -- Detecting C compiler ABI info - done 2025-05-07T19:51:42.9355016Z -- Check for working C compiler: /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-cc - skipped 2025-05-07T19:51:42.9356613Z -- Detecting C compile features 2025-05-07T19:51:42.9360850Z -- Detecting C compile features - done 2025-05-07T19:51:42.9461359Z -- Detecting CUDA compiler ABI info 2025-05-07T19:51:43.7629522Z -- Detecting CUDA compiler ABI info - done 2025-05-07T19:51:43.8138758Z -- Check for working CUDA compiler: /github/home/miniconda/envs/build_binary/bin/nvcc - skipped 2025-05-07T19:51:43.8158630Z -- Detecting CUDA compile features 2025-05-07T19:51:43.8162027Z -- Detecting CUDA compile features - done 2025-05-07T19:51:43.8237750Z -- Performing Test C_HAS_AVX_1 2025-05-07T19:51:44.0769849Z -- Performing Test C_HAS_AVX_1 - Failed 2025-05-07T19:51:44.0770829Z -- Performing Test C_HAS_AVX_2 2025-05-07T19:51:44.3492025Z -- Performing Test C_HAS_AVX_2 - Success 2025-05-07T19:51:44.3493451Z -- Performing Test C_HAS_AVX2_1 2025-05-07T19:51:44.6025025Z -- Performing Test C_HAS_AVX2_1 - Failed 2025-05-07T19:51:44.6026105Z -- Performing Test C_HAS_AVX2_2 2025-05-07T19:51:44.8690077Z -- Performing Test C_HAS_AVX2_2 - Success 2025-05-07T19:51:44.8691590Z -- Performing Test C_HAS_AVX512_1 2025-05-07T19:51:45.1230711Z -- Performing Test C_HAS_AVX512_1 - Failed 2025-05-07T19:51:45.1231935Z -- Performing Test C_HAS_AVX512_2 2025-05-07T19:51:45.3382006Z -- Performing Test C_HAS_AVX512_2 - Success 2025-05-07T19:51:45.3383013Z -- Performing Test CXX_HAS_AVX_1 2025-05-07T19:51:45.5924673Z -- Performing Test CXX_HAS_AVX_1 - Failed 2025-05-07T19:51:45.5925203Z -- Performing Test CXX_HAS_AVX_2 2025-05-07T19:51:45.8628101Z -- Performing Test CXX_HAS_AVX_2 - Success 2025-05-07T19:51:45.8629178Z -- Performing Test CXX_HAS_AVX2_1 2025-05-07T19:51:46.1177590Z -- Performing Test CXX_HAS_AVX2_1 - Failed 2025-05-07T19:51:46.1178629Z -- Performing Test CXX_HAS_AVX2_2 2025-05-07T19:51:46.3857076Z -- Performing Test CXX_HAS_AVX2_2 - Success 2025-05-07T19:51:46.3858124Z -- Performing Test CXX_HAS_AVX512_1 2025-05-07T19:51:46.6415615Z -- Performing Test CXX_HAS_AVX512_1 - Failed 2025-05-07T19:51:46.6416683Z -- Performing Test CXX_HAS_AVX512_2 2025-05-07T19:51:46.8570194Z -- Performing Test CXX_HAS_AVX512_2 - Success 2025-05-07T19:51:46.8750875Z -- Found CUDA: /github/home/miniconda/envs/build_binary (found version "11.8") 2025-05-07T19:51:46.8784706Z -- Found CUDAToolkit: /github/home/miniconda/envs/build_binary/include (found version "11.8.89") 2025-05-07T19:51:46.8861430Z -- Performing Test CMAKE_HAVE_LIBC_PTHREAD 2025-05-07T19:51:46.9753454Z -- Performing Test CMAKE_HAVE_LIBC_PTHREAD - Failed 2025-05-07T19:51:46.9754587Z -- Looking for pthread_create in pthreads 2025-05-07T19:51:47.0545556Z -- Looking for pthread_create in pthreads - not found 2025-05-07T19:51:47.0546652Z -- Looking for pthread_create in pthread 2025-05-07T19:51:47.1428193Z -- Looking for pthread_create in pthread - found 2025-05-07T19:51:47.1436864Z -- Found Threads: TRUE 2025-05-07T19:51:47.2197288Z -- PyTorch: CUDA detected: 11.8 2025-05-07T19:51:47.2198560Z -- PyTorch: CUDA nvcc is: /github/home/miniconda/envs/build_binary/bin/nvcc 2025-05-07T19:51:47.2200166Z -- PyTorch: CUDA toolkit directory: /github/home/miniconda/envs/build_binary 2025-05-07T19:51:47.3361921Z -- PyTorch: Header version is: 11.8 2025-05-07T19:51:47.4341656Z -- Found Python: /github/home/miniconda/envs/build_binary/bin/python (found version "3.13.2") found components: Interpreter 2025-05-07T19:51:47.4363577Z CMake Warning at /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/share/cmake/Caffe2/public/cuda.cmake:140 (message): 2025-05-07T19:51:47.4364433Z -- USE_CUDNN is set to 0. Compiling without cuDNN support 2025-05-07T19:51:47.4364860Z Failed to compute shorthash for libnvrtc.so 2025-05-07T19:51:47.4365211Z Call Stack (most recent call first): 2025-05-07T19:51:47.4365896Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/share/cmake/Caffe2/Caffe2Config.cmake:86 (include) 2025-05-07T19:51:47.4366994Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/share/cmake/Torch/TorchConfig.cmake:68 (find_package) 2025-05-07T19:51:47.4367845Z /__w/FBGEMM/FBGEMM/cmake/modules/PyTorchSetup.cmake:14 (find_package) 2025-05-07T19:51:47.4368281Z CMakeLists.txt:112 (include) 2025-05-07T19:51:47.4368462Z 2025-05-07T19:51:47.4368467Z 2025-05-07T19:51:47.4368684Z -- USE_CUSPARSELT is set to 0. Compiling without cuSPARSELt support 2025-05-07T19:51:47.4369138Z -- USE_CUDSS is set to 0. Compiling without cuDSS support 2025-05-07T19:51:47.4369576Z -- USE_CUFILE is set to 0. Compiling without cuFile support 2025-05-07T19:51:47.4370148Z -- Added CUDA NVCC flags for: -gencode;arch=compute_70,code=sm_70;-gencode;arch=compute_80,code=sm_80 2025-05-07T19:51:47.4712752Z CMake Warning at /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/share/cmake/Torch/TorchConfig.cmake:22 (message): 2025-05-07T19:51:47.4713813Z static library kineto_LIBRARY-NOTFOUND not found. 2025-05-07T19:51:47.4714182Z Call Stack (most recent call first): 2025-05-07T19:51:47.4714956Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/share/cmake/Torch/TorchConfig.cmake:125 (append_torchlib_if_found) 2025-05-07T19:51:47.4715849Z /__w/FBGEMM/FBGEMM/cmake/modules/PyTorchSetup.cmake:14 (find_package) 2025-05-07T19:51:47.4716300Z CMakeLists.txt:112 (include) 2025-05-07T19:51:47.4716483Z 2025-05-07T19:51:47.4716487Z 2025-05-07T19:51:47.4717067Z -- Found Torch: /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so 2025-05-07T19:51:47.4718121Z 2025-05-07T19:51:47.4718125Z 2025-05-07T19:51:47.4718257Z ================================================================================ 2025-05-07T19:51:47.4718615Z PyTorch Flags: 2025-05-07T19:51:47.4718828Z 2025-05-07T19:51:47.4719040Z TORCH_INCLUDE_DIRS: 2025-05-07T19:51:47.4721850Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:51:47.4722639Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:51:47.4723230Z 2025-05-07T19:51:47.4723422Z TORCH_LIBRARIES: 2025-05-07T19:51:47.4723649Z torch 2025-05-07T19:51:47.4723834Z torch_library 2025-05-07T19:51:47.4724268Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:51:47.4724859Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:51:47.4725463Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:51:47.4725988Z 2025-05-07T19:51:47.4726177Z TORCH_CUDA_OPTIONS: 2025-05-07T19:51:47.4726569Z --expt-relaxed-constexpr 2025-05-07T19:51:47.4726835Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:51:47.4727130Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:51:47.4727419Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:51:47.4727730Z ================================================================================ 2025-05-07T19:51:47.4727964Z 2025-05-07T19:51:47.4727985Z 2025-05-07T19:51:47.4727989Z 2025-05-07T19:51:47.4728125Z ================================================================================ 2025-05-07T19:51:47.4728435Z NCCL Flags 2025-05-07T19:51:47.4728570Z 2025-05-07T19:51:47.4728949Z NCCL_INCLUDE_DIRS=/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:51:47.4729829Z NCCL_LIBRARIES=/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:51:47.4730473Z ================================================================================ 2025-05-07T19:51:47.4730706Z 2025-05-07T19:51:47.4730711Z 2025-05-07T19:51:47.4730715Z 2025-05-07T19:51:47.4730846Z ================================================================================ 2025-05-07T19:51:47.4731164Z CUDA Driver Path 2025-05-07T19:51:47.4731320Z 2025-05-07T19:51:47.4731590Z CUDA_DRIVER_LIBRARIES=/github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:51:47.4732096Z ================================================================================ 2025-05-07T19:51:47.4732322Z 2025-05-07T19:51:47.4732607Z -- Found NVML_LIB_PATH: /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:51:47.4752854Z 2025-05-07T19:51:47.4752913Z 2025-05-07T19:51:47.4753138Z ================================================================================ 2025-05-07T19:51:47.4753547Z GPU CPP Library Target: asmjit (SHARED) 2025-05-07T19:51:47.4753842Z 2025-05-07T19:51:47.4754048Z CPU_SRCS: 2025-05-07T19:51:47.4754165Z 2025-05-07T19:51:47.4754263Z 2025-05-07T19:51:47.4754474Z GPU_SRCS: 2025-05-07T19:51:47.4754585Z 2025-05-07T19:51:47.4754734Z 2025-05-07T19:51:47.4754917Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:51:47.4755057Z 2025-05-07T19:51:47.4755156Z 2025-05-07T19:51:47.4755336Z HIP_SPECIFIC_SRCS: 2025-05-07T19:51:47.4755471Z 2025-05-07T19:51:47.4755715Z 2025-05-07T19:51:47.4755899Z OTHER_SRCS: 2025-05-07T19:51:47.4756288Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64assembler.cpp 2025-05-07T19:51:47.4756885Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64builder.cpp 2025-05-07T19:51:47.4757489Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64compiler.cpp 2025-05-07T19:51:47.4758097Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64emithelper.cpp 2025-05-07T19:51:47.4758692Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64formatter.cpp 2025-05-07T19:51:47.4759281Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64func.cpp 2025-05-07T19:51:47.4759846Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64instapi.cpp 2025-05-07T19:51:47.4760437Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64instdb.cpp 2025-05-07T19:51:47.4761005Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64operand.cpp 2025-05-07T19:51:47.4761601Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64rapass.cpp 2025-05-07T19:51:47.4762192Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/armformatter.cpp 2025-05-07T19:51:47.4762777Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/archtraits.cpp 2025-05-07T19:51:47.4763368Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/assembler.cpp 2025-05-07T19:51:47.4763933Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/builder.cpp 2025-05-07T19:51:47.4764516Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/codeholder.cpp 2025-05-07T19:51:47.4765113Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/codewriter.cpp 2025-05-07T19:51:47.4765759Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/compiler.cpp 2025-05-07T19:51:47.4766351Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/constpool.cpp 2025-05-07T19:51:47.4766925Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/cpuinfo.cpp 2025-05-07T19:51:47.4767525Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/emithelper.cpp 2025-05-07T19:51:47.4768098Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/emitter.cpp 2025-05-07T19:51:47.4768690Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/emitterutils.cpp 2025-05-07T19:51:47.4769301Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/environment.cpp 2025-05-07T19:51:47.4769899Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/errorhandler.cpp 2025-05-07T19:51:47.4770503Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/formatter.cpp 2025-05-07T19:51:47.4771061Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/func.cpp 2025-05-07T19:51:47.4771658Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/funcargscontext.cpp 2025-05-07T19:51:47.4772251Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/globals.cpp 2025-05-07T19:51:47.4772811Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/inst.cpp 2025-05-07T19:51:47.4773366Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/instdb.cpp 2025-05-07T19:51:47.4773941Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/jitallocator.cpp 2025-05-07T19:51:47.4774547Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/jitruntime.cpp 2025-05-07T19:51:47.4775112Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/logger.cpp 2025-05-07T19:51:47.4775679Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/operand.cpp 2025-05-07T19:51:47.4776252Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/osutils.cpp 2025-05-07T19:51:47.4776809Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/ralocal.cpp 2025-05-07T19:51:47.4777379Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/rapass.cpp 2025-05-07T19:51:47.4777991Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/rastack.cpp 2025-05-07T19:51:47.4778559Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/string.cpp 2025-05-07T19:51:47.4779105Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/support.cpp 2025-05-07T19:51:47.4779671Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/target.cpp 2025-05-07T19:51:47.4780230Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/type.cpp 2025-05-07T19:51:47.4780772Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/virtmem.cpp 2025-05-07T19:51:47.4781332Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zone.cpp 2025-05-07T19:51:47.4781881Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zonehash.cpp 2025-05-07T19:51:47.4782467Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zonelist.cpp 2025-05-07T19:51:47.4783036Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zonestack.cpp 2025-05-07T19:51:47.4783620Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zonetree.cpp 2025-05-07T19:51:47.4784206Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zonevector.cpp 2025-05-07T19:51:47.4784790Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86assembler.cpp 2025-05-07T19:51:47.4785384Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86builder.cpp 2025-05-07T19:51:47.4786116Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86compiler.cpp 2025-05-07T19:51:47.4786718Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86emithelper.cpp 2025-05-07T19:51:47.4787319Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86formatter.cpp 2025-05-07T19:51:47.4787941Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86func.cpp 2025-05-07T19:51:47.4788511Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86instapi.cpp 2025-05-07T19:51:47.4789086Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86instdb.cpp 2025-05-07T19:51:47.4789667Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86operand.cpp 2025-05-07T19:51:47.4790233Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86rapass.cpp 2025-05-07T19:51:47.4791002Z 2025-05-07T19:51:47.4791260Z CC_FLAGS: 2025-05-07T19:51:47.4791380Z 2025-05-07T19:51:47.4791456Z 2025-05-07T19:51:47.4791650Z NVCC_FLAGS: 2025-05-07T19:51:47.4791797Z 2025-05-07T19:51:47.4791874Z 2025-05-07T19:51:47.4792070Z HIPCC_FLAGS: 2025-05-07T19:51:47.4792192Z 2025-05-07T19:51:47.4792272Z 2025-05-07T19:51:47.4792462Z INCLUDE_DIRS: 2025-05-07T19:51:47.4792691Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:51:47.4793011Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:51:47.4793297Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:51:47.4793616Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:51:47.4794117Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:51:47.4794888Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:51:47.4795539Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:51:47.4795946Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:51:47.4796384Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:51:47.4796849Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:51:47.4797375Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:51:47.4797840Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:51:47.4798392Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:51:47.4798944Z 2025-05-07T19:51:47.4799152Z Selected Source Files: 2025-05-07T19:51:47.4799305Z 2025-05-07T19:51:47.4799383Z 2025-05-07T19:51:47.4799593Z HIPified Source Files: 2025-05-07T19:51:47.4799743Z 2025-05-07T19:51:47.4799983Z 2025-05-07T19:51:47.4800196Z Library Dependencies: 2025-05-07T19:51:47.4800424Z torch 2025-05-07T19:51:47.4800630Z torch_library 2025-05-07T19:51:47.4801055Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:51:47.4801645Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:51:47.4802231Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:51:47.4803015Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:51:47.4803666Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:51:47.4804160Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:51:47.4804568Z 2025-05-07T19:51:47.4804752Z Output Library: 2025-05-07T19:51:47.4804969Z asmjit 2025-05-07T19:51:47.4805144Z 2025-05-07T19:51:47.4805344Z Destination Directory: 2025-05-07T19:51:47.4805572Z fbgemm_gpu 2025-05-07T19:51:47.4805814Z ================================================================================ 2025-05-07T19:51:47.4806042Z 2025-05-07T19:51:47.4806046Z 2025-05-07T19:51:47.4806050Z 2025-05-07T19:51:47.4806176Z ================================================================================ 2025-05-07T19:51:47.4806504Z GPU CPP Library Target: fbgemm (SHARED) 2025-05-07T19:51:47.4806802Z 2025-05-07T19:51:47.4806976Z CPU_SRCS: 2025-05-07T19:51:47.4807106Z 2025-05-07T19:51:47.4807179Z 2025-05-07T19:51:47.4807359Z GPU_SRCS: 2025-05-07T19:51:47.4807485Z 2025-05-07T19:51:47.4807561Z 2025-05-07T19:51:47.4807764Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:51:47.4807904Z 2025-05-07T19:51:47.4807983Z 2025-05-07T19:51:47.4808269Z HIP_SPECIFIC_SRCS: 2025-05-07T19:51:47.4808404Z 2025-05-07T19:51:47.4808480Z 2025-05-07T19:51:47.4808672Z OTHER_SRCS: 2025-05-07T19:51:47.4808930Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDM.cc 2025-05-07T19:51:47.4809379Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDMAutovec.cc 2025-05-07T19:51:47.4809831Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDMNBit.cc 2025-05-07T19:51:47.4810247Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/QuantUtils.cc 2025-05-07T19:51:47.4810658Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/RefImplementations.cc 2025-05-07T19:51:47.4811120Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/RowWiseSparseAdagradFused.cc 2025-05-07T19:51:47.4811578Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/SparseAdagrad.cc 2025-05-07T19:51:47.4811945Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/Utils.cc 2025-05-07T19:51:47.4812339Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDMAvx2.cc 2025-05-07T19:51:47.4812746Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/QuantUtilsAvx2.cc 2025-05-07T19:51:47.4813165Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDMAvx2.cc 2025-05-07T19:51:47.4813574Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/QuantUtilsAvx2.cc 2025-05-07T19:51:47.4814005Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDMAvx512.cc 2025-05-07T19:51:47.4814373Z 2025-05-07T19:51:47.4814551Z CC_FLAGS: 2025-05-07T19:51:47.4814666Z 2025-05-07T19:51:47.4814758Z 2025-05-07T19:51:47.4814935Z NVCC_FLAGS: 2025-05-07T19:51:47.4815064Z 2025-05-07T19:51:47.4815138Z 2025-05-07T19:51:47.4815318Z HIPCC_FLAGS: 2025-05-07T19:51:47.4815454Z 2025-05-07T19:51:47.4815529Z 2025-05-07T19:51:47.4815707Z INCLUDE_DIRS: 2025-05-07T19:51:47.4815946Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:51:47.4816265Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:51:47.4816541Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:51:47.4816856Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:51:47.4817332Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:51:47.4818109Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:51:47.4818739Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:51:47.4819151Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:51:47.4819624Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:51:47.4820095Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:51:47.4820607Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:51:47.4821047Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:51:47.4821602Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:51:47.4822087Z 2025-05-07T19:51:47.4822292Z Selected Source Files: 2025-05-07T19:51:47.4822439Z 2025-05-07T19:51:47.4822514Z 2025-05-07T19:51:47.4822714Z HIPified Source Files: 2025-05-07T19:51:47.4822862Z 2025-05-07T19:51:47.4822951Z 2025-05-07T19:51:47.4823142Z Library Dependencies: 2025-05-07T19:51:47.4823378Z torch 2025-05-07T19:51:47.4823560Z torch_library 2025-05-07T19:51:47.4823999Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:51:47.4824577Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:51:47.4825183Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:51:47.4825959Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:51:47.4826612Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:51:47.4826989Z asmjit 2025-05-07T19:51:47.4827301Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:51:47.4827701Z 2025-05-07T19:51:47.4827882Z Output Library: 2025-05-07T19:51:47.4828100Z fbgemm 2025-05-07T19:51:47.4828275Z 2025-05-07T19:51:47.4828477Z Destination Directory: 2025-05-07T19:51:47.4828715Z fbgemm_gpu 2025-05-07T19:51:47.4829027Z ================================================================================ 2025-05-07T19:51:47.4829257Z 2025-05-07T19:51:47.4829261Z 2025-05-07T19:51:47.4829265Z 2025-05-07T19:51:47.4829398Z ================================================================================ 2025-05-07T19:51:47.4829732Z Running code generation script ... 2025-05-07T19:51:47.4830482Z /github/home/miniconda/envs/build_binary/bin/python /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_backward_split.py --opensource 2025-05-07T19:51:47.4831332Z ================================================================================ 2025-05-07T19:51:47.4831581Z 2025-05-07T19:51:48.0239109Z [ARGS PARSE] Parsed arguments: Namespace(install_dir='.', is_fbcode=False, is_rocm=False) 2025-05-07T19:51:48.0240030Z [GENERAATE BACKWARD SPLIT]: ['/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_backward_split.py', '--opensource'] 2025-05-07T19:51:48.0240746Z Written: gen_embedding_backward_dense_split_weighted_vbe_cuda.cu 2025-05-07T19:51:48.0241248Z Written: gen_embedding_backward_dense_split_weighted_cuda.cu 2025-05-07T19:51:48.0241740Z Written: gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu 2025-05-07T19:51:48.0242254Z Written: gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu 2025-05-07T19:51:48.0242743Z Written: gen_embedding_backward_dense_split_unweighted_cuda.cu 2025-05-07T19:51:48.0243212Z Written: gen_embedding_backward_dense_split_weighted_vbe_meta.cpp 2025-05-07T19:51:48.0243695Z Written: gen_embedding_backward_dense_split_weighted_meta.cpp 2025-05-07T19:51:48.0244176Z Written: gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp 2025-05-07T19:51:48.0244698Z Written: gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp 2025-05-07T19:51:48.0245196Z Written: gen_embedding_backward_dense_split_unweighted_meta.cpp 2025-05-07T19:51:48.0245686Z Written: gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:51:48.0246205Z Written: gen_embedding_backward_dense_split_weighted_kernel_cta.cu 2025-05-07T19:51:48.0246725Z Written: gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:51:48.0247285Z Written: gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:51:48.0248019Z Written: gen_embedding_backward_dense_split_unweighted_kernel_cta.cu 2025-05-07T19:51:48.0248562Z Written: gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:51:48.0249089Z Written: gen_embedding_backward_dense_split_weighted_kernel_warp.cu 2025-05-07T19:51:48.0249612Z Written: gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:51:48.0250180Z Written: gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:51:48.0250709Z Written: gen_embedding_backward_dense_split_unweighted_kernel_warp.cu 2025-05-07T19:51:48.0251210Z Written: gen_embedding_optimizer_dense_split_device_kernel.cuh 2025-05-07T19:51:48.0251715Z Written: gen_embedding_backward_split_dense.cpp 2025-05-07T19:51:48.0252080Z Written: gen_embedding_backward_dense_split_cpu.cpp 2025-05-07T19:51:48.0252480Z Written: gen_embedding_backward_adagrad_split_weighted_cuda.cu 2025-05-07T19:51:48.0252933Z Written: gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:51:48.0253411Z Written: gen_embedding_backward_adagrad_split_unweighted_cuda.cu 2025-05-07T19:51:48.0253843Z Written: gen_embedding_backward_adagrad_split_weighted_meta.cpp 2025-05-07T19:51:48.0254315Z Written: gen_embedding_backward_adagrad_split_unweighted_nobag_meta.cpp 2025-05-07T19:51:48.0254784Z Written: gen_embedding_backward_adagrad_split_unweighted_meta.cpp 2025-05-07T19:51:48.0255249Z Written: gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:51:48.0255759Z Written: gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:51:48.0256264Z Written: gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:51:48.0256754Z Written: gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:51:48.0257357Z Written: gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:51:48.0257885Z Written: gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:51:48.0258352Z Written: gen_embedding_optimizer_adagrad_split_device_kernel.cuh 2025-05-07T19:51:48.0258758Z Written: gen_embedding_backward_split_adagrad.cpp 2025-05-07T19:51:48.0259130Z Written: gen_embedding_split_adagrad_pt2_autograd.cpp 2025-05-07T19:51:48.0259541Z Written: gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:51:48.0259929Z Written: lookup_adagrad.py 2025-05-07T19:51:48.0260214Z Written: gen_embedding_backward_adagrad_split_cpu.cpp 2025-05-07T19:51:48.0260592Z Written: gen_embedding_backward_split_adagrad_cpu.cpp 2025-05-07T19:51:48.0260991Z Written: gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:51:48.0261439Z Written: gen_embedding_backward_adam_split_weighted_vbe_cuda.cu 2025-05-07T19:51:48.0261873Z Written: gen_embedding_backward_adam_split_weighted_cuda.cu 2025-05-07T19:51:48.0262301Z Written: gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:51:48.0262761Z Written: gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu 2025-05-07T19:51:48.0263192Z Written: gen_embedding_backward_adam_split_unweighted_cuda.cu 2025-05-07T19:51:48.0263630Z Written: gen_embedding_backward_adam_split_weighted_vbe_meta.cpp 2025-05-07T19:51:48.0264048Z Written: gen_embedding_backward_adam_split_weighted_meta.cpp 2025-05-07T19:51:48.0264497Z Written: gen_embedding_backward_adam_split_unweighted_nobag_meta.cpp 2025-05-07T19:51:48.0264972Z Written: gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp 2025-05-07T19:51:48.0265410Z Written: gen_embedding_backward_adam_split_unweighted_meta.cpp 2025-05-07T19:51:48.0265869Z Written: gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:51:48.0266328Z Written: gen_embedding_backward_adam_split_weighted_kernel_cta.cu 2025-05-07T19:51:48.0266817Z Written: gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:51:48.0267316Z Written: gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:51:48.0267870Z Written: gen_embedding_backward_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:51:48.0268363Z Written: gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:51:48.0268829Z Written: gen_embedding_backward_adam_split_weighted_kernel_warp.cu 2025-05-07T19:51:48.0269319Z Written: gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:51:48.0269817Z Written: gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:51:48.0270315Z Written: gen_embedding_backward_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:51:48.0270758Z Written: gen_embedding_optimizer_adam_split_device_kernel.cuh 2025-05-07T19:51:48.0271209Z Written: gen_embedding_backward_split_adam.cpp 2025-05-07T19:51:48.0271740Z Written: gen_embedding_split_adam_pt2_autograd.cpp 2025-05-07T19:51:48.0272171Z Written: gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:51:48.0272569Z Written: lookup_adam.py 2025-05-07T19:51:48.0272857Z Written: gen_embedding_backward_split_adam_cpu.cpp 2025-05-07T19:51:48.0273298Z Written: gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:51:48.0273749Z Written: gen_embedding_backward_lamb_split_weighted_cuda.cu 2025-05-07T19:51:48.0274233Z Written: gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:51:48.0274712Z Written: gen_embedding_backward_lamb_split_unweighted_cuda.cu 2025-05-07T19:51:48.0275177Z Written: gen_embedding_backward_lamb_split_weighted_meta.cpp 2025-05-07T19:51:48.0275662Z Written: gen_embedding_backward_lamb_split_unweighted_nobag_meta.cpp 2025-05-07T19:51:48.0276141Z Written: gen_embedding_backward_lamb_split_unweighted_meta.cpp 2025-05-07T19:51:48.0276625Z Written: gen_embedding_backward_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:51:48.0277148Z Written: gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:51:48.0277785Z Written: gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:51:48.0278301Z Written: gen_embedding_backward_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:51:48.0278869Z Written: gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:51:48.0279447Z Written: gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:51:48.0279953Z Written: gen_embedding_optimizer_lamb_split_device_kernel.cuh 2025-05-07T19:51:48.0280416Z Written: gen_embedding_backward_split_lamb.cpp 2025-05-07T19:51:48.0280807Z Written: gen_embedding_split_lamb_pt2_autograd.cpp 2025-05-07T19:51:48.0281288Z Written: gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:51:48.0281702Z Written: lookup_lamb.py 2025-05-07T19:51:48.0282043Z Written: gen_embedding_backward_split_lamb_cpu.cpp 2025-05-07T19:51:48.0282517Z Written: gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:51:48.0283014Z Written: gen_embedding_backward_lars_sgd_split_weighted_cuda.cu 2025-05-07T19:51:48.0283581Z Written: gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:51:48.0284214Z Written: gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu 2025-05-07T19:51:48.0284719Z Written: gen_embedding_backward_lars_sgd_split_weighted_meta.cpp 2025-05-07T19:51:48.0285208Z Written: gen_embedding_backward_lars_sgd_split_unweighted_nobag_meta.cpp 2025-05-07T19:51:48.0285735Z Written: gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp 2025-05-07T19:51:48.0286254Z Written: gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:51:48.0286772Z Written: gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:51:48.0287335Z Written: gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:51:48.0287849Z Written: gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:51:48.0288419Z Written: gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:51:48.0288969Z Written: gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:51:48.0289489Z Written: gen_embedding_optimizer_lars_sgd_split_device_kernel.cuh 2025-05-07T19:51:48.0290024Z Written: gen_embedding_backward_split_lars_sgd.cpp 2025-05-07T19:51:48.0290412Z Written: gen_embedding_split_lars_sgd_pt2_autograd.cpp 2025-05-07T19:51:48.0291229Z Written: gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:51:48.0291658Z Written: lookup_lars_sgd.py 2025-05-07T19:51:48.0292031Z Written: gen_embedding_backward_split_lars_sgd_cpu.cpp 2025-05-07T19:51:48.0292499Z Written: gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:51:48.0293077Z Written: gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu 2025-05-07T19:51:48.0293728Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:51:48.0294358Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu 2025-05-07T19:51:48.0294990Z Written: gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp 2025-05-07T19:51:48.0295610Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_meta.cpp 2025-05-07T19:51:48.0296281Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp 2025-05-07T19:51:48.0296907Z Written: gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu 2025-05-07T19:51:48.0297601Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:51:48.0298302Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:51:48.0298946Z Written: gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu 2025-05-07T19:51:48.0299640Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:51:48.0300317Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:51:48.1120077Z Written: gen_embedding_optimizer_partial_rowwise_adam_split_device_kernel.cuh 2025-05-07T19:51:48.1121765Z Written: gen_embedding_backward_split_partial_rowwise_adam.cpp 2025-05-07T19:51:48.1123226Z Written: gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp 2025-05-07T19:51:48.1124452Z Written: gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:51:48.1124907Z Written: lookup_partial_rowwise_adam.py 2025-05-07T19:51:48.1125327Z Written: gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp 2025-05-07T19:51:48.1125850Z Written: gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:51:48.1126433Z Written: gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu 2025-05-07T19:51:48.1127025Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:51:48.1127606Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu 2025-05-07T19:51:48.1128201Z Written: gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp 2025-05-07T19:51:48.1128774Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_meta.cpp 2025-05-07T19:51:48.1129391Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp 2025-05-07T19:51:48.1129968Z Written: gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:51:48.1130612Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:51:48.1131259Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:51:48.1131850Z Written: gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:51:48.1132498Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:51:48.1133122Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:51:48.1133735Z Written: gen_embedding_optimizer_partial_rowwise_lamb_split_device_kernel.cuh 2025-05-07T19:51:48.1134261Z Written: gen_embedding_backward_split_partial_rowwise_lamb.cpp 2025-05-07T19:51:48.1135049Z Written: gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp 2025-05-07T19:51:48.1135675Z Written: gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:51:48.1136156Z Written: lookup_partial_rowwise_lamb.py 2025-05-07T19:51:48.1136545Z Written: gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp 2025-05-07T19:51:48.1137088Z Written: gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:51:48.1137630Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu 2025-05-07T19:51:48.1138175Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu 2025-05-07T19:51:48.1138677Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu 2025-05-07T19:51:48.1139208Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu 2025-05-07T19:51:48.1139752Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu 2025-05-07T19:51:48.1140306Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:51:48.1140875Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu 2025-05-07T19:51:48.1141418Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu 2025-05-07T19:51:48.1141976Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu 2025-05-07T19:51:48.1142492Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu 2025-05-07T19:51:48.1143041Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp 2025-05-07T19:51:48.1143611Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp 2025-05-07T19:51:48.1144124Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_meta.cpp 2025-05-07T19:51:48.1144803Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp 2025-05-07T19:51:48.1145337Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_meta.cpp 2025-05-07T19:51:48.1145929Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_meta.cpp 2025-05-07T19:51:48.1146508Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp 2025-05-07T19:51:48.1147057Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp 2025-05-07T19:51:48.1147628Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_meta.cpp 2025-05-07T19:51:48.1148157Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp 2025-05-07T19:51:48.1148731Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu 2025-05-07T19:51:48.1149306Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:51:48.1149883Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu 2025-05-07T19:51:48.1150453Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:51:48.1151022Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu 2025-05-07T19:51:48.1151977Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:51:48.1152622Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu 2025-05-07T19:51:48.1153296Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:51:48.1153911Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu 2025-05-07T19:51:48.1154539Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:51:48.1155165Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu 2025-05-07T19:51:48.1155782Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:51:48.1156406Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu 2025-05-07T19:51:48.1156993Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:51:48.1157729Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu 2025-05-07T19:51:48.1158445Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:51:48.1159095Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu 2025-05-07T19:51:48.1159768Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:51:48.1160399Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu 2025-05-07T19:51:48.1161043Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:51:48.1161706Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:51:48.1162348Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu 2025-05-07T19:51:48.1163032Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:51:48.1163698Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu 2025-05-07T19:51:48.1164470Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:51:48.1165107Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu 2025-05-07T19:51:48.1165721Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:51:48.1166374Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu 2025-05-07T19:51:48.1166960Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu 2025-05-07T19:51:48.1167547Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu 2025-05-07T19:51:48.1168113Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu 2025-05-07T19:51:48.1168773Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu 2025-05-07T19:51:48.1169330Z Written: gen_embedding_optimizer_rowwise_adagrad_ssd_device_kernel.cuh 2025-05-07T19:51:48.1169851Z Written: gen_embedding_optimizer_rowwise_adagrad_split_device_kernel.cuh 2025-05-07T19:51:48.1170356Z Written: gen_embedding_backward_ssd_rowwise_adagrad.cpp 2025-05-07T19:51:48.1170773Z Written: gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:51:48.1171283Z Written: gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:51:48.1171729Z Written: lookup_rowwise_adagrad_ssd.py 2025-05-07T19:51:48.1172113Z Written: gen_embedding_backward_split_rowwise_adagrad.cpp 2025-05-07T19:51:48.1172558Z Written: gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:51:48.1173037Z Written: gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:51:48.1173492Z Written: lookup_rowwise_adagrad.py 2025-05-07T19:51:48.1173853Z Written: gen_embedding_backward_rowwise_adagrad_split_cpu.cpp 2025-05-07T19:51:48.1174314Z Written: gen_embedding_backward_split_rowwise_adagrad_cpu.cpp 2025-05-07T19:51:48.1174799Z Written: gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:51:48.1175370Z Written: gen_embedding_optimizer_approx_rowwise_adagrad_split_device_kernel.cuh 2025-05-07T19:51:48.1175911Z Written: gen_embedding_backward_split_approx_rowwise_adagrad.cpp 2025-05-07T19:51:48.1176385Z Written: gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:51:48.1176938Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:51:48.1177479Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp 2025-05-07T19:51:48.1178034Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:51:48.1178647Z Written: gen_embedding_optimizer_rowwise_adagrad_with_weight_decay_split_device_kernel.cuh 2025-05-07T19:51:48.1179275Z Written: gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:51:48.1179861Z Written: gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:51:48.1180547Z Written: gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:51:48.1181197Z Written: gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:51:48.1181813Z Written: gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:51:48.1182515Z Written: gen_embedding_optimizer_approx_rowwise_adagrad_with_weight_decay_split_device_kernel.cuh 2025-05-07T19:51:48.1183196Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:51:48.1183800Z Written: gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:51:48.1184486Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:51:48.1185153Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:51:48.2174031Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:51:48.2175476Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu 2025-05-07T19:51:48.2176119Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu 2025-05-07T19:51:48.2176747Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu 2025-05-07T19:51:48.2177429Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu 2025-05-07T19:51:48.2178059Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu 2025-05-07T19:51:48.2178705Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp 2025-05-07T19:51:48.2179618Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp 2025-05-07T19:51:48.2180250Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_meta.cpp 2025-05-07T19:51:48.2180935Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp 2025-05-07T19:51:48.2181570Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp 2025-05-07T19:51:48.2182239Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:51:48.2182892Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu 2025-05-07T19:51:48.2183589Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:51:48.2184307Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:51:48.2184982Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu 2025-05-07T19:51:48.2185671Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:51:48.2186323Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu 2025-05-07T19:51:48.2187022Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:51:48.2187743Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:51:48.2188412Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu 2025-05-07T19:51:48.2189067Z Written: gen_embedding_optimizer_rowwise_adagrad_with_counter_split_device_kernel.cuh 2025-05-07T19:51:48.2189628Z Written: gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp 2025-05-07T19:51:48.2190169Z Written: gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:51:48.2191348Z Written: gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:51:48.2192097Z Written: lookup_rowwise_adagrad_with_counter.py 2025-05-07T19:51:48.2192606Z Written: gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:51:48.2193421Z Written: gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:51:48.2194142Z Written: gen_embedding_optimizer_approx_rowwise_adagrad_with_counter_split_device_kernel.cuh 2025-05-07T19:51:48.2194798Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp 2025-05-07T19:51:48.2195439Z Written: gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:51:48.2196143Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:51:48.2196819Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:51:48.2197531Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:51:48.2198298Z Written: gen_embedding_optimizer_rowwise_weighted_adagrad_split_device_kernel.cuh 2025-05-07T19:51:48.2198856Z Written: gen_embedding_backward_split_rowwise_weighted_adagrad.cpp 2025-05-07T19:51:48.2199349Z Written: gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp 2025-05-07T19:51:48.2199918Z Written: gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:51:48.2200491Z Written: gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp 2025-05-07T19:51:48.2201034Z Written: gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:51:48.2201578Z Written: gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu 2025-05-07T19:51:48.2202014Z Written: gen_embedding_backward_sgd_split_weighted_cuda.cu 2025-05-07T19:51:48.2202483Z Written: gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:51:48.2202976Z Written: gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu 2025-05-07T19:51:48.2204407Z Written: gen_embedding_backward_sgd_split_unweighted_cuda.cu 2025-05-07T19:51:48.2204885Z Written: gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp 2025-05-07T19:51:48.2205328Z Written: gen_embedding_backward_sgd_split_weighted_meta.cpp 2025-05-07T19:51:48.2205810Z Written: gen_embedding_backward_sgd_split_unweighted_nobag_meta.cpp 2025-05-07T19:51:48.2206286Z Written: gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp 2025-05-07T19:51:48.2206772Z Written: gen_embedding_backward_sgd_split_unweighted_meta.cpp 2025-05-07T19:51:48.2207244Z Written: gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:51:48.2207745Z Written: gen_embedding_backward_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:51:48.2208276Z Written: gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:51:48.2208789Z Written: gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:51:48.2209308Z Written: gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:51:48.2209799Z Written: gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:51:48.2210307Z Written: gen_embedding_backward_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:51:48.2210812Z Written: gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:51:48.2211366Z Written: gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:51:48.2211893Z Written: gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:51:48.2212355Z Written: gen_embedding_optimizer_sgd_split_device_kernel.cuh 2025-05-07T19:51:48.2212780Z Written: gen_embedding_backward_split_sgd.cpp 2025-05-07T19:51:48.2213146Z Written: gen_embedding_split_sgd_pt2_autograd.cpp 2025-05-07T19:51:48.2213585Z Written: gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:51:48.2213957Z Written: lookup_sgd.py 2025-05-07T19:51:48.2214275Z Written: gen_embedding_backward_sgd_split_cpu.cpp 2025-05-07T19:51:48.2214671Z Written: gen_embedding_backward_split_sgd_cpu.cpp 2025-05-07T19:51:48.2215069Z Written: gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:51:48.2215560Z Written: gen_embedding_optimizer_approx_sgd_split_device_kernel.cuh 2025-05-07T19:51:48.2216048Z Written: gen_embedding_backward_split_approx_sgd.cpp 2025-05-07T19:51:48.2216478Z Written: gen_embedding_split_approx_sgd_pt2_autograd.cpp 2025-05-07T19:51:48.2216930Z Written: gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:51:48.2217412Z Written: gen_embedding_backward_split_approx_sgd_cpu.cpp 2025-05-07T19:51:48.2217886Z Written: gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:51:48.2218346Z Written: gen_embedding_backward_none_split_weighted_cuda.cu 2025-05-07T19:51:48.2218833Z Written: gen_embedding_backward_none_split_unweighted_nobag_cuda.cu 2025-05-07T19:51:48.2219303Z Written: gen_embedding_backward_none_split_unweighted_cuda.cu 2025-05-07T19:51:48.2219767Z Written: gen_embedding_backward_none_split_weighted_meta.cpp 2025-05-07T19:51:48.2220231Z Written: gen_embedding_backward_none_split_unweighted_nobag_meta.cpp 2025-05-07T19:51:48.2220729Z Written: gen_embedding_backward_none_split_unweighted_meta.cpp 2025-05-07T19:51:48.2221216Z Written: gen_embedding_backward_none_split_weighted_kernel_cta.cu 2025-05-07T19:51:48.2221716Z Written: gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:51:48.2222247Z Written: gen_embedding_backward_none_split_unweighted_kernel_cta.cu 2025-05-07T19:51:48.2222724Z Written: gen_embedding_backward_none_split_weighted_kernel_warp.cu 2025-05-07T19:51:48.2242356Z Written: gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:51:48.2242907Z Written: gen_embedding_backward_none_split_unweighted_kernel_warp.cu 2025-05-07T19:51:48.2243427Z Written: gen_embedding_optimizer_none_split_device_kernel.cuh 2025-05-07T19:51:48.2243872Z Written: gen_embedding_backward_split_none.cpp 2025-05-07T19:51:48.2244409Z Written: gen_embedding_split_none_pt2_autograd.cpp 2025-05-07T19:51:48.2244863Z Written: gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp 2025-05-07T19:51:48.2245240Z Written: lookup_none.py 2025-05-07T19:51:48.2245559Z Written: gen_embedding_backward_split_none_cpu.cpp 2025-05-07T19:51:48.2245980Z Written: gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp 2025-05-07T19:51:48.2246475Z Written: gen_embedding_backward_split_weighted_device_kernel_hip.hip 2025-05-07T19:51:48.2247006Z Written: gen_embedding_backward_split_unweighted_nobag_device_kernel_hip.hip 2025-05-07T19:51:48.2247575Z Written: gen_embedding_backward_split_unweighted_device_kernel_hip.hip 2025-05-07T19:51:48.2248088Z Written: gen_embedding_backward_ssd_weighted_vbe_device_kernel.cuh 2025-05-07T19:51:48.2248561Z Written: gen_embedding_backward_split_weighted_vbe_device_kernel.cuh 2025-05-07T19:51:48.2249057Z Written: gen_embedding_backward_ssd_weighted_device_kernel.cuh 2025-05-07T19:51:48.2249516Z Written: gen_embedding_backward_split_weighted_device_kernel.cuh 2025-05-07T19:51:48.2250036Z Written: gen_embedding_backward_ssd_unweighted_nobag_device_kernel.cuh 2025-05-07T19:51:48.2250557Z Written: gen_embedding_backward_split_unweighted_nobag_device_kernel.cuh 2025-05-07T19:51:48.2251099Z Written: gen_embedding_backward_ssd_unweighted_vbe_device_kernel.cuh 2025-05-07T19:51:48.2251623Z Written: gen_embedding_backward_split_unweighted_vbe_device_kernel.cuh 2025-05-07T19:51:48.2252110Z Written: gen_embedding_backward_ssd_unweighted_device_kernel.cuh 2025-05-07T19:51:48.2252606Z Written: gen_embedding_backward_split_unweighted_device_kernel.cuh 2025-05-07T19:51:48.2253073Z Written: gen_embedding_backward_split_common_device_kernel.cuh 2025-05-07T19:51:48.2253543Z Written: gen_embedding_backward_split_grad_embedding_ops.cu 2025-05-07T19:51:48.2253997Z Written: gen_embedding_backward_dense_indice_weights_codegen_cuda.cu 2025-05-07T19:51:48.2254502Z Written: gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu 2025-05-07T19:51:48.2255011Z Written: gen_embedding_backward_split_indice_weights_codegen_cuda.cu 2025-05-07T19:51:48.2255402Z Written: pt2_arg_utils.h 2025-05-07T19:51:48.2255674Z Written: __init__.py 2025-05-07T19:51:48.2255922Z Written: lookup_args_ssd.py 2025-05-07T19:51:48.2256210Z Written: lookup_args.py 2025-05-07T19:51:48.2315266Z 2025-05-07T19:51:48.2315323Z 2025-05-07T19:51:48.2315899Z ================================================================================ 2025-05-07T19:51:48.2316565Z Running code generation script ... 2025-05-07T19:51:48.2317334Z /github/home/miniconda/envs/build_binary/bin/python /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_embedding_optimizer.py --opensource 2025-05-07T19:51:48.2318089Z ================================================================================ 2025-05-07T19:51:48.2318347Z 2025-05-07T19:51:48.3395121Z [ARGS PARSE] Parsed arguments: Namespace(install_dir='.', is_fbcode=False, is_rocm=False) 2025-05-07T19:51:48.3396641Z [GENERATE OPTIMIZERS]: ['/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_embedding_optimizer.py', '--opensource'] 2025-05-07T19:51:48.3397360Z Written: gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu 2025-05-07T19:51:48.3397851Z Written: gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu 2025-05-07T19:51:48.3398312Z Written: gen_embedding_optimizer_rowwise_adagrad_split.cpp 2025-05-07T19:51:48.3398808Z Written: gen_embedding_optimizer_rowwise_adagrad_split_device_kernel.cuh 2025-05-07T19:51:48.3399263Z Written: split_embedding_optimizer_rowwise_adagrad.py 2025-05-07T19:51:48.3399630Z Written: optimizer_args.py 2025-05-07T19:51:48.3506465Z 2025-05-07T19:51:48.3506589Z 2025-05-07T19:51:48.3507257Z ================================================================================ 2025-05-07T19:51:48.3507640Z Running code generation script ... 2025-05-07T19:51:48.3508372Z /github/home/miniconda/envs/build_binary/bin/python /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_forward_quantized.py --opensource 2025-05-07T19:51:48.3509364Z ================================================================================ 2025-05-07T19:51:48.3509585Z 2025-05-07T19:51:48.4694110Z [ARGS PARSE] Parsed arguments: Namespace(install_dir='.', is_fbcode=False, is_rocm=False) 2025-05-07T19:51:48.4696695Z [GENERATE FORWARD QUANTIZED]: ['/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_forward_quantized.py', '--opensource'] 2025-05-07T19:51:48.4699245Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu 2025-05-07T19:51:48.4701222Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu 2025-05-07T19:51:48.4703181Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu 2025-05-07T19:51:48.4705125Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu 2025-05-07T19:51:48.4706769Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu 2025-05-07T19:51:48.4707398Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu 2025-05-07T19:51:48.4708084Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu 2025-05-07T19:51:48.4708812Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu 2025-05-07T19:51:48.4709497Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu 2025-05-07T19:51:48.4710210Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu 2025-05-07T19:51:48.4710897Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu 2025-05-07T19:51:48.4711921Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu 2025-05-07T19:51:48.4712682Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu 2025-05-07T19:51:48.4713377Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu 2025-05-07T19:51:48.4714104Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu 2025-05-07T19:51:48.4714792Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu 2025-05-07T19:51:48.4715732Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu 2025-05-07T19:51:48.4716463Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu 2025-05-07T19:51:48.4717131Z Written: gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu 2025-05-07T19:51:48.4717943Z Written: gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu 2025-05-07T19:51:48.4718571Z Written: gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu 2025-05-07T19:51:48.4719155Z Written: gen_embedding_forward_quantized_weighted_codegen_cpu.cpp 2025-05-07T19:51:48.4719641Z Written: gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp 2025-05-07T19:51:48.4804465Z 2025-05-07T19:51:48.4804589Z 2025-05-07T19:51:48.4805103Z ================================================================================ 2025-05-07T19:51:48.4806228Z Running code generation script ... 2025-05-07T19:51:48.4807717Z /github/home/miniconda/envs/build_binary/bin/python /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_forward_split.py --opensource 2025-05-07T19:51:48.4808463Z ================================================================================ 2025-05-07T19:51:48.4808687Z 2025-05-07T19:51:48.8176065Z [ARGS PARSE] Parsed arguments: Namespace(install_dir='.', is_fbcode=False, is_rocm=False) 2025-05-07T19:51:48.8176927Z [GENERATE FORWARD SPLIT]: ['/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_forward_split.py', '--opensource'] 2025-05-07T19:51:48.8177669Z Written: gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu 2025-05-07T19:51:48.8178202Z Written: gen_embedding_forward_dense_weighted_codegen_cuda.cu 2025-05-07T19:51:48.8178975Z Written: gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:51:48.8179482Z Written: gen_embedding_forward_dense_unweighted_codegen_cuda.cu 2025-05-07T19:51:48.8179948Z Written: gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu 2025-05-07T19:51:48.8180456Z Written: gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu 2025-05-07T19:51:48.8180913Z Written: gen_embedding_forward_ssd_weighted_codegen_cuda.cu 2025-05-07T19:51:48.8181376Z Written: gen_embedding_forward_split_weighted_codegen_cuda.cu 2025-05-07T19:51:48.8181854Z Written: gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:51:48.8182342Z Written: gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:51:48.8182831Z Written: gen_embedding_forward_ssd_unweighted_codegen_cuda.cu 2025-05-07T19:51:48.8183298Z Written: gen_embedding_forward_split_unweighted_codegen_cuda.cu 2025-05-07T19:51:48.8183803Z Written: gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu 2025-05-07T19:51:48.8184310Z Written: gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu 2025-05-07T19:51:48.8184829Z Written: gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu 2025-05-07T19:51:48.8185370Z Written: gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu 2025-05-07T19:51:48.8185861Z Written: gen_embedding_forward_dense_weighted_vbe_codegen_meta.cpp 2025-05-07T19:51:48.8186353Z Written: gen_embedding_forward_dense_weighted_codegen_meta.cpp 2025-05-07T19:51:48.8186838Z Written: gen_embedding_forward_dense_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:51:48.8187349Z Written: gen_embedding_forward_dense_unweighted_codegen_meta.cpp 2025-05-07T19:51:48.8187819Z Written: gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp 2025-05-07T19:51:48.8188315Z Written: gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp 2025-05-07T19:51:48.8188795Z Written: gen_embedding_forward_ssd_weighted_codegen_meta.cpp 2025-05-07T19:51:48.8189248Z Written: gen_embedding_forward_split_weighted_codegen_meta.cpp 2025-05-07T19:51:48.8189743Z Written: gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:51:48.8190244Z Written: gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:51:48.8191338Z Written: gen_embedding_forward_ssd_unweighted_codegen_meta.cpp 2025-05-07T19:51:48.8191819Z Written: gen_embedding_forward_split_unweighted_codegen_meta.cpp 2025-05-07T19:51:48.8192295Z Written: gen_embedding_forward_dense_weighted_vbe_kernel.cu 2025-05-07T19:51:48.8192735Z Written: gen_embedding_forward_dense_weighted_kernel.cu 2025-05-07T19:51:48.8193167Z Written: gen_embedding_forward_dense_unweighted_nobag_kernel.cu 2025-05-07T19:51:48.8193636Z Written: gen_embedding_forward_dense_unweighted_vbe_kernel.cu 2025-05-07T19:51:48.8194077Z Written: gen_embedding_forward_dense_unweighted_kernel.cu 2025-05-07T19:51:48.8194507Z Written: gen_embedding_forward_ssd_weighted_vbe_kernel.cu 2025-05-07T19:51:48.8194931Z Written: gen_embedding_forward_split_weighted_vbe_kernel.cu 2025-05-07T19:51:48.8195363Z Written: gen_embedding_forward_ssd_weighted_kernel.cu 2025-05-07T19:51:48.8195761Z Written: gen_embedding_forward_split_weighted_kernel.cu 2025-05-07T19:51:48.8196200Z Written: gen_embedding_forward_ssd_unweighted_nobag_kernel.cu 2025-05-07T19:51:48.8196675Z Written: gen_embedding_forward_split_unweighted_nobag_kernel.cu 2025-05-07T19:51:48.8197122Z Written: gen_embedding_forward_ssd_unweighted_vbe_kernel.cu 2025-05-07T19:51:48.8197578Z Written: gen_embedding_forward_split_unweighted_vbe_kernel.cu 2025-05-07T19:51:48.8198006Z Written: gen_embedding_forward_ssd_unweighted_kernel.cu 2025-05-07T19:51:48.8198432Z Written: gen_embedding_forward_split_unweighted_kernel.cu 2025-05-07T19:51:48.8198872Z Written: gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu 2025-05-07T19:51:48.8199332Z Written: gen_embedding_forward_split_weighted_gwd_kernel.cu 2025-05-07T19:51:48.8199798Z Written: gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu 2025-05-07T19:51:48.8200346Z Written: gen_embedding_forward_split_unweighted_gwd_kernel.cu 2025-05-07T19:51:48.8200798Z Written: gen_embedding_forward_split_weighted_v2_kernel.cu 2025-05-07T19:51:48.8201230Z Written: gen_embedding_forward_split_unweighted_v2_kernel.cu 2025-05-07T19:51:48.8201728Z Written: gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu 2025-05-07T19:51:48.8202239Z Written: gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu 2025-05-07T19:51:48.8202761Z Written: gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu 2025-05-07T19:51:48.8203281Z Written: gen_embedding_forward_split_unweighted_nobag_kernel_small.cu 2025-05-07T19:51:48.8203859Z Written: gen_embedding_forward_split_pt2_cuda_wrapper.cpp 2025-05-07T19:51:48.8204265Z Written: gen_embedding_forward_split_pt2_cpu_wrapper.cpp 2025-05-07T19:51:48.8204647Z Written: gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp 2025-05-07T19:51:48.8302163Z 2025-05-07T19:51:48.8302170Z 2025-05-07T19:51:48.8302408Z ================================================================================ 2025-05-07T19:51:48.8302841Z Running code generation script ... 2025-05-07T19:51:48.8303575Z /github/home/miniconda/envs/build_binary/bin/python /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_index_select.py --opensource 2025-05-07T19:51:48.8304349Z ================================================================================ 2025-05-07T19:51:48.8304577Z 2025-05-07T19:51:49.0958807Z [ARGS PARSE] Parsed arguments: Namespace(install_dir='.', is_fbcode=False, is_rocm=False) 2025-05-07T19:51:49.0961225Z [INDEX SELECT GENERATOR]: ['/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_index_select.py', '--opensource'] 2025-05-07T19:51:49.0963271Z Written: gen_batch_index_select_dim0_forward_codegen_cuda.cu 2025-05-07T19:51:49.0964594Z Written: gen_batch_index_select_dim0_forward_kernel.cu 2025-05-07T19:51:49.0965706Z Written: gen_batch_index_select_dim0_forward_kernel_small.cu 2025-05-07T19:51:49.0966179Z Written: gen_batch_index_select_dim0_backward_codegen_cuda.cu 2025-05-07T19:51:49.0966640Z Written: gen_batch_index_select_dim0_backward_kernel_cta.cu 2025-05-07T19:51:49.0967100Z Written: gen_batch_index_select_dim0_backward_kernel_warp.cu 2025-05-07T19:51:49.0967578Z Written: gen_embedding_backward_split_batch_index_select_device_kernel.cuh 2025-05-07T19:51:49.0968384Z Written: gen_embedding_backward_split_grad_index_select.cu 2025-05-07T19:51:49.0968847Z Written: gen_embedding_backward_split_common_device_kernel.cuh 2025-05-07T19:51:49.1084893Z -- Adding merge_pooled_embeddings sources 2025-05-07T19:51:49.1098662Z 2025-05-07T19:51:49.1098793Z 2025-05-07T19:51:49.1099285Z ================================================================================ 2025-05-07T19:51:49.1100490Z GPU CPP Library Target: fbgemm_gpu_tbe_cache (SHARED) 2025-05-07T19:51:49.1101488Z 2025-05-07T19:51:49.1102012Z CPU_SRCS: 2025-05-07T19:51:49.1103171Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cpp 2025-05-07T19:51:49.1105110Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cpp 2025-05-07T19:51:49.1106532Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cpp 2025-05-07T19:51:49.1107286Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cpp 2025-05-07T19:51:49.1107891Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cpp 2025-05-07T19:51:49.1108483Z 2025-05-07T19:51:49.1108671Z GPU_SRCS: 2025-05-07T19:51:49.1109027Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_find.cu 2025-05-07T19:51:49.1109606Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate.cu 2025-05-07T19:51:49.1110216Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cu 2025-05-07T19:51:49.1110841Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cu 2025-05-07T19:51:49.1111739Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_find.cu 2025-05-07T19:51:49.1112584Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate.cu 2025-05-07T19:51:49.1113239Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cu 2025-05-07T19:51:49.1113837Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cu 2025-05-07T19:51:49.1114448Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/reset_weight_momentum.cu 2025-05-07T19:51:49.1115137Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cu 2025-05-07T19:51:49.1115629Z 2025-05-07T19:51:49.1115868Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:51:49.1116024Z 2025-05-07T19:51:49.1116113Z 2025-05-07T19:51:49.1116347Z HIP_SPECIFIC_SRCS: 2025-05-07T19:51:49.1116500Z 2025-05-07T19:51:49.1116589Z 2025-05-07T19:51:49.1116821Z OTHER_SRCS: 2025-05-07T19:51:49.1116949Z 2025-05-07T19:51:49.1117036Z 2025-05-07T19:51:49.1117267Z CC_FLAGS: 2025-05-07T19:51:49.1117390Z 2025-05-07T19:51:49.1117474Z 2025-05-07T19:51:49.1117702Z NVCC_FLAGS: 2025-05-07T19:51:49.1117970Z --expt-relaxed-constexpr 2025-05-07T19:51:49.1118257Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:51:49.1118587Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:51:49.1118899Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:51:49.1119202Z 2025-05-07T19:51:49.1119407Z HIPCC_FLAGS: 2025-05-07T19:51:49.1119568Z 2025-05-07T19:51:49.1119655Z 2025-05-07T19:51:49.1119857Z INCLUDE_DIRS: 2025-05-07T19:51:49.1120125Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:51:49.1120454Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:51:49.1120779Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:51:49.1121123Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:51:49.1121620Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:51:49.1122430Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:51:49.1123088Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:51:49.1123539Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:51:49.1124088Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:51:49.1124554Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:51:49.1125177Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:51:49.1125619Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:51:49.1126167Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:51:49.1126640Z 2025-05-07T19:51:49.1126868Z Selected Source Files: 2025-05-07T19:51:49.1127270Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cpp 2025-05-07T19:51:49.1127904Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cpp 2025-05-07T19:51:49.1128506Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cpp 2025-05-07T19:51:49.1129089Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cpp 2025-05-07T19:51:49.1129677Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cpp 2025-05-07T19:51:49.1130267Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_find.cu 2025-05-07T19:51:49.1130839Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate.cu 2025-05-07T19:51:49.1131422Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cu 2025-05-07T19:51:49.1132042Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cu 2025-05-07T19:51:49.1132635Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_find.cu 2025-05-07T19:51:49.1133181Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate.cu 2025-05-07T19:51:49.1133784Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cu 2025-05-07T19:51:49.1134334Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cu 2025-05-07T19:51:49.1134979Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/reset_weight_momentum.cu 2025-05-07T19:51:49.1135593Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cu 2025-05-07T19:51:49.1136069Z 2025-05-07T19:51:49.1136297Z HIPified Source Files: 2025-05-07T19:51:49.1136450Z 2025-05-07T19:51:49.1136533Z 2025-05-07T19:51:49.1136765Z Library Dependencies: 2025-05-07T19:51:49.1137005Z torch 2025-05-07T19:51:49.1137227Z torch_library 2025-05-07T19:51:49.1137641Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:51:49.1138214Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:51:49.1138782Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:51:49.1139540Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:51:49.1140181Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:51:49.1140669Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:51:49.1141072Z 2025-05-07T19:51:49.1141268Z Output Library: 2025-05-07T19:51:49.1141516Z fbgemm_gpu_tbe_cache 2025-05-07T19:51:49.1141740Z 2025-05-07T19:51:49.1141965Z Destination Directory: 2025-05-07T19:51:49.1142197Z fbgemm_gpu 2025-05-07T19:51:49.1142457Z ================================================================================ 2025-05-07T19:51:49.1142678Z 2025-05-07T19:51:49.1663101Z 2025-05-07T19:51:49.1663179Z 2025-05-07T19:51:49.1663495Z ================================================================================ 2025-05-07T19:51:49.1664150Z GPU CPP Library Target: fbgemm_gpu_tbe_inference (SHARED) 2025-05-07T19:51:49.1664594Z 2025-05-07T19:51:49.1664858Z CPU_SRCS: 2025-05-07T19:51:49.1665176Z codegen/inference/embedding_forward_quantized_host_cpu.cpp 2025-05-07T19:51:49.1665675Z gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp 2025-05-07T19:51:49.1666145Z gen_embedding_forward_quantized_weighted_codegen_cpu.cpp 2025-05-07T19:51:49.1666539Z 2025-05-07T19:51:49.1666737Z GPU_SRCS: 2025-05-07T19:51:49.1667053Z codegen/inference/embedding_forward_quantized_host.cpp 2025-05-07T19:51:49.1667734Z codegen/inference/embedding_forward_quantized_split_lookup.cu 2025-05-07T19:51:49.1668350Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu 2025-05-07T19:51:49.1669002Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu 2025-05-07T19:51:49.1669735Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu 2025-05-07T19:51:49.1670367Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu 2025-05-07T19:51:49.1670962Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu 2025-05-07T19:51:49.1671707Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu 2025-05-07T19:51:49.1672526Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu 2025-05-07T19:51:49.1673241Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu 2025-05-07T19:51:49.1673913Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu 2025-05-07T19:51:49.1674602Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu 2025-05-07T19:51:49.1675283Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu 2025-05-07T19:51:49.1675948Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu 2025-05-07T19:51:49.1676622Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu 2025-05-07T19:51:49.1677253Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu 2025-05-07T19:51:49.1678017Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu 2025-05-07T19:51:49.1678718Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu 2025-05-07T19:51:49.1679296Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu 2025-05-07T19:51:49.1679904Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu 2025-05-07T19:51:49.1680458Z gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu 2025-05-07T19:51:49.1681058Z gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu 2025-05-07T19:51:49.1681617Z gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu 2025-05-07T19:51:49.1682051Z 2025-05-07T19:51:49.1682282Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:51:49.1682425Z 2025-05-07T19:51:49.1682507Z 2025-05-07T19:51:49.1682732Z HIP_SPECIFIC_SRCS: 2025-05-07T19:51:49.1682869Z 2025-05-07T19:51:49.1682953Z 2025-05-07T19:51:49.1683162Z OTHER_SRCS: 2025-05-07T19:51:49.1683283Z 2025-05-07T19:51:49.1683362Z 2025-05-07T19:51:49.1683579Z CC_FLAGS: 2025-05-07T19:51:49.1683702Z 2025-05-07T19:51:49.1683785Z 2025-05-07T19:51:49.1684006Z NVCC_FLAGS: 2025-05-07T19:51:49.1684227Z --expt-relaxed-constexpr 2025-05-07T19:51:49.1684523Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:51:49.1684815Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:51:49.1685112Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:51:49.1685391Z 2025-05-07T19:51:49.1685579Z HIPCC_FLAGS: 2025-05-07T19:51:49.1685702Z 2025-05-07T19:51:49.1685805Z 2025-05-07T19:51:49.1685991Z INCLUDE_DIRS: 2025-05-07T19:51:49.1686245Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:51:49.1686551Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:51:49.1686850Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:51:49.1687154Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:51:49.1687644Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:51:49.1688402Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:51:49.1689014Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:51:49.1689444Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:51:49.1689862Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:51:49.1690386Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:51:49.1691295Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:51:49.1691877Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:51:49.1692473Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:51:49.1692973Z 2025-05-07T19:51:49.1693216Z Selected Source Files: 2025-05-07T19:51:49.1693560Z codegen/inference/embedding_forward_quantized_host_cpu.cpp 2025-05-07T19:51:49.1694048Z gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp 2025-05-07T19:51:49.1694493Z gen_embedding_forward_quantized_weighted_codegen_cpu.cpp 2025-05-07T19:51:49.1694952Z codegen/inference/embedding_forward_quantized_host.cpp 2025-05-07T19:51:49.1695444Z codegen/inference/embedding_forward_quantized_split_lookup.cu 2025-05-07T19:51:49.1695998Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu 2025-05-07T19:51:49.1696647Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu 2025-05-07T19:51:49.1697367Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu 2025-05-07T19:51:49.1697965Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu 2025-05-07T19:51:49.1698533Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu 2025-05-07T19:51:49.1699133Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu 2025-05-07T19:51:49.1699755Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu 2025-05-07T19:51:49.1700376Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu 2025-05-07T19:51:49.1701132Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu 2025-05-07T19:51:49.1701751Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu 2025-05-07T19:51:49.1702402Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu 2025-05-07T19:51:49.1703021Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu 2025-05-07T19:51:49.1703654Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu 2025-05-07T19:51:49.1704265Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu 2025-05-07T19:51:49.1704857Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu 2025-05-07T19:51:49.1705467Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu 2025-05-07T19:51:49.1706049Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu 2025-05-07T19:51:49.1706655Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu 2025-05-07T19:51:49.1707224Z gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu 2025-05-07T19:51:49.1707801Z gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu 2025-05-07T19:51:49.1708388Z gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu 2025-05-07T19:51:49.1708789Z 2025-05-07T19:51:49.1709029Z HIPified Source Files: 2025-05-07T19:51:49.1709185Z 2025-05-07T19:51:49.1709270Z 2025-05-07T19:51:49.1709514Z Library Dependencies: 2025-05-07T19:51:49.1709783Z torch 2025-05-07T19:51:49.1709982Z torch_library 2025-05-07T19:51:49.1710424Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:51:49.1710980Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:51:49.1711812Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:51:49.1712652Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:51:49.1713300Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:51:49.1713661Z asmjit 2025-05-07T19:51:49.1713955Z fbgemm 2025-05-07T19:51:49.1714147Z fbgemm_gpu_tbe_cache 2025-05-07T19:51:49.1714391Z fbgemm_gpu_config 2025-05-07T19:51:49.1714737Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:51:49.1715147Z 2025-05-07T19:51:49.1715334Z Output Library: 2025-05-07T19:51:49.1715577Z fbgemm_gpu_tbe_inference 2025-05-07T19:51:49.1715805Z 2025-05-07T19:51:49.1716011Z Destination Directory: 2025-05-07T19:51:49.1716260Z fbgemm_gpu 2025-05-07T19:51:49.1716491Z ================================================================================ 2025-05-07T19:51:49.1716723Z 2025-05-07T19:51:49.4354353Z 2025-05-07T19:51:49.4354657Z 2025-05-07T19:51:49.4355177Z ================================================================================ 2025-05-07T19:51:49.4355682Z GPU CPP Library Target: fbgemm_gpu_config (SHARED) 2025-05-07T19:51:49.4356048Z 2025-05-07T19:51:49.4356272Z CPU_SRCS: 2025-05-07T19:51:49.4356493Z src/config/feature_gates.cpp 2025-05-07T19:51:49.4356769Z 2025-05-07T19:51:49.4356979Z GPU_SRCS: 2025-05-07T19:51:49.4357102Z 2025-05-07T19:51:49.4357206Z 2025-05-07T19:51:49.4357408Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:51:49.4357565Z 2025-05-07T19:51:49.4357643Z 2025-05-07T19:51:49.4357854Z HIP_SPECIFIC_SRCS: 2025-05-07T19:51:49.4358016Z 2025-05-07T19:51:49.4358094Z 2025-05-07T19:51:49.4358295Z OTHER_SRCS: 2025-05-07T19:51:49.4358412Z 2025-05-07T19:51:49.4358491Z 2025-05-07T19:51:49.4358743Z CC_FLAGS: 2025-05-07T19:51:49.4358854Z 2025-05-07T19:51:49.4358957Z 2025-05-07T19:51:49.4359155Z NVCC_FLAGS: 2025-05-07T19:51:49.4359272Z 2025-05-07T19:51:49.4359349Z 2025-05-07T19:51:49.4359550Z HIPCC_FLAGS: 2025-05-07T19:51:49.4359672Z 2025-05-07T19:51:49.4359749Z 2025-05-07T19:51:49.4360237Z INCLUDE_DIRS: 2025-05-07T19:51:49.4360497Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:51:49.4360822Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:51:49.4361127Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:51:49.4361441Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:51:49.4361968Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:51:49.4362749Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:51:49.4363406Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:51:49.4363810Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:51:49.4364258Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:51:49.4364741Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:51:49.4365243Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:51:49.4365714Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:51:49.4366274Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:51:49.4366789Z 2025-05-07T19:51:49.4366993Z Selected Source Files: 2025-05-07T19:51:49.4367275Z src/config/feature_gates.cpp 2025-05-07T19:51:49.4367560Z 2025-05-07T19:51:49.4367779Z HIPified Source Files: 2025-05-07T19:51:49.4367930Z 2025-05-07T19:51:49.4368030Z 2025-05-07T19:51:49.4368222Z Library Dependencies: 2025-05-07T19:51:49.4368470Z torch 2025-05-07T19:51:49.4368665Z torch_library 2025-05-07T19:51:49.4369111Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:51:49.4369693Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:51:49.4370305Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:51:49.4371088Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:51:49.4371749Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:51:49.4372277Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:51:49.4372676Z 2025-05-07T19:51:49.4372886Z Output Library: 2025-05-07T19:51:49.4373261Z fbgemm_gpu_config 2025-05-07T19:51:49.4373507Z 2025-05-07T19:51:49.4373706Z Destination Directory: 2025-05-07T19:51:49.4373968Z fbgemm_gpu 2025-05-07T19:51:49.4374212Z ================================================================================ 2025-05-07T19:51:49.4374467Z 2025-05-07T19:51:49.4374519Z 2025-05-07T19:51:49.4374523Z 2025-05-07T19:51:49.4374660Z ================================================================================ 2025-05-07T19:51:49.4375038Z GPU CPP Library Target: fbgemm_gpu_tbe_utils (SHARED) 2025-05-07T19:51:49.4375387Z 2025-05-07T19:51:49.4375573Z CPU_SRCS: 2025-05-07T19:51:49.4375879Z src/split_embeddings_utils/split_embeddings_utils_cpu.cpp 2025-05-07T19:51:49.4376327Z src/split_embeddings_utils/split_embeddings_utils_meta.cpp 2025-05-07T19:51:49.4376695Z 2025-05-07T19:51:49.4376880Z GPU_SRCS: 2025-05-07T19:51:49.4377164Z src/split_embeddings_utils/split_embeddings_utils.cpp 2025-05-07T19:51:49.4377581Z src/split_embeddings_utils/generate_vbe_metadata.cu 2025-05-07T19:51:49.4377967Z src/split_embeddings_utils/get_infos_metadata.cu 2025-05-07T19:51:49.4378354Z src/split_embeddings_utils/radix_sort_pairs.cu 2025-05-07T19:51:49.4378751Z src/split_embeddings_utils/transpose_embedding_input.cu 2025-05-07T19:51:49.4379106Z 2025-05-07T19:51:49.4379302Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:51:49.4379462Z 2025-05-07T19:51:49.4379541Z 2025-05-07T19:51:49.4379734Z HIP_SPECIFIC_SRCS: 2025-05-07T19:51:49.4379893Z 2025-05-07T19:51:49.4379978Z 2025-05-07T19:51:49.4380186Z OTHER_SRCS: 2025-05-07T19:51:49.4380302Z 2025-05-07T19:51:49.4380379Z 2025-05-07T19:51:49.4380574Z CC_FLAGS: 2025-05-07T19:51:49.4380687Z 2025-05-07T19:51:49.4380765Z 2025-05-07T19:51:49.4380961Z NVCC_FLAGS: 2025-05-07T19:51:49.4381142Z 2025-05-07T19:51:49.4381220Z 2025-05-07T19:51:49.4381416Z HIPCC_FLAGS: 2025-05-07T19:51:49.4381537Z 2025-05-07T19:51:49.4381613Z 2025-05-07T19:51:49.4381813Z INCLUDE_DIRS: 2025-05-07T19:51:49.4382053Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:51:49.4382398Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:51:49.4382710Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:51:49.4383027Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:51:49.4383544Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:51:49.4384320Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:51:49.4384982Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:51:49.4385396Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:51:49.4385855Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:51:49.4386349Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:51:49.4386869Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:51:49.4387347Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:51:49.4387913Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:51:49.4388437Z 2025-05-07T19:51:49.4388641Z Selected Source Files: 2025-05-07T19:51:49.4388999Z src/split_embeddings_utils/split_embeddings_utils_cpu.cpp 2025-05-07T19:51:49.4389455Z src/split_embeddings_utils/split_embeddings_utils_meta.cpp 2025-05-07T19:51:49.4389914Z src/split_embeddings_utils/split_embeddings_utils.cpp 2025-05-07T19:51:49.4390338Z src/split_embeddings_utils/generate_vbe_metadata.cu 2025-05-07T19:51:49.4390910Z src/split_embeddings_utils/get_infos_metadata.cu 2025-05-07T19:51:49.4391440Z src/split_embeddings_utils/radix_sort_pairs.cu 2025-05-07T19:51:49.4391838Z src/split_embeddings_utils/transpose_embedding_input.cu 2025-05-07T19:51:49.4392212Z 2025-05-07T19:51:49.4392413Z HIPified Source Files: 2025-05-07T19:51:49.4392590Z 2025-05-07T19:51:49.4392668Z 2025-05-07T19:51:49.4392861Z Library Dependencies: 2025-05-07T19:51:49.4393109Z torch 2025-05-07T19:51:49.4393319Z torch_library 2025-05-07T19:51:49.4393959Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:51:49.4394570Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:51:49.4395160Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:51:49.4395960Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:51:49.4396607Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:51:49.4397130Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:51:49.4397537Z 2025-05-07T19:51:49.4397779Z Output Library: 2025-05-07T19:51:49.4398017Z fbgemm_gpu_tbe_utils 2025-05-07T19:51:49.4398242Z 2025-05-07T19:51:49.4398452Z Destination Directory: 2025-05-07T19:51:49.4398690Z fbgemm_gpu 2025-05-07T19:51:49.4398944Z ================================================================================ 2025-05-07T19:51:49.4399172Z 2025-05-07T19:51:49.4399180Z 2025-05-07T19:51:49.4399184Z 2025-05-07T19:51:49.4399300Z ================================================================================ 2025-05-07T19:51:49.4399717Z GPU CPP Library Target: fbgemm_gpu_sparse_async_cumsum (SHARED) 2025-05-07T19:51:49.4400090Z 2025-05-07T19:51:49.4400262Z CPU_SRCS: 2025-05-07T19:51:49.4400491Z src/sparse_ops/sparse_async_cumsum.cpp 2025-05-07T19:51:49.4400769Z 2025-05-07T19:51:49.4400953Z GPU_SRCS: 2025-05-07T19:51:49.4401166Z src/sparse_ops/sparse_async_cumsum.cu 2025-05-07T19:51:49.4401451Z 2025-05-07T19:51:49.4401633Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:51:49.4401786Z 2025-05-07T19:51:49.4401861Z 2025-05-07T19:51:49.4402043Z HIP_SPECIFIC_SRCS: 2025-05-07T19:51:49.4402272Z 2025-05-07T19:51:49.4402347Z 2025-05-07T19:51:49.4402543Z OTHER_SRCS: 2025-05-07T19:51:49.4402658Z 2025-05-07T19:51:49.4402733Z 2025-05-07T19:51:49.4402928Z CC_FLAGS: 2025-05-07T19:51:49.4403037Z 2025-05-07T19:51:49.4403116Z 2025-05-07T19:51:49.4403312Z NVCC_FLAGS: 2025-05-07T19:51:49.4403429Z 2025-05-07T19:51:49.4403503Z 2025-05-07T19:51:49.4403697Z HIPCC_FLAGS: 2025-05-07T19:51:49.4403815Z 2025-05-07T19:51:49.4403888Z 2025-05-07T19:51:49.4404078Z INCLUDE_DIRS: 2025-05-07T19:51:49.4404299Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:51:49.4404616Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:51:49.4404904Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:51:49.4405202Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:51:49.4405691Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:51:49.4406453Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:51:49.4407102Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:51:49.4407500Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:51:49.4407927Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:51:49.4408403Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:51:49.4408903Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:51:49.4409360Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:51:49.4409898Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:51:49.4410396Z 2025-05-07T19:51:49.4410582Z Selected Source Files: 2025-05-07T19:51:49.4410847Z src/sparse_ops/sparse_async_cumsum.cpp 2025-05-07T19:51:49.4411154Z src/sparse_ops/sparse_async_cumsum.cu 2025-05-07T19:51:49.4411441Z 2025-05-07T19:51:49.4411625Z HIPified Source Files: 2025-05-07T19:51:49.4411788Z 2025-05-07T19:51:49.4411863Z 2025-05-07T19:51:49.4412063Z Library Dependencies: 2025-05-07T19:51:49.4412291Z torch 2025-05-07T19:51:49.4412489Z torch_library 2025-05-07T19:51:49.4412909Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:51:49.4413615Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:51:49.4414207Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:51:49.4415003Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:51:49.4415659Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:51:49.4416032Z fbgemm_gpu_tbe_utils 2025-05-07T19:51:49.4416396Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:51:49.4416782Z 2025-05-07T19:51:49.4416981Z Output Library: 2025-05-07T19:51:49.4417209Z fbgemm_gpu_sparse_async_cumsum 2025-05-07T19:51:49.4417474Z 2025-05-07T19:51:49.4417681Z Destination Directory: 2025-05-07T19:51:49.4417929Z fbgemm_gpu 2025-05-07T19:51:49.4418248Z ================================================================================ 2025-05-07T19:51:49.4418494Z 2025-05-07T19:51:49.4418498Z 2025-05-07T19:51:49.4418502Z 2025-05-07T19:51:49.4418618Z ================================================================================ 2025-05-07T19:51:49.4419007Z GPU CPP Library Target: fbgemm_gpu_tbe_common (SHARED) 2025-05-07T19:51:49.4419334Z 2025-05-07T19:51:49.4419532Z CPU_SRCS: 2025-05-07T19:51:49.4419786Z codegen/utils/embedding_bounds_check_host_cpu.cpp 2025-05-07T19:51:49.4420216Z codegen/training/forward/embedding_forward_split_cpu.cpp 2025-05-07T19:51:49.4420605Z codegen/training/pt2/pt2_autograd_utils.cpp 2025-05-07T19:51:49.4420919Z 2025-05-07T19:51:49.4421115Z GPU_SRCS: 2025-05-07T19:51:49.4421343Z codegen/utils/embedding_bounds_check_v1.cu 2025-05-07T19:51:49.4421700Z codegen/utils/embedding_bounds_check_v2.cu 2025-05-07T19:51:49.4422043Z codegen/utils/embedding_bounds_check_host.cpp 2025-05-07T19:51:49.4422422Z 2025-05-07T19:51:49.4422608Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:51:49.4422760Z 2025-05-07T19:51:49.4422834Z 2025-05-07T19:51:49.4423013Z HIP_SPECIFIC_SRCS: 2025-05-07T19:51:49.4423164Z 2025-05-07T19:51:49.4423237Z 2025-05-07T19:51:49.4423412Z OTHER_SRCS: 2025-05-07T19:51:49.4423544Z 2025-05-07T19:51:49.4423618Z 2025-05-07T19:51:49.4423804Z CC_FLAGS: 2025-05-07T19:51:49.4423912Z 2025-05-07T19:51:49.4423985Z 2025-05-07T19:51:49.4424171Z NVCC_FLAGS: 2025-05-07T19:51:49.4424378Z --expt-relaxed-constexpr 2025-05-07T19:51:49.4424649Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:51:49.4424921Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:51:49.4425217Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:51:49.4425461Z 2025-05-07T19:51:49.4425649Z HIPCC_FLAGS: 2025-05-07T19:51:49.4425768Z 2025-05-07T19:51:49.4425841Z 2025-05-07T19:51:49.4426029Z INCLUDE_DIRS: 2025-05-07T19:51:49.4426266Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:51:49.4426565Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:51:49.4426854Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:51:49.4427154Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:51:49.4427638Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:51:49.4428407Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:51:49.4429058Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:51:49.4429457Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:51:49.4429884Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:51:49.4430354Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:51:49.4430852Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:51:49.4431392Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:51:49.4431932Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:51:49.4432436Z 2025-05-07T19:51:49.4432627Z Selected Source Files: 2025-05-07T19:51:49.4432931Z codegen/utils/embedding_bounds_check_host_cpu.cpp 2025-05-07T19:51:49.4433353Z codegen/training/forward/embedding_forward_split_cpu.cpp 2025-05-07T19:51:49.4433833Z codegen/training/pt2/pt2_autograd_utils.cpp 2025-05-07T19:51:49.4434192Z codegen/utils/embedding_bounds_check_host.cpp 2025-05-07T19:51:49.4434534Z codegen/utils/embedding_bounds_check_v1.cu 2025-05-07T19:51:49.4434880Z codegen/utils/embedding_bounds_check_v2.cu 2025-05-07T19:51:49.4435165Z 2025-05-07T19:51:49.4435381Z HIPified Source Files: 2025-05-07T19:51:49.4435533Z 2025-05-07T19:51:49.4435606Z 2025-05-07T19:51:49.4435807Z Library Dependencies: 2025-05-07T19:51:49.4436024Z torch 2025-05-07T19:51:49.4436220Z torch_library 2025-05-07T19:51:49.4436651Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:51:49.4437220Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:51:49.4437821Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:51:49.4438590Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:51:49.4439237Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:51:49.4439604Z fbgemm 2025-05-07T19:51:49.4439807Z fbgemm_gpu_config 2025-05-07T19:51:49.4440158Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:51:49.4440540Z 2025-05-07T19:51:49.4440736Z Output Library: 2025-05-07T19:51:49.4440948Z fbgemm_gpu_tbe_common 2025-05-07T19:51:49.4441177Z 2025-05-07T19:51:49.4441365Z Destination Directory: 2025-05-07T19:51:49.4441604Z fbgemm_gpu 2025-05-07T19:51:49.4441825Z ================================================================================ 2025-05-07T19:51:49.4442065Z 2025-05-07T19:51:49.4442069Z 2025-05-07T19:51:49.4442073Z 2025-05-07T19:51:49.4442184Z ================================================================================ 2025-05-07T19:51:49.4442639Z GPU CPP Library Target: fbgemm_gpu_tbe_optimizers (SHARED) 2025-05-07T19:51:49.4442977Z 2025-05-07T19:51:49.4443167Z CPU_SRCS: 2025-05-07T19:51:49.4443278Z 2025-05-07T19:51:49.4443351Z 2025-05-07T19:51:49.4443541Z GPU_SRCS: 2025-05-07T19:51:49.4443781Z gen_embedding_optimizer_rowwise_adagrad_split.cpp 2025-05-07T19:51:49.4444182Z gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu 2025-05-07T19:51:49.4444583Z gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu 2025-05-07T19:51:49.4444923Z 2025-05-07T19:51:49.4445123Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:51:49.4445261Z 2025-05-07T19:51:49.4445340Z 2025-05-07T19:51:49.4445538Z HIP_SPECIFIC_SRCS: 2025-05-07T19:51:49.4445671Z 2025-05-07T19:51:49.4445747Z 2025-05-07T19:51:49.4445941Z OTHER_SRCS: 2025-05-07T19:51:49.4446055Z 2025-05-07T19:51:49.4446129Z 2025-05-07T19:51:49.4446312Z CC_FLAGS: 2025-05-07T19:51:49.4446423Z 2025-05-07T19:51:49.4446512Z 2025-05-07T19:51:49.4446735Z NVCC_FLAGS: 2025-05-07T19:51:49.4446974Z --expt-relaxed-constexpr 2025-05-07T19:51:49.4447294Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:51:49.4447615Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:51:49.4447924Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:51:49.4448227Z 2025-05-07T19:51:49.4448433Z HIPCC_FLAGS: 2025-05-07T19:51:49.4448566Z 2025-05-07T19:51:49.4448679Z 2025-05-07T19:51:49.4448884Z INCLUDE_DIRS: 2025-05-07T19:51:49.4449163Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:51:49.4449493Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:51:49.4449819Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:51:49.4450147Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:51:49.4450687Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:51:49.4451513Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:51:49.4452170Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:51:49.4452637Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:51:49.4453085Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:51:49.4453681Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:51:49.4454217Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:51:49.4454723Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:51:49.4455324Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:51:49.4455847Z 2025-05-07T19:51:49.4456102Z Selected Source Files: 2025-05-07T19:51:49.4456420Z gen_embedding_optimizer_rowwise_adagrad_split.cpp 2025-05-07T19:51:49.4456867Z gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu 2025-05-07T19:51:49.4457301Z gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu 2025-05-07T19:51:49.4457694Z 2025-05-07T19:51:49.4457920Z HIPified Source Files: 2025-05-07T19:51:49.4458125Z 2025-05-07T19:51:49.4458218Z 2025-05-07T19:51:49.4458444Z Library Dependencies: 2025-05-07T19:51:49.4458721Z torch 2025-05-07T19:51:49.4458965Z torch_library 2025-05-07T19:51:49.4459411Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:51:49.4460040Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:51:49.4460644Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:51:49.4461460Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:51:49.4462118Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:51:49.4462661Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:51:49.4463099Z 2025-05-07T19:51:49.4463299Z Output Library: 2025-05-07T19:51:49.4463573Z fbgemm_gpu_tbe_optimizers 2025-05-07T19:51:49.4463831Z 2025-05-07T19:51:49.4464039Z Destination Directory: 2025-05-07T19:51:49.4464347Z fbgemm_gpu 2025-05-07T19:51:49.4464615Z ================================================================================ 2025-05-07T19:51:49.4464854Z 2025-05-07T19:51:49.4464858Z 2025-05-07T19:51:49.4464862Z 2025-05-07T19:51:49.4464988Z ================================================================================ 2025-05-07T19:51:49.4465422Z GPU CPP Library Target: fbgemm_gpu_tbe_training_forward (SHARED) 2025-05-07T19:51:49.4465818Z 2025-05-07T19:51:49.4466005Z CPU_SRCS: 2025-05-07T19:51:49.4466266Z gen_embedding_forward_split_pt2_cpu_wrapper.cpp 2025-05-07T19:51:49.4466577Z 2025-05-07T19:51:49.4466775Z GPU_SRCS: 2025-05-07T19:51:49.4467017Z gen_embedding_forward_split_weighted_kernel.cu 2025-05-07T19:51:49.4467396Z gen_embedding_forward_dense_weighted_kernel.cu 2025-05-07T19:51:49.4467748Z gen_embedding_forward_ssd_weighted_kernel.cu 2025-05-07T19:51:49.4468145Z gen_embedding_forward_split_unweighted_nobag_kernel.cu 2025-05-07T19:51:49.4468573Z gen_embedding_forward_dense_unweighted_nobag_kernel.cu 2025-05-07T19:51:49.4468981Z gen_embedding_forward_ssd_unweighted_nobag_kernel.cu 2025-05-07T19:51:49.4469381Z gen_embedding_forward_split_unweighted_kernel.cu 2025-05-07T19:51:49.4469746Z gen_embedding_forward_dense_unweighted_kernel.cu 2025-05-07T19:51:49.4470132Z gen_embedding_forward_ssd_unweighted_kernel.cu 2025-05-07T19:51:49.4470504Z gen_embedding_forward_split_weighted_codegen_cuda.cu 2025-05-07T19:51:49.4470924Z gen_embedding_forward_split_weighted_codegen_meta.cpp 2025-05-07T19:51:49.4471407Z gen_embedding_forward_split_unweighted_codegen_cuda.cu 2025-05-07T19:51:49.4471847Z gen_embedding_forward_split_unweighted_codegen_meta.cpp 2025-05-07T19:51:49.4472282Z gen_embedding_forward_dense_weighted_codegen_cuda.cu 2025-05-07T19:51:49.4472683Z gen_embedding_forward_dense_weighted_codegen_meta.cpp 2025-05-07T19:51:49.4473112Z gen_embedding_forward_dense_unweighted_codegen_cuda.cu 2025-05-07T19:51:49.4473525Z gen_embedding_forward_dense_unweighted_codegen_meta.cpp 2025-05-07T19:51:49.4473945Z gen_embedding_forward_ssd_weighted_codegen_cuda.cu 2025-05-07T19:51:49.4474323Z gen_embedding_forward_ssd_weighted_codegen_meta.cpp 2025-05-07T19:51:49.4474734Z gen_embedding_forward_ssd_unweighted_codegen_cuda.cu 2025-05-07T19:51:49.4475214Z gen_embedding_forward_ssd_unweighted_codegen_meta.cpp 2025-05-07T19:51:49.4475626Z gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu 2025-05-07T19:51:49.4476074Z gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp 2025-05-07T19:51:49.4476484Z gen_embedding_forward_split_weighted_vbe_kernel.cu 2025-05-07T19:51:49.4476885Z gen_embedding_forward_split_weighted_v2_kernel.cu 2025-05-07T19:51:49.4477283Z gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu 2025-05-07T19:51:49.4477745Z gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu 2025-05-07T19:51:49.4478159Z gen_embedding_forward_split_weighted_gwd_kernel.cu 2025-05-07T19:51:49.4478582Z gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu 2025-05-07T19:51:49.4479008Z gen_embedding_forward_dense_weighted_vbe_kernel.cu 2025-05-07T19:51:49.4479398Z gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu 2025-05-07T19:51:49.4479823Z gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu 2025-05-07T19:51:49.4480235Z gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp 2025-05-07T19:51:49.4480642Z gen_embedding_forward_ssd_weighted_vbe_kernel.cu 2025-05-07T19:51:49.4481042Z gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:51:49.4481500Z gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:51:49.4481947Z gen_embedding_forward_split_unweighted_vbe_kernel.cu 2025-05-07T19:51:49.4482343Z gen_embedding_forward_split_unweighted_v2_kernel.cu 2025-05-07T19:51:49.4482773Z gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu 2025-05-07T19:51:49.4483230Z gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu 2025-05-07T19:51:49.4483693Z gen_embedding_forward_split_unweighted_gwd_kernel.cu 2025-05-07T19:51:49.4484116Z gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:51:49.4484615Z gen_embedding_forward_dense_unweighted_vbe_kernel.cu 2025-05-07T19:51:49.4485021Z gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu 2025-05-07T19:51:49.4485458Z gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:51:49.4485896Z gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:51:49.4486300Z gen_embedding_forward_ssd_unweighted_vbe_kernel.cu 2025-05-07T19:51:49.4486734Z gen_embedding_forward_split_unweighted_nobag_kernel_small.cu 2025-05-07T19:51:49.4487185Z gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu 2025-05-07T19:51:49.4487647Z gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu 2025-05-07T19:51:49.4488044Z gen_embedding_forward_split_pt2_cuda_wrapper.cpp 2025-05-07T19:51:49.4488423Z gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp 2025-05-07T19:51:49.4488743Z 2025-05-07T19:51:49.4488938Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:51:49.4489076Z 2025-05-07T19:51:49.4489171Z 2025-05-07T19:51:49.4489357Z HIP_SPECIFIC_SRCS: 2025-05-07T19:51:49.4489494Z 2025-05-07T19:51:49.4489588Z 2025-05-07T19:51:49.4489770Z OTHER_SRCS: 2025-05-07T19:51:49.4489905Z 2025-05-07T19:51:49.4489982Z 2025-05-07T19:51:49.4490161Z CC_FLAGS: 2025-05-07T19:51:49.4490294Z 2025-05-07T19:51:49.4490370Z 2025-05-07T19:51:49.4490678Z NVCC_FLAGS: 2025-05-07T19:51:49.4490909Z --expt-relaxed-constexpr 2025-05-07T19:51:49.4491314Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:51:49.4491589Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:51:49.4491895Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:51:49.4492146Z 2025-05-07T19:51:49.4492338Z HIPCC_FLAGS: 2025-05-07T19:51:49.4492457Z 2025-05-07T19:51:49.4492535Z 2025-05-07T19:51:49.4492725Z INCLUDE_DIRS: 2025-05-07T19:51:49.4492954Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:51:49.4493274Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:51:49.4493546Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:51:49.4493853Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:51:49.4494344Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:51:49.4495113Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:51:49.4496595Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:51:49.4497008Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:51:49.4497443Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:51:49.4497896Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:51:49.4498411Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:51:49.4498867Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:51:49.4499407Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:51:49.4499911Z 2025-05-07T19:51:49.4500100Z Selected Source Files: 2025-05-07T19:51:49.4500404Z gen_embedding_forward_split_pt2_cpu_wrapper.cpp 2025-05-07T19:51:49.4500794Z gen_embedding_forward_split_weighted_codegen_meta.cpp 2025-05-07T19:51:49.4501212Z gen_embedding_forward_split_unweighted_codegen_meta.cpp 2025-05-07T19:51:49.4501619Z gen_embedding_forward_dense_weighted_codegen_meta.cpp 2025-05-07T19:51:49.4502032Z gen_embedding_forward_dense_unweighted_codegen_meta.cpp 2025-05-07T19:51:49.4502434Z gen_embedding_forward_ssd_weighted_codegen_meta.cpp 2025-05-07T19:51:49.4502817Z gen_embedding_forward_ssd_unweighted_codegen_meta.cpp 2025-05-07T19:51:49.4503235Z gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp 2025-05-07T19:51:49.4503644Z gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp 2025-05-07T19:51:49.4504077Z gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:51:49.4504512Z gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:51:49.4504918Z gen_embedding_forward_split_pt2_cuda_wrapper.cpp 2025-05-07T19:51:49.4505275Z gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp 2025-05-07T19:51:49.4505743Z gen_embedding_forward_split_weighted_kernel.cu 2025-05-07T19:51:49.4506112Z gen_embedding_forward_dense_weighted_kernel.cu 2025-05-07T19:51:49.4506461Z gen_embedding_forward_ssd_weighted_kernel.cu 2025-05-07T19:51:49.4506849Z gen_embedding_forward_split_unweighted_nobag_kernel.cu 2025-05-07T19:51:49.4507249Z gen_embedding_forward_dense_unweighted_nobag_kernel.cu 2025-05-07T19:51:49.4507659Z gen_embedding_forward_ssd_unweighted_nobag_kernel.cu 2025-05-07T19:51:49.4508034Z gen_embedding_forward_split_unweighted_kernel.cu 2025-05-07T19:51:49.4508407Z gen_embedding_forward_dense_unweighted_kernel.cu 2025-05-07T19:51:49.4508770Z gen_embedding_forward_ssd_unweighted_kernel.cu 2025-05-07T19:51:49.4509133Z gen_embedding_forward_split_weighted_codegen_cuda.cu 2025-05-07T19:51:49.4509537Z gen_embedding_forward_split_unweighted_codegen_cuda.cu 2025-05-07T19:51:49.4509927Z gen_embedding_forward_dense_weighted_codegen_cuda.cu 2025-05-07T19:51:49.4510333Z gen_embedding_forward_dense_unweighted_codegen_cuda.cu 2025-05-07T19:51:49.4510716Z gen_embedding_forward_ssd_weighted_codegen_cuda.cu 2025-05-07T19:51:49.4511110Z gen_embedding_forward_ssd_unweighted_codegen_cuda.cu 2025-05-07T19:51:49.4511588Z gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu 2025-05-07T19:51:49.4512065Z gen_embedding_forward_split_weighted_vbe_kernel.cu 2025-05-07T19:51:49.4512452Z gen_embedding_forward_split_weighted_v2_kernel.cu 2025-05-07T19:51:49.4512844Z gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu 2025-05-07T19:51:49.4513289Z gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu 2025-05-07T19:51:49.4513695Z gen_embedding_forward_split_weighted_gwd_kernel.cu 2025-05-07T19:51:49.4514105Z gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu 2025-05-07T19:51:49.4514494Z gen_embedding_forward_dense_weighted_vbe_kernel.cu 2025-05-07T19:51:49.4514892Z gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu 2025-05-07T19:51:49.4515295Z gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu 2025-05-07T19:51:49.4515677Z gen_embedding_forward_ssd_weighted_vbe_kernel.cu 2025-05-07T19:51:49.4516078Z gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:51:49.4516484Z gen_embedding_forward_split_unweighted_vbe_kernel.cu 2025-05-07T19:51:49.4516944Z gen_embedding_forward_split_unweighted_v2_kernel.cu 2025-05-07T19:51:49.4517347Z gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu 2025-05-07T19:51:49.4517810Z gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu 2025-05-07T19:51:49.4518243Z gen_embedding_forward_split_unweighted_gwd_kernel.cu 2025-05-07T19:51:49.4518660Z gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:51:49.4519080Z gen_embedding_forward_dense_unweighted_vbe_kernel.cu 2025-05-07T19:51:49.4519482Z gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu 2025-05-07T19:51:49.4519910Z gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:51:49.4520301Z gen_embedding_forward_ssd_unweighted_vbe_kernel.cu 2025-05-07T19:51:49.4520734Z gen_embedding_forward_split_unweighted_nobag_kernel_small.cu 2025-05-07T19:51:49.4521180Z gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu 2025-05-07T19:51:49.4521633Z gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu 2025-05-07T19:51:49.4521988Z 2025-05-07T19:51:49.4522178Z HIPified Source Files: 2025-05-07T19:51:49.4522328Z 2025-05-07T19:51:49.4522418Z 2025-05-07T19:51:49.4522610Z Library Dependencies: 2025-05-07T19:51:49.4522844Z torch 2025-05-07T19:51:49.4523024Z torch_library 2025-05-07T19:51:49.4523461Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:51:49.4524031Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:51:49.4524628Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:51:49.4525412Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:51:49.4526106Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:51:49.4526500Z fbgemm_gpu_tbe_common 2025-05-07T19:51:49.4526851Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:51:49.4527255Z 2025-05-07T19:51:49.4527444Z Output Library: 2025-05-07T19:51:49.4527688Z fbgemm_gpu_tbe_training_forward 2025-05-07T19:51:49.4528103Z 2025-05-07T19:51:49.4528312Z Destination Directory: 2025-05-07T19:51:49.4528541Z fbgemm_gpu 2025-05-07T19:51:49.4528784Z ================================================================================ 2025-05-07T19:51:49.4529013Z 2025-05-07T19:51:49.4529017Z 2025-05-07T19:51:49.4529021Z 2025-05-07T19:51:49.4529151Z ================================================================================ 2025-05-07T19:51:49.4529579Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward_pt2 (SHARED) 2025-05-07T19:51:49.4529980Z 2025-05-07T19:51:49.4530161Z CPU_SRCS: 2025-05-07T19:51:49.4530410Z gen_embedding_split_adagrad_pt2_autograd.cpp 2025-05-07T19:51:49.4530783Z gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:51:49.4531153Z gen_embedding_split_sgd_pt2_autograd.cpp 2025-05-07T19:51:49.4531492Z gen_embedding_split_adam_pt2_autograd.cpp 2025-05-07T19:51:49.4531823Z gen_embedding_split_lamb_pt2_autograd.cpp 2025-05-07T19:51:49.4532178Z gen_embedding_split_lars_sgd_pt2_autograd.cpp 2025-05-07T19:51:49.4532557Z gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp 2025-05-07T19:51:49.4532992Z gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp 2025-05-07T19:51:49.4533365Z gen_embedding_split_none_pt2_autograd.cpp 2025-05-07T19:51:49.4533774Z gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:51:49.4534211Z gen_embedding_split_approx_sgd_pt2_autograd.cpp 2025-05-07T19:51:49.4534609Z gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:51:49.4535106Z gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:51:49.4535662Z gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:51:49.4536226Z gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:51:49.4536714Z gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp 2025-05-07T19:51:49.4537208Z gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:51:49.4537604Z gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:51:49.4538065Z gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:51:49.4624508Z gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:51:49.4624946Z gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:51:49.4625379Z gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:51:49.4625806Z gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:51:49.4626324Z gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:51:49.4626889Z gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:51:49.4627380Z gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp 2025-05-07T19:51:49.4627884Z gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:51:49.4628414Z gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:51:49.4628939Z gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:51:49.4629547Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:51:49.4630233Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:51:49.4630894Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:51:49.4631604Z gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:51:49.4632036Z 2025-05-07T19:51:49.4632255Z GPU_SRCS: 2025-05-07T19:51:49.4632555Z gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:51:49.4633166Z gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:51:49.4633644Z gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:51:49.4634084Z gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:51:49.4634490Z gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:51:49.4634919Z gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:51:49.4635402Z gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:51:49.4635950Z gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:51:49.4636430Z gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp 2025-05-07T19:51:49.4636934Z gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:51:49.4637472Z gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:51:49.4637965Z gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:51:49.4638573Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:51:49.4639234Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:51:49.4639907Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:51:49.4640496Z gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:51:49.4641036Z gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:51:49.4641428Z 2025-05-07T19:51:49.4641638Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:51:49.4641788Z 2025-05-07T19:51:49.4641900Z 2025-05-07T19:51:49.4642095Z HIP_SPECIFIC_SRCS: 2025-05-07T19:51:49.4642246Z 2025-05-07T19:51:49.4642353Z 2025-05-07T19:51:49.4642555Z OTHER_SRCS: 2025-05-07T19:51:49.4642711Z 2025-05-07T19:51:49.4642796Z 2025-05-07T19:51:49.4642986Z CC_FLAGS: 2025-05-07T19:51:49.4643115Z 2025-05-07T19:51:49.4643199Z 2025-05-07T19:51:49.4643380Z NVCC_FLAGS: 2025-05-07T19:51:49.4643608Z --expt-relaxed-constexpr 2025-05-07T19:51:49.4643892Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:51:49.4644160Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:51:49.4644457Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:51:49.4644717Z 2025-05-07T19:51:49.4645020Z HIPCC_FLAGS: 2025-05-07T19:51:49.4645154Z 2025-05-07T19:51:49.4645235Z 2025-05-07T19:51:49.4645430Z INCLUDE_DIRS: 2025-05-07T19:51:49.4645656Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:51:49.4645977Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:51:49.4646257Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:51:49.4646679Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:51:49.4647162Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:51:49.4647914Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:51:49.4648544Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:51:49.4648938Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:51:49.4649362Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:51:49.4649981Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:51:49.4650500Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:51:49.4650959Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:51:49.4651502Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:51:49.4652006Z 2025-05-07T19:51:49.4652197Z Selected Source Files: 2025-05-07T19:51:49.4652486Z gen_embedding_split_adagrad_pt2_autograd.cpp 2025-05-07T19:51:49.4652852Z gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:51:49.4653226Z gen_embedding_split_sgd_pt2_autograd.cpp 2025-05-07T19:51:49.4653544Z gen_embedding_split_adam_pt2_autograd.cpp 2025-05-07T19:51:49.4653882Z gen_embedding_split_lamb_pt2_autograd.cpp 2025-05-07T19:51:49.4654285Z gen_embedding_split_lars_sgd_pt2_autograd.cpp 2025-05-07T19:51:49.4654667Z gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp 2025-05-07T19:51:49.4655097Z gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp 2025-05-07T19:51:49.4655476Z gen_embedding_split_none_pt2_autograd.cpp 2025-05-07T19:51:49.4655891Z gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:51:49.4656307Z gen_embedding_split_approx_sgd_pt2_autograd.cpp 2025-05-07T19:51:49.4656715Z gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:51:49.4657200Z gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:51:49.4657763Z gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:51:49.4658328Z gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:51:49.4658816Z gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp 2025-05-07T19:51:49.4659244Z gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:51:49.4659646Z gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:51:49.4660102Z gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:51:49.4660538Z gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:51:49.4660945Z gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:51:49.4661349Z gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:51:49.4661755Z gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:51:49.4662240Z gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:51:49.4662770Z gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:51:49.4663240Z gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp 2025-05-07T19:51:49.4663720Z gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:51:49.4664248Z gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:51:49.4664760Z gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:51:49.4665343Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:51:49.4666063Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:51:49.4666706Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:51:49.4667299Z gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:51:49.4667786Z gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:51:49.4668255Z gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:51:49.4668708Z gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:51:49.4669107Z gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:51:49.4669516Z gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:51:49.4669935Z gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:51:49.4670414Z gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:51:49.4670928Z gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:51:49.4671452Z gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp 2025-05-07T19:51:49.4671927Z gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:51:49.4672439Z gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:51:49.4672916Z gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:51:49.4673490Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:51:49.4674141Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:51:49.4674791Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:51:49.4675373Z gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:51:49.4675955Z gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:51:49.4676325Z 2025-05-07T19:51:49.4676507Z HIPified Source Files: 2025-05-07T19:51:49.4676655Z 2025-05-07T19:51:49.4676739Z 2025-05-07T19:51:49.4676925Z Library Dependencies: 2025-05-07T19:51:49.4677149Z torch 2025-05-07T19:51:49.4677329Z torch_library 2025-05-07T19:51:49.4677749Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:51:49.4678322Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:51:49.4678920Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:51:49.4679704Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:51:49.4680336Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:51:49.4680709Z fbgemm 2025-05-07T19:51:49.4680895Z fbgemm_gpu_config 2025-05-07T19:51:49.4681123Z fbgemm_gpu_tbe_cache 2025-05-07T19:51:49.4681336Z fbgemm_gpu_tbe_common 2025-05-07T19:51:49.4681568Z fbgemm_gpu_tbe_utils 2025-05-07T19:51:49.4681794Z fbgemm_gpu_sparse_async_cumsum 2025-05-07T19:51:49.4682171Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:51:49.4682548Z 2025-05-07T19:51:49.4682710Z Output Library: 2025-05-07T19:51:49.4682931Z fbgemm_gpu_tbe_training_backward_pt2 2025-05-07T19:51:49.4683190Z 2025-05-07T19:51:49.4683378Z Destination Directory: 2025-05-07T19:51:49.4683595Z fbgemm_gpu 2025-05-07T19:51:49.4683919Z ================================================================================ 2025-05-07T19:51:49.4684137Z 2025-05-07T19:51:49.4684142Z 2025-05-07T19:51:49.4684145Z 2025-05-07T19:51:49.4684244Z ================================================================================ 2025-05-07T19:51:49.4684640Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward (SHARED) 2025-05-07T19:51:49.4684991Z 2025-05-07T19:51:49.4685161Z CPU_SRCS: 2025-05-07T19:51:49.4685476Z codegen/training/backward/embedding_backward_dense_host_cpu.cpp 2025-05-07T19:51:49.4685884Z gen_embedding_backward_dense_split_cpu.cpp 2025-05-07T19:51:49.4686221Z gen_embedding_backward_split_adagrad_cpu.cpp 2025-05-07T19:51:49.4686633Z gen_embedding_backward_split_rowwise_adagrad_cpu.cpp 2025-05-07T19:51:49.4686990Z gen_embedding_backward_split_sgd_cpu.cpp 2025-05-07T19:51:49.4687301Z gen_embedding_backward_split_adam_cpu.cpp 2025-05-07T19:51:49.4687625Z gen_embedding_backward_split_lamb_cpu.cpp 2025-05-07T19:51:49.4687961Z gen_embedding_backward_split_lars_sgd_cpu.cpp 2025-05-07T19:51:49.4688337Z gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp 2025-05-07T19:51:49.4688765Z gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp 2025-05-07T19:51:49.4689124Z gen_embedding_backward_split_none_cpu.cpp 2025-05-07T19:51:49.4689521Z gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:51:49.4689934Z gen_embedding_backward_split_approx_sgd_cpu.cpp 2025-05-07T19:51:49.4690335Z gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp 2025-05-07T19:51:49.4691117Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:51:49.4691695Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:51:49.4692266Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:51:49.4692761Z gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp 2025-05-07T19:51:49.4693183Z gen_embedding_backward_adagrad_split_cpu.cpp 2025-05-07T19:51:49.4693543Z gen_embedding_backward_rowwise_adagrad_split_cpu.cpp 2025-05-07T19:51:49.4693910Z gen_embedding_backward_sgd_split_cpu.cpp 2025-05-07T19:51:49.4694188Z 2025-05-07T19:51:49.4694378Z GPU_SRCS: 2025-05-07T19:51:49.4694622Z gen_embedding_backward_split_grad_embedding_ops.cu 2025-05-07T19:51:49.4695047Z gen_embedding_backward_split_indice_weights_codegen_cuda.cu 2025-05-07T19:51:49.4695593Z gen_embedding_backward_dense_indice_weights_codegen_cuda.cu 2025-05-07T19:51:49.4696017Z gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu 2025-05-07T19:51:49.4696455Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu 2025-05-07T19:51:49.4696912Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu 2025-05-07T19:51:49.4697412Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu 2025-05-07T19:51:49.4697906Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu 2025-05-07T19:51:49.4698433Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu 2025-05-07T19:51:49.4698995Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu 2025-05-07T19:51:49.4699502Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu 2025-05-07T19:51:49.4699990Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu 2025-05-07T19:51:49.4700494Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu 2025-05-07T19:51:49.4700962Z gen_embedding_backward_adagrad_split_weighted_cuda.cu 2025-05-07T19:51:49.4701376Z gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:51:49.4701834Z gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:51:49.4702305Z gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:51:49.4702784Z gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:51:49.4703421Z gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:51:49.4703867Z gen_embedding_backward_adagrad_split_unweighted_cuda.cu 2025-05-07T19:51:49.4704395Z gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:51:49.4704824Z gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:51:49.4705264Z gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu 2025-05-07T19:51:49.4705716Z gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:51:49.4706186Z gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:51:49.4706676Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:51:49.4707176Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:51:49.4707791Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:51:49.4708281Z gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu 2025-05-07T19:51:49.4708763Z gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:51:49.4709263Z gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:51:49.4709691Z gen_embedding_backward_sgd_split_weighted_cuda.cu 2025-05-07T19:51:49.4710075Z gen_embedding_backward_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:51:49.4710461Z gen_embedding_backward_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:51:49.4710865Z gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:51:49.4711344Z gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:51:49.4711994Z gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:51:49.4712539Z gen_embedding_backward_sgd_split_unweighted_cuda.cu 2025-05-07T19:51:49.4712947Z gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:51:49.4713395Z gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:51:49.4713802Z gen_embedding_backward_adam_split_weighted_cuda.cu 2025-05-07T19:51:49.4714220Z gen_embedding_backward_adam_split_weighted_kernel_cta.cu 2025-05-07T19:51:49.4714636Z gen_embedding_backward_adam_split_weighted_kernel_warp.cu 2025-05-07T19:51:49.4715082Z gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:51:49.4715543Z gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:51:49.4716046Z gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:51:49.4716502Z gen_embedding_backward_adam_split_unweighted_cuda.cu 2025-05-07T19:51:49.4716977Z gen_embedding_backward_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:51:49.4717428Z gen_embedding_backward_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:51:49.4717837Z gen_embedding_backward_lamb_split_weighted_cuda.cu 2025-05-07T19:51:49.4718258Z gen_embedding_backward_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:51:49.4718679Z gen_embedding_backward_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:51:49.4719120Z gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:51:49.4719588Z gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:51:49.4720072Z gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:51:49.4720522Z gen_embedding_backward_lamb_split_unweighted_cuda.cu 2025-05-07T19:51:49.4720930Z gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:51:49.4721382Z gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:51:49.4721803Z gen_embedding_backward_lars_sgd_split_weighted_cuda.cu 2025-05-07T19:51:49.4722236Z gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:51:49.4722701Z gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:51:49.4723166Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:51:49.4723674Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:51:49.4724285Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:51:49.4724757Z gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu 2025-05-07T19:51:49.4725185Z gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:51:49.4725660Z gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:51:49.4726145Z gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu 2025-05-07T19:51:49.4726648Z gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu 2025-05-07T19:51:49.4727200Z gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu 2025-05-07T19:51:49.4727739Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:51:49.4728320Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:51:49.4728968Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:51:49.4729532Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu 2025-05-07T19:51:49.4730064Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:51:49.4730611Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:51:49.4731144Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu 2025-05-07T19:51:49.4731654Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:51:49.4732203Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:51:49.4732742Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:51:49.4733324Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:51:49.4733939Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:51:49.4734492Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu 2025-05-07T19:51:49.4735024Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:51:49.4735567Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:51:49.4736041Z gen_embedding_backward_none_split_weighted_cuda.cu 2025-05-07T19:51:49.4736429Z gen_embedding_backward_none_split_weighted_kernel_cta.cu 2025-05-07T19:51:49.4736850Z gen_embedding_backward_none_split_weighted_kernel_warp.cu 2025-05-07T19:51:49.4737281Z gen_embedding_backward_none_split_unweighted_nobag_cuda.cu 2025-05-07T19:51:49.4737724Z gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:51:49.4738259Z gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:51:49.4738690Z gen_embedding_backward_none_split_unweighted_cuda.cu 2025-05-07T19:51:49.4739104Z gen_embedding_backward_none_split_unweighted_kernel_cta.cu 2025-05-07T19:51:49.4739531Z gen_embedding_backward_none_split_unweighted_kernel_warp.cu 2025-05-07T19:51:49.4740027Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu 2025-05-07T19:51:49.4740599Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu 2025-05-07T19:51:49.4741186Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu 2025-05-07T19:51:49.4741794Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu 2025-05-07T19:51:49.4742414Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:51:49.4743077Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:51:49.4743685Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu 2025-05-07T19:51:49.4744281Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu 2025-05-07T19:51:49.4744904Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu 2025-05-07T19:51:49.4745337Z 2025-05-07T19:51:49.4745535Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:51:49.4745673Z 2025-05-07T19:51:49.4745746Z 2025-05-07T19:51:49.4745940Z HIP_SPECIFIC_SRCS: 2025-05-07T19:51:49.4746268Z gen_embedding_backward_split_unweighted_nobag_device_kernel_hip.hip 2025-05-07T19:51:49.4746744Z gen_embedding_backward_split_weighted_device_kernel_hip.hip 2025-05-07T19:51:49.4747198Z gen_embedding_backward_split_unweighted_device_kernel_hip.hip 2025-05-07T19:51:49.4747544Z 2025-05-07T19:51:49.4747731Z OTHER_SRCS: 2025-05-07T19:51:49.4747844Z 2025-05-07T19:51:49.4748025Z 2025-05-07T19:51:49.4748201Z CC_FLAGS: 2025-05-07T19:51:49.4748307Z 2025-05-07T19:51:49.4748377Z 2025-05-07T19:51:49.4748549Z NVCC_FLAGS: 2025-05-07T19:51:49.4748737Z --expt-relaxed-constexpr 2025-05-07T19:51:49.4749010Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:51:49.4749274Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:51:49.4749624Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:51:49.4749854Z 2025-05-07T19:51:49.4750018Z HIPCC_FLAGS: 2025-05-07T19:51:49.4750126Z 2025-05-07T19:51:49.4750199Z 2025-05-07T19:51:49.4750362Z INCLUDE_DIRS: 2025-05-07T19:51:49.4750585Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:51:49.4750866Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:51:49.4751133Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:51:49.4751486Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:51:49.4752146Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:51:49.4752965Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:51:49.4753622Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:51:49.4754039Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:51:49.4754454Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:51:49.4754931Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:51:49.4755435Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:51:49.4755888Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:51:49.4756449Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:51:49.4756933Z 2025-05-07T19:51:49.4757136Z Selected Source Files: 2025-05-07T19:51:49.4757483Z codegen/training/backward/embedding_backward_dense_host_cpu.cpp 2025-05-07T19:51:49.4757915Z gen_embedding_backward_dense_split_cpu.cpp 2025-05-07T19:51:49.4758252Z gen_embedding_backward_split_adagrad_cpu.cpp 2025-05-07T19:51:49.4758646Z gen_embedding_backward_split_rowwise_adagrad_cpu.cpp 2025-05-07T19:51:49.4759056Z gen_embedding_backward_split_sgd_cpu.cpp 2025-05-07T19:51:49.4759388Z gen_embedding_backward_split_adam_cpu.cpp 2025-05-07T19:51:49.4759709Z gen_embedding_backward_split_lamb_cpu.cpp 2025-05-07T19:51:49.4760058Z gen_embedding_backward_split_lars_sgd_cpu.cpp 2025-05-07T19:51:49.4760460Z gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp 2025-05-07T19:51:49.4760881Z gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp 2025-05-07T19:51:49.4761271Z gen_embedding_backward_split_none_cpu.cpp 2025-05-07T19:51:49.4761671Z gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:51:49.4762109Z gen_embedding_backward_split_approx_sgd_cpu.cpp 2025-05-07T19:51:49.4762504Z gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp 2025-05-07T19:51:49.4763002Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:51:49.4763574Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:51:49.4764217Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:51:49.4764684Z gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp 2025-05-07T19:51:49.4765060Z gen_embedding_backward_adagrad_split_cpu.cpp 2025-05-07T19:51:49.4765411Z gen_embedding_backward_rowwise_adagrad_split_cpu.cpp 2025-05-07T19:51:49.4765738Z gen_embedding_backward_sgd_split_cpu.cpp 2025-05-07T19:51:49.4766073Z gen_embedding_backward_split_grad_embedding_ops.cu 2025-05-07T19:51:49.4766468Z gen_embedding_backward_split_indice_weights_codegen_cuda.cu 2025-05-07T19:51:49.4766875Z gen_embedding_backward_dense_indice_weights_codegen_cuda.cu 2025-05-07T19:51:49.4767286Z gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu 2025-05-07T19:51:49.4767685Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu 2025-05-07T19:51:49.4768131Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu 2025-05-07T19:51:49.4768582Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu 2025-05-07T19:51:49.4769058Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu 2025-05-07T19:51:49.4769542Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu 2025-05-07T19:51:49.4770132Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu 2025-05-07T19:51:49.4770617Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu 2025-05-07T19:51:49.4771058Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu 2025-05-07T19:51:49.4771546Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu 2025-05-07T19:51:49.4771972Z gen_embedding_backward_adagrad_split_weighted_cuda.cu 2025-05-07T19:51:49.4772380Z gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:51:49.4772797Z gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:51:49.4773233Z gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:51:49.4773692Z gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:51:49.4774163Z gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:51:49.4774607Z gen_embedding_backward_adagrad_split_unweighted_cuda.cu 2025-05-07T19:51:49.4775011Z gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:51:49.4775451Z gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:51:49.4775879Z gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu 2025-05-07T19:51:49.4776333Z gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:51:49.4776812Z gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:51:49.4777450Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:51:49.4777989Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:51:49.4778539Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:51:49.4779069Z gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu 2025-05-07T19:51:49.4779625Z gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:51:49.4780147Z gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:51:49.4780604Z gen_embedding_backward_sgd_split_weighted_cuda.cu 2025-05-07T19:51:49.4780979Z gen_embedding_backward_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:51:49.4781394Z gen_embedding_backward_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:51:49.4781799Z gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:51:49.4782249Z gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:51:49.4782719Z gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:51:49.4783157Z gen_embedding_backward_sgd_split_unweighted_cuda.cu 2025-05-07T19:51:49.4783567Z gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:51:49.4783991Z gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:51:49.4784403Z gen_embedding_backward_adam_split_weighted_cuda.cu 2025-05-07T19:51:49.4784787Z gen_embedding_backward_adam_split_weighted_kernel_cta.cu 2025-05-07T19:51:49.4785204Z gen_embedding_backward_adam_split_weighted_kernel_warp.cu 2025-05-07T19:51:49.4785620Z gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:51:49.4786074Z gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:51:49.4786555Z gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:51:49.4786980Z gen_embedding_backward_adam_split_unweighted_cuda.cu 2025-05-07T19:51:49.4787392Z gen_embedding_backward_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:51:49.4787813Z gen_embedding_backward_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:51:49.4788232Z gen_embedding_backward_lamb_split_weighted_cuda.cu 2025-05-07T19:51:49.4788622Z gen_embedding_backward_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:51:49.4789045Z gen_embedding_backward_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:51:49.4789461Z gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:51:49.4789927Z gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:51:49.4790469Z gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:51:49.4791307Z gen_embedding_backward_lamb_split_unweighted_cuda.cu 2025-05-07T19:51:49.4791742Z gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:51:49.4792180Z gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:51:49.4792622Z gen_embedding_backward_lars_sgd_split_weighted_cuda.cu 2025-05-07T19:51:49.4793047Z gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:51:49.4793525Z gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:51:49.4793997Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:51:49.4794483Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:51:49.4794991Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:51:49.4795453Z gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu 2025-05-07T19:51:49.4795875Z gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:51:49.4796350Z gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:51:49.4796830Z gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu 2025-05-07T19:51:49.4797335Z gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu 2025-05-07T19:51:49.4797884Z gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu 2025-05-07T19:51:49.4798426Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:51:49.4799001Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:51:49.4799606Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:51:49.4800165Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu 2025-05-07T19:51:49.4800785Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:51:49.4801327Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:51:49.4801527Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu 2025-05-07T19:51:49.4801740Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:51:49.4801956Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:51:49.4802172Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:51:49.4802423Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:51:49.4802667Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:51:49.4802868Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu 2025-05-07T19:51:49.4803100Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:51:49.4803325Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:51:49.4803458Z gen_embedding_backward_none_split_weighted_cuda.cu 2025-05-07T19:51:49.4803733Z gen_embedding_backward_none_split_weighted_kernel_cta.cu 2025-05-07T19:51:49.4803882Z gen_embedding_backward_none_split_weighted_kernel_warp.cu 2025-05-07T19:51:49.4804039Z gen_embedding_backward_none_split_unweighted_nobag_cuda.cu 2025-05-07T19:51:49.4804218Z gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:51:49.4804512Z gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:51:49.4804639Z gen_embedding_backward_none_split_unweighted_cuda.cu 2025-05-07T19:51:49.4804786Z gen_embedding_backward_none_split_unweighted_kernel_cta.cu 2025-05-07T19:51:49.4804948Z gen_embedding_backward_none_split_unweighted_kernel_warp.cu 2025-05-07T19:51:49.4805149Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu 2025-05-07T19:51:49.4805378Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu 2025-05-07T19:51:49.4805615Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu 2025-05-07T19:51:49.4805923Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu 2025-05-07T19:51:49.4806173Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:51:49.4806437Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:51:49.4806643Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu 2025-05-07T19:51:49.4806866Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu 2025-05-07T19:51:49.4807095Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu 2025-05-07T19:51:49.4807179Z 2025-05-07T19:51:49.4807260Z HIPified Source Files: 2025-05-07T19:51:49.4807268Z 2025-05-07T19:51:49.4807329Z 2025-05-07T19:51:49.4807422Z Library Dependencies: 2025-05-07T19:51:49.4807485Z torch 2025-05-07T19:51:49.4807556Z torch_library 2025-05-07T19:51:49.4807841Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:51:49.4807985Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:51:49.4808282Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:51:49.4808592Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:51:49.4808765Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:51:49.4808832Z fbgemm 2025-05-07T19:51:49.4808906Z fbgemm_gpu_config 2025-05-07T19:51:49.4808987Z fbgemm_gpu_tbe_cache 2025-05-07T19:51:49.4809062Z fbgemm_gpu_tbe_common 2025-05-07T19:51:49.4809135Z fbgemm_gpu_tbe_utils 2025-05-07T19:51:49.4809223Z fbgemm_gpu_sparse_async_cumsum 2025-05-07T19:51:49.4809464Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:51:49.4809528Z 2025-05-07T19:51:49.4809602Z Output Library: 2025-05-07T19:51:49.4809696Z fbgemm_gpu_tbe_training_backward 2025-05-07T19:51:49.4809761Z 2025-05-07T19:51:49.4809841Z Destination Directory: 2025-05-07T19:51:49.4809913Z fbgemm_gpu 2025-05-07T19:51:49.4810020Z ================================================================================ 2025-05-07T19:51:49.4810025Z 2025-05-07T19:51:49.4810029Z 2025-05-07T19:51:49.4810032Z 2025-05-07T19:51:49.4810128Z ================================================================================ 2025-05-07T19:51:49.4810477Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward_gwd (SHARED) 2025-05-07T19:51:49.4810552Z 2025-05-07T19:51:49.4810622Z CPU_SRCS: 2025-05-07T19:51:49.4810626Z 2025-05-07T19:51:49.4810691Z 2025-05-07T19:51:49.4810768Z GPU_SRCS: 2025-05-07T19:51:49.4810951Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu 2025-05-07T19:51:49.4811159Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu 2025-05-07T19:51:49.4811372Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu 2025-05-07T19:51:49.4811560Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu 2025-05-07T19:51:49.4811768Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu 2025-05-07T19:51:49.4811978Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu 2025-05-07T19:51:49.4812179Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu 2025-05-07T19:51:49.4812390Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:51:49.4812607Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:51:49.4812820Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu 2025-05-07T19:51:49.4813043Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:51:49.4813273Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:51:49.4813348Z 2025-05-07T19:51:49.4813424Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:51:49.4813429Z 2025-05-07T19:51:49.4813494Z 2025-05-07T19:51:49.4813624Z HIP_SPECIFIC_SRCS: 2025-05-07T19:51:49.4813638Z 2025-05-07T19:51:49.4813703Z 2025-05-07T19:51:49.4813774Z OTHER_SRCS: 2025-05-07T19:51:49.4813778Z 2025-05-07T19:51:49.4813843Z 2025-05-07T19:51:49.4813924Z CC_FLAGS: 2025-05-07T19:51:49.4813928Z 2025-05-07T19:51:49.4813996Z 2025-05-07T19:51:49.4814067Z NVCC_FLAGS: 2025-05-07T19:51:49.4814167Z --expt-relaxed-constexpr 2025-05-07T19:51:49.4814258Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:51:49.4814351Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:51:49.4814437Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:51:49.4814512Z 2025-05-07T19:51:49.4814589Z HIPCC_FLAGS: 2025-05-07T19:51:49.4814593Z 2025-05-07T19:51:49.4814657Z 2025-05-07T19:51:49.4814746Z INCLUDE_DIRS: 2025-05-07T19:51:49.4814849Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:51:49.4814934Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:51:49.4815032Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:51:49.4815136Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:51:49.4815401Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:51:49.4815763Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:51:49.4815905Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:51:49.4816053Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:51:49.4816199Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:51:49.4816395Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:51:49.4816579Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:51:49.4816712Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:51:49.4817876Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:51:49.4817957Z 2025-05-07T19:51:49.4818045Z Selected Source Files: 2025-05-07T19:51:49.4818232Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu 2025-05-07T19:51:49.4818448Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu 2025-05-07T19:51:49.4818654Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu 2025-05-07T19:51:49.4818841Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu 2025-05-07T19:51:49.4819063Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu 2025-05-07T19:51:49.4819274Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu 2025-05-07T19:51:49.4819467Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu 2025-05-07T19:51:49.4819682Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:51:49.4819907Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:51:49.4820111Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu 2025-05-07T19:51:49.4820334Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:51:49.4820569Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:51:49.4820636Z 2025-05-07T19:51:49.4820720Z HIPified Source Files: 2025-05-07T19:51:49.4820725Z 2025-05-07T19:51:49.4820799Z 2025-05-07T19:51:49.4820882Z Library Dependencies: 2025-05-07T19:51:49.4820952Z torch 2025-05-07T19:51:49.4821025Z torch_library 2025-05-07T19:51:49.4821321Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:51:49.4821481Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:51:49.4821786Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:51:49.4822124Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:51:49.4822300Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:51:49.4822447Z fbgemm_gpu_tbe_training_backward 2025-05-07T19:51:49.4822648Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:51:49.4822715Z 2025-05-07T19:51:49.4822793Z Output Library: 2025-05-07T19:51:49.4822886Z fbgemm_gpu_tbe_training_backward_gwd 2025-05-07T19:51:49.4822960Z 2025-05-07T19:51:49.4823044Z Destination Directory: 2025-05-07T19:51:49.4823115Z fbgemm_gpu 2025-05-07T19:51:49.4823225Z ================================================================================ 2025-05-07T19:51:49.4823230Z 2025-05-07T19:51:49.4823233Z 2025-05-07T19:51:49.4823237Z 2025-05-07T19:51:49.4823335Z ================================================================================ 2025-05-07T19:51:49.4823522Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward_vbe (SHARED) 2025-05-07T19:51:49.4823602Z 2025-05-07T19:51:49.4823671Z CPU_SRCS: 2025-05-07T19:51:49.4823675Z 2025-05-07T19:51:49.4823741Z 2025-05-07T19:51:49.4823811Z GPU_SRCS: 2025-05-07T19:51:49.4824006Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp 2025-05-07T19:51:49.4824182Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp 2025-05-07T19:51:49.4824371Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp 2025-05-07T19:51:49.4824559Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp 2025-05-07T19:51:49.4824788Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp 2025-05-07T19:51:49.4825024Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp 2025-05-07T19:51:49.4825177Z gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp 2025-05-07T19:51:49.4825328Z gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp 2025-05-07T19:51:49.4825540Z gen_embedding_backward_dense_split_weighted_vbe_meta.cpp 2025-05-07T19:51:49.4825696Z gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp 2025-05-07T19:51:49.4825845Z gen_embedding_backward_adam_split_weighted_vbe_meta.cpp 2025-05-07T19:51:49.4826001Z gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp 2025-05-07T19:51:49.4826179Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu 2025-05-07T19:51:49.4826388Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:51:49.4826595Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:51:49.4826765Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu 2025-05-07T19:51:49.4826965Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu 2025-05-07T19:51:49.4827163Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu 2025-05-07T19:51:49.4827352Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu 2025-05-07T19:51:49.4827564Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:51:49.4827779Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:51:49.4827958Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu 2025-05-07T19:51:49.4828166Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu 2025-05-07T19:51:49.4828379Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu 2025-05-07T19:51:49.4828605Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu 2025-05-07T19:51:49.4828854Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:51:49.4829112Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:51:49.4829348Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu 2025-05-07T19:51:49.4829604Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:51:49.4829870Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:51:49.4830010Z gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu 2025-05-07T19:51:49.4830232Z gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:51:49.4830399Z gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:51:49.4830547Z gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu 2025-05-07T19:51:49.4830714Z gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:51:49.4830888Z gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:51:49.4831041Z gen_embedding_backward_dense_split_weighted_vbe_cuda.cu 2025-05-07T19:51:49.4831278Z gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:51:49.4831453Z gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:51:49.4831773Z gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu 2025-05-07T19:51:49.4831957Z gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:51:49.4832141Z gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:51:49.4832286Z gen_embedding_backward_adam_split_weighted_vbe_cuda.cu 2025-05-07T19:51:49.4832464Z gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:51:49.4832631Z gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:51:49.4832783Z gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu 2025-05-07T19:51:49.4832967Z gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:51:49.4833148Z gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:51:49.4833218Z 2025-05-07T19:51:49.4833307Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:51:49.4833311Z 2025-05-07T19:51:49.4833381Z 2025-05-07T19:51:49.4833463Z HIP_SPECIFIC_SRCS: 2025-05-07T19:51:49.4833468Z 2025-05-07T19:51:49.4833535Z 2025-05-07T19:51:49.4833615Z OTHER_SRCS: 2025-05-07T19:51:49.4833676Z 2025-05-07T19:51:49.4833746Z 2025-05-07T19:51:49.4833820Z CC_FLAGS: 2025-05-07T19:51:49.4833824Z 2025-05-07T19:51:49.4833900Z 2025-05-07T19:51:49.4833972Z NVCC_FLAGS: 2025-05-07T19:51:49.4834065Z --expt-relaxed-constexpr 2025-05-07T19:51:49.4834166Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:51:49.4834264Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:51:49.4834355Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:51:49.4834423Z 2025-05-07T19:51:49.4834508Z HIPCC_FLAGS: 2025-05-07T19:51:49.4834512Z 2025-05-07T19:51:49.4834581Z 2025-05-07T19:51:49.4834656Z INCLUDE_DIRS: 2025-05-07T19:51:49.4834769Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:51:49.4834859Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:51:49.4834959Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:51:49.4835058Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:51:49.4835340Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:51:49.4835717Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:51:49.4835858Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:51:49.4836019Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:51:49.4836169Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:51:49.4836364Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:51:49.4836566Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:51:49.4836704Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:51:49.4837000Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:51:49.4837070Z 2025-05-07T19:51:49.4837169Z Selected Source Files: 2025-05-07T19:51:49.4837361Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp 2025-05-07T19:51:49.4837543Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp 2025-05-07T19:51:49.4837762Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp 2025-05-07T19:51:49.4837954Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp 2025-05-07T19:51:49.4838242Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp 2025-05-07T19:51:49.4838496Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp 2025-05-07T19:51:49.4838644Z gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp 2025-05-07T19:51:49.4838799Z gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp 2025-05-07T19:51:49.4838947Z gen_embedding_backward_dense_split_weighted_vbe_meta.cpp 2025-05-07T19:51:49.4839115Z gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp 2025-05-07T19:51:49.4839259Z gen_embedding_backward_adam_split_weighted_vbe_meta.cpp 2025-05-07T19:51:49.4839410Z gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp 2025-05-07T19:51:49.4839599Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu 2025-05-07T19:51:49.4839811Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:51:49.4840021Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:51:49.4840205Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu 2025-05-07T19:51:49.4840402Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu 2025-05-07T19:51:49.4840605Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu 2025-05-07T19:51:49.4840795Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu 2025-05-07T19:51:49.4841015Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:51:49.4841238Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:51:49.4841427Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu 2025-05-07T19:51:49.4841656Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu 2025-05-07T19:51:49.4841926Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu 2025-05-07T19:51:49.4842163Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu 2025-05-07T19:51:49.4842441Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:51:49.4842707Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:51:49.4842955Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu 2025-05-07T19:51:49.4843237Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:51:49.4843506Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:51:49.4843653Z gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu 2025-05-07T19:51:49.4843923Z gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:51:49.4844096Z gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:51:49.4844239Z gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu 2025-05-07T19:51:49.4844403Z gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:51:49.4844584Z gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:51:49.4844726Z gen_embedding_backward_dense_split_weighted_vbe_cuda.cu 2025-05-07T19:51:49.4844889Z gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:51:49.4845069Z gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:51:49.4845217Z gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu 2025-05-07T19:51:49.4845389Z gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:51:49.4845564Z gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:51:49.4845717Z gen_embedding_backward_adam_split_weighted_vbe_cuda.cu 2025-05-07T19:51:49.4845876Z gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:51:49.4846039Z gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:51:49.4846202Z gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu 2025-05-07T19:51:49.4846371Z gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:51:49.4846591Z gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:51:49.4846678Z 2025-05-07T19:51:49.4846764Z HIPified Source Files: 2025-05-07T19:51:49.4846768Z 2025-05-07T19:51:49.4846841Z 2025-05-07T19:51:49.4846927Z Library Dependencies: 2025-05-07T19:51:49.4847013Z torch 2025-05-07T19:51:49.4847090Z torch_library 2025-05-07T19:51:49.4847371Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:51:49.4847538Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:51:49.4847836Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:51:49.4848151Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:51:49.4848337Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:51:49.4848431Z fbgemm_gpu_tbe_training_backward 2025-05-07T19:51:49.4848623Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:51:49.4848696Z 2025-05-07T19:51:49.4848788Z Output Library: 2025-05-07T19:51:49.4848886Z fbgemm_gpu_tbe_training_backward_vbe 2025-05-07T19:51:49.4848956Z 2025-05-07T19:51:49.4849055Z Destination Directory: 2025-05-07T19:51:49.4849132Z fbgemm_gpu 2025-05-07T19:51:49.4849238Z ================================================================================ 2025-05-07T19:51:49.4849243Z 2025-05-07T19:51:49.4849385Z 2025-05-07T19:51:49.4849389Z 2025-05-07T19:51:49.4849508Z ================================================================================ 2025-05-07T19:51:49.4849702Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward_dense (SHARED) 2025-05-07T19:51:49.4849772Z 2025-05-07T19:51:49.4849848Z CPU_SRCS: 2025-05-07T19:51:49.4849917Z 2025-05-07T19:51:49.4849987Z 2025-05-07T19:51:49.4850059Z GPU_SRCS: 2025-05-07T19:51:49.4850191Z gen_embedding_backward_dense_split_weighted_cuda.cu 2025-05-07T19:51:49.4850342Z gen_embedding_backward_dense_split_weighted_meta.cpp 2025-05-07T19:51:49.4850496Z gen_embedding_backward_dense_split_weighted_kernel_cta.cu 2025-05-07T19:51:49.4850649Z gen_embedding_backward_dense_split_weighted_kernel_warp.cu 2025-05-07T19:51:49.4850818Z gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu 2025-05-07T19:51:49.4850979Z gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp 2025-05-07T19:51:49.4851160Z gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:51:49.4851340Z gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:51:49.4851494Z gen_embedding_backward_dense_split_unweighted_cuda.cu 2025-05-07T19:51:49.4851634Z gen_embedding_backward_dense_split_unweighted_meta.cpp 2025-05-07T19:51:49.4851791Z gen_embedding_backward_dense_split_unweighted_kernel_cta.cu 2025-05-07T19:51:49.4851970Z gen_embedding_backward_dense_split_unweighted_kernel_warp.cu 2025-05-07T19:51:49.4852073Z gen_embedding_backward_split_dense.cpp 2025-05-07T19:51:49.4852142Z 2025-05-07T19:51:49.4852238Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:51:49.4852245Z 2025-05-07T19:51:49.4852314Z 2025-05-07T19:51:49.4852395Z HIP_SPECIFIC_SRCS: 2025-05-07T19:51:49.4852399Z 2025-05-07T19:51:49.4852468Z 2025-05-07T19:51:49.4852556Z OTHER_SRCS: 2025-05-07T19:51:49.4852560Z 2025-05-07T19:51:49.4852629Z 2025-05-07T19:51:49.4852702Z CC_FLAGS: 2025-05-07T19:51:49.4852706Z 2025-05-07T19:51:49.4852789Z 2025-05-07T19:51:49.4852862Z NVCC_FLAGS: 2025-05-07T19:51:49.4852953Z --expt-relaxed-constexpr 2025-05-07T19:51:49.4853042Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:51:49.4853149Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:51:49.4853241Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:51:49.4853310Z 2025-05-07T19:51:49.4853399Z HIPCC_FLAGS: 2025-05-07T19:51:49.4853403Z 2025-05-07T19:51:49.4853475Z 2025-05-07T19:51:49.4853550Z INCLUDE_DIRS: 2025-05-07T19:51:49.4853650Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:51:49.4853753Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:51:49.4853848Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:51:49.4854011Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:51:49.4854282Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:51:49.4854637Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:51:49.4854769Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:51:49.4854930Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:51:49.4855075Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:51:49.4855260Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:51:49.4855442Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:51:49.4855593Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:51:49.4855871Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:51:49.4855945Z 2025-05-07T19:51:49.4856051Z Selected Source Files: 2025-05-07T19:51:49.4856189Z gen_embedding_backward_dense_split_weighted_meta.cpp 2025-05-07T19:51:49.4856354Z gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp 2025-05-07T19:51:49.4856511Z gen_embedding_backward_dense_split_unweighted_meta.cpp 2025-05-07T19:51:49.4856615Z gen_embedding_backward_split_dense.cpp 2025-05-07T19:51:49.4856745Z gen_embedding_backward_dense_split_weighted_cuda.cu 2025-05-07T19:51:49.4856895Z gen_embedding_backward_dense_split_weighted_kernel_cta.cu 2025-05-07T19:51:49.4857059Z gen_embedding_backward_dense_split_weighted_kernel_warp.cu 2025-05-07T19:51:49.4857215Z gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu 2025-05-07T19:51:49.4857393Z gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:51:49.4857634Z gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:51:49.4857774Z gen_embedding_backward_dense_split_unweighted_cuda.cu 2025-05-07T19:51:49.4857934Z gen_embedding_backward_dense_split_unweighted_kernel_cta.cu 2025-05-07T19:51:49.4858109Z gen_embedding_backward_dense_split_unweighted_kernel_warp.cu 2025-05-07T19:51:49.4858179Z 2025-05-07T19:51:49.4858266Z HIPified Source Files: 2025-05-07T19:51:49.4858270Z 2025-05-07T19:51:49.4858339Z 2025-05-07T19:51:49.4858437Z Library Dependencies: 2025-05-07T19:51:49.4858508Z torch 2025-05-07T19:51:49.4858585Z torch_library 2025-05-07T19:51:49.4858882Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:51:49.4859036Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:51:49.4859328Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:51:49.4859640Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:51:49.4859814Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:51:49.4859906Z fbgemm_gpu_tbe_training_backward 2025-05-07T19:51:49.4860095Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:51:49.4860168Z 2025-05-07T19:51:49.4860242Z Output Library: 2025-05-07T19:51:49.4860340Z fbgemm_gpu_tbe_training_backward_dense 2025-05-07T19:51:49.4860411Z 2025-05-07T19:51:49.4860493Z Destination Directory: 2025-05-07T19:51:49.4860564Z fbgemm_gpu 2025-05-07T19:51:49.4860666Z ================================================================================ 2025-05-07T19:51:49.4860671Z 2025-05-07T19:51:49.4860675Z 2025-05-07T19:51:49.4860687Z 2025-05-07T19:51:49.4860784Z ================================================================================ 2025-05-07T19:51:49.4860997Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward_split_host (SHARED) 2025-05-07T19:51:49.4861069Z 2025-05-07T19:51:49.4861146Z CPU_SRCS: 2025-05-07T19:51:49.4861150Z 2025-05-07T19:51:49.4861216Z 2025-05-07T19:51:49.4861284Z GPU_SRCS: 2025-05-07T19:51:49.4861395Z gen_embedding_backward_split_adagrad.cpp 2025-05-07T19:51:49.4861564Z gen_embedding_backward_split_rowwise_adagrad.cpp 2025-05-07T19:51:49.4861660Z gen_embedding_backward_split_sgd.cpp 2025-05-07T19:51:49.4861754Z gen_embedding_backward_split_adam.cpp 2025-05-07T19:51:49.4861857Z gen_embedding_backward_split_lamb.cpp 2025-05-07T19:51:49.4861954Z gen_embedding_backward_split_lars_sgd.cpp 2025-05-07T19:51:49.4862088Z gen_embedding_backward_split_partial_rowwise_adam.cpp 2025-05-07T19:51:49.4862233Z gen_embedding_backward_split_partial_rowwise_lamb.cpp 2025-05-07T19:51:49.4862325Z gen_embedding_backward_split_none.cpp 2025-05-07T19:51:49.4862486Z gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp 2025-05-07T19:51:49.4862592Z gen_embedding_backward_split_approx_sgd.cpp 2025-05-07T19:51:49.4862744Z gen_embedding_backward_split_approx_rowwise_adagrad.cpp 2025-05-07T19:51:49.4862927Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp 2025-05-07T19:51:49.4863129Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:51:49.4863322Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:51:49.4863467Z gen_embedding_backward_split_rowwise_weighted_adagrad.cpp 2025-05-07T19:51:49.4863582Z gen_embedding_backward_ssd_rowwise_adagrad.cpp 2025-05-07T19:51:49.4863727Z gen_embedding_backward_adagrad_split_weighted_meta.cpp 2025-05-07T19:51:49.4863871Z gen_embedding_backward_adagrad_split_unweighted_meta.cpp 2025-05-07T19:51:49.4864032Z gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp 2025-05-07T19:51:49.4864208Z gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp 2025-05-07T19:51:49.4864330Z gen_embedding_backward_sgd_split_weighted_meta.cpp 2025-05-07T19:51:49.4864462Z gen_embedding_backward_sgd_split_unweighted_meta.cpp 2025-05-07T19:51:49.4864631Z gen_embedding_backward_adam_split_weighted_meta.cpp 2025-05-07T19:51:49.4864777Z gen_embedding_backward_adam_split_unweighted_meta.cpp 2025-05-07T19:51:49.4864902Z gen_embedding_backward_lamb_split_weighted_meta.cpp 2025-05-07T19:51:49.4865033Z gen_embedding_backward_lamb_split_unweighted_meta.cpp 2025-05-07T19:51:49.4865185Z gen_embedding_backward_lars_sgd_split_weighted_meta.cpp 2025-05-07T19:51:49.4865332Z gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp 2025-05-07T19:51:49.4865512Z gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp 2025-05-07T19:51:49.4865698Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp 2025-05-07T19:51:49.4865888Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp 2025-05-07T19:51:49.4866075Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp 2025-05-07T19:51:49.4866198Z gen_embedding_backward_none_split_weighted_meta.cpp 2025-05-07T19:51:49.4866340Z gen_embedding_backward_none_split_unweighted_meta.cpp 2025-05-07T19:51:49.4866551Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp 2025-05-07T19:51:49.4866767Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp 2025-05-07T19:51:49.4866840Z 2025-05-07T19:51:49.4866920Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:51:49.4866924Z 2025-05-07T19:51:49.4866989Z 2025-05-07T19:51:49.4867067Z HIP_SPECIFIC_SRCS: 2025-05-07T19:51:49.4867078Z 2025-05-07T19:51:49.4867142Z 2025-05-07T19:51:49.4867213Z OTHER_SRCS: 2025-05-07T19:51:49.4867217Z 2025-05-07T19:51:49.4867285Z 2025-05-07T19:51:49.4867360Z CC_FLAGS: 2025-05-07T19:51:49.4867364Z 2025-05-07T19:51:49.4867431Z 2025-05-07T19:51:49.4867499Z NVCC_FLAGS: 2025-05-07T19:51:49.4867590Z --expt-relaxed-constexpr 2025-05-07T19:51:49.4867676Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:51:49.4867768Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:51:49.4867853Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:51:49.4867926Z 2025-05-07T19:51:49.4868004Z HIPCC_FLAGS: 2025-05-07T19:51:49.4868008Z 2025-05-07T19:51:49.4868076Z 2025-05-07T19:51:49.4868160Z INCLUDE_DIRS: 2025-05-07T19:51:49.4868253Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:51:49.4868335Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:51:49.4868475Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:51:49.4868576Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:51:49.4868827Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:51:49.4869178Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:51:49.4869311Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:51:49.4869452Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:51:49.4869587Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:51:49.4869780Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:51:49.4869957Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:51:49.4870083Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:51:49.4870357Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:51:49.4870433Z 2025-05-07T19:51:49.4870511Z Selected Source Files: 2025-05-07T19:51:49.4870609Z gen_embedding_backward_split_adagrad.cpp 2025-05-07T19:51:49.4870734Z gen_embedding_backward_split_rowwise_adagrad.cpp 2025-05-07T19:51:49.4870826Z gen_embedding_backward_split_sgd.cpp 2025-05-07T19:51:49.4870918Z gen_embedding_backward_split_adam.cpp 2025-05-07T19:51:49.4871006Z gen_embedding_backward_split_lamb.cpp 2025-05-07T19:51:49.4871115Z gen_embedding_backward_split_lars_sgd.cpp 2025-05-07T19:51:49.4871322Z gen_embedding_backward_split_partial_rowwise_adam.cpp 2025-05-07T19:51:49.4871457Z gen_embedding_backward_split_partial_rowwise_lamb.cpp 2025-05-07T19:51:49.4871725Z gen_embedding_backward_split_none.cpp 2025-05-07T19:51:49.4871957Z gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp 2025-05-07T19:51:49.4872066Z gen_embedding_backward_split_approx_sgd.cpp 2025-05-07T19:51:49.4872226Z gen_embedding_backward_split_approx_rowwise_adagrad.cpp 2025-05-07T19:51:49.4872432Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp 2025-05-07T19:51:49.4872651Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:51:49.4872844Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:51:49.4873006Z gen_embedding_backward_split_rowwise_weighted_adagrad.cpp 2025-05-07T19:51:49.4873128Z gen_embedding_backward_ssd_rowwise_adagrad.cpp 2025-05-07T19:51:49.4873271Z gen_embedding_backward_adagrad_split_weighted_meta.cpp 2025-05-07T19:51:49.4873429Z gen_embedding_backward_adagrad_split_unweighted_meta.cpp 2025-05-07T19:51:49.4873605Z gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp 2025-05-07T19:51:49.4873788Z gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp 2025-05-07T19:51:49.4873927Z gen_embedding_backward_sgd_split_weighted_meta.cpp 2025-05-07T19:51:49.4874066Z gen_embedding_backward_sgd_split_unweighted_meta.cpp 2025-05-07T19:51:49.4874201Z gen_embedding_backward_adam_split_weighted_meta.cpp 2025-05-07T19:51:49.4874351Z gen_embedding_backward_adam_split_unweighted_meta.cpp 2025-05-07T19:51:49.4874493Z gen_embedding_backward_lamb_split_weighted_meta.cpp 2025-05-07T19:51:49.4874633Z gen_embedding_backward_lamb_split_unweighted_meta.cpp 2025-05-07T19:51:49.4874783Z gen_embedding_backward_lars_sgd_split_weighted_meta.cpp 2025-05-07T19:51:49.4874950Z gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp 2025-05-07T19:51:49.4875143Z gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp 2025-05-07T19:51:49.4875345Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp 2025-05-07T19:51:49.4875542Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp 2025-05-07T19:51:49.4875739Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp 2025-05-07T19:51:49.4875873Z gen_embedding_backward_none_split_weighted_meta.cpp 2025-05-07T19:51:49.4876018Z gen_embedding_backward_none_split_unweighted_meta.cpp 2025-05-07T19:51:49.4876300Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp 2025-05-07T19:51:49.4876530Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp 2025-05-07T19:51:49.4876601Z 2025-05-07T19:51:49.4876698Z HIPified Source Files: 2025-05-07T19:51:49.4876703Z 2025-05-07T19:51:49.4876772Z 2025-05-07T19:51:49.4876856Z Library Dependencies: 2025-05-07T19:51:49.4876930Z torch 2025-05-07T19:51:49.4877009Z torch_library 2025-05-07T19:51:49.4877307Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:51:49.4877466Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:51:49.4877789Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:51:49.4878132Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:51:49.4878311Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:51:49.4878400Z fbgemm_gpu_config 2025-05-07T19:51:49.4878486Z fbgemm_gpu_tbe_utils 2025-05-07T19:51:49.4878689Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:51:49.4878757Z 2025-05-07T19:51:49.4878843Z Output Library: 2025-05-07T19:51:49.4878955Z fbgemm_gpu_tbe_training_backward_split_host 2025-05-07T19:51:49.4879024Z 2025-05-07T19:51:49.4879120Z Destination Directory: 2025-05-07T19:51:49.4879193Z fbgemm_gpu 2025-05-07T19:51:49.4879303Z ================================================================================ 2025-05-07T19:51:49.4879307Z 2025-05-07T19:51:49.4879311Z 2025-05-07T19:51:49.4879315Z 2025-05-07T19:51:49.4879430Z ================================================================================ 2025-05-07T19:51:49.4879647Z GPU CPP Library Target: fbgemm_gpu_tbe_index_select (SHARED) 2025-05-07T19:51:49.4879717Z 2025-05-07T19:51:49.4879792Z CPU_SRCS: 2025-05-07T19:51:49.4880002Z codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp 2025-05-07T19:51:49.4880180Z codegen/training/index_select/batch_index_select_dim0_ops.cpp 2025-05-07T19:51:49.4880252Z 2025-05-07T19:51:49.4880334Z GPU_SRCS: 2025-05-07T19:51:49.4880518Z codegen/training/index_select/batch_index_select_dim0_host.cpp 2025-05-07T19:51:49.4880647Z gen_batch_index_select_dim0_forward_codegen_cuda.cu 2025-05-07T19:51:49.4880775Z gen_batch_index_select_dim0_forward_kernel.cu 2025-05-07T19:51:49.4880902Z gen_batch_index_select_dim0_forward_kernel_small.cu 2025-05-07T19:51:49.4881035Z gen_batch_index_select_dim0_backward_codegen_cuda.cu 2025-05-07T19:51:49.4881159Z gen_batch_index_select_dim0_backward_kernel_cta.cu 2025-05-07T19:51:49.4881293Z gen_batch_index_select_dim0_backward_kernel_warp.cu 2025-05-07T19:51:49.4881419Z gen_embedding_backward_split_grad_index_select.cu 2025-05-07T19:51:49.4881492Z 2025-05-07T19:51:49.4881585Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:51:49.4881589Z 2025-05-07T19:51:49.4881654Z 2025-05-07T19:51:49.4881735Z HIP_SPECIFIC_SRCS: 2025-05-07T19:51:49.4881739Z 2025-05-07T19:51:49.4881806Z 2025-05-07T19:51:49.4881895Z OTHER_SRCS: 2025-05-07T19:51:49.4881899Z 2025-05-07T19:51:49.4881968Z 2025-05-07T19:51:49.4882041Z CC_FLAGS: 2025-05-07T19:51:49.4882045Z 2025-05-07T19:51:49.4882122Z 2025-05-07T19:51:49.4882194Z NVCC_FLAGS: 2025-05-07T19:51:49.4882286Z --expt-relaxed-constexpr 2025-05-07T19:51:49.4882388Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:51:49.4882489Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:51:49.4882580Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:51:49.4882649Z 2025-05-07T19:51:49.4882736Z HIPCC_FLAGS: 2025-05-07T19:51:49.4882740Z 2025-05-07T19:51:49.4882810Z 2025-05-07T19:51:49.4882883Z INCLUDE_DIRS: 2025-05-07T19:51:49.4882993Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:51:49.4883087Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:51:49.4883181Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:51:49.4883279Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:51:49.4883635Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:51:49.4884111Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:51:49.4884238Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:51:49.4884404Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:51:49.4884550Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:51:49.4884729Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:51:49.4884921Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:51:49.4885055Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:51:49.4885333Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:51:49.4885404Z 2025-05-07T19:51:49.4885510Z Selected Source Files: 2025-05-07T19:51:49.4885703Z codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp 2025-05-07T19:51:49.4885885Z codegen/training/index_select/batch_index_select_dim0_ops.cpp 2025-05-07T19:51:49.4886072Z codegen/training/index_select/batch_index_select_dim0_host.cpp 2025-05-07T19:51:49.4886202Z gen_batch_index_select_dim0_forward_codegen_cuda.cu 2025-05-07T19:51:49.4886327Z gen_batch_index_select_dim0_forward_kernel.cu 2025-05-07T19:51:49.4886456Z gen_batch_index_select_dim0_forward_kernel_small.cu 2025-05-07T19:51:49.4886605Z gen_batch_index_select_dim0_backward_codegen_cuda.cu 2025-05-07T19:51:49.4886734Z gen_batch_index_select_dim0_backward_kernel_cta.cu 2025-05-07T19:51:49.4886859Z gen_batch_index_select_dim0_backward_kernel_warp.cu 2025-05-07T19:51:49.4886997Z gen_embedding_backward_split_grad_index_select.cu 2025-05-07T19:51:49.4887063Z 2025-05-07T19:51:49.4887202Z HIPified Source Files: 2025-05-07T19:51:49.4887206Z 2025-05-07T19:51:49.4887295Z 2025-05-07T19:51:49.4887380Z Library Dependencies: 2025-05-07T19:51:49.4887449Z torch 2025-05-07T19:51:49.4887527Z torch_library 2025-05-07T19:51:49.4887821Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:51:49.4887980Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:51:49.4888279Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:51:49.4888610Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:51:49.4888779Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:51:49.4888869Z fbgemm_gpu_sparse_async_cumsum 2025-05-07T19:51:49.4888965Z fbgemm_gpu_tbe_utils 2025-05-07T19:51:49.4889155Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:51:49.4889227Z 2025-05-07T19:51:49.4889308Z Output Library: 2025-05-07T19:51:49.4889416Z fbgemm_gpu_tbe_index_select 2025-05-07T19:51:49.4889480Z 2025-05-07T19:51:49.4889566Z Destination Directory: 2025-05-07T19:51:49.4889655Z fbgemm_gpu 2025-05-07T19:51:49.4889762Z ================================================================================ 2025-05-07T19:51:49.4889766Z 2025-05-07T19:51:49.4889770Z 2025-05-07T19:51:49.4889773Z 2025-05-07T19:51:49.4889875Z ================================================================================ 2025-05-07T19:51:49.4890071Z GPU CPP Library Target: fbgemm_gpu_embedding_inplace_ops (SHARED) 2025-05-07T19:51:49.4890141Z 2025-05-07T19:51:49.4890214Z CPU_SRCS: 2025-05-07T19:51:49.4890379Z src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp 2025-05-07T19:51:49.4890463Z 2025-05-07T19:51:49.4890653Z GPU_SRCS: 2025-05-07T19:51:49.4890976Z src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp 2025-05-07T19:51:49.4891140Z src/embedding_inplace_ops/embedding_inplace_update.cu 2025-05-07T19:51:49.4891217Z 2025-05-07T19:51:49.4891469Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:51:49.4891473Z 2025-05-07T19:51:49.4891551Z 2025-05-07T19:51:49.4891666Z HIP_SPECIFIC_SRCS: 2025-05-07T19:51:49.4891670Z 2025-05-07T19:51:49.4891746Z 2025-05-07T19:51:49.4891825Z OTHER_SRCS: 2025-05-07T19:51:49.4891914Z 2025-05-07T19:51:49.4892012Z 2025-05-07T19:51:49.4892095Z CC_FLAGS: 2025-05-07T19:51:49.4892099Z 2025-05-07T19:51:49.4892172Z 2025-05-07T19:51:49.4892251Z NVCC_FLAGS: 2025-05-07T19:51:49.4892365Z --expt-relaxed-constexpr 2025-05-07T19:51:49.4892458Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:51:49.4892560Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:51:49.4892687Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:51:49.4892763Z 2025-05-07T19:51:49.4892839Z HIPCC_FLAGS: 2025-05-07T19:51:49.4892843Z 2025-05-07T19:51:49.4892918Z 2025-05-07T19:51:49.4893019Z INCLUDE_DIRS: 2025-05-07T19:51:49.4893125Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:51:49.4893217Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:51:49.4893339Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:51:49.4893447Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:51:49.4893720Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:51:49.4894122Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:51:49.4894269Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:51:49.4894424Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:51:49.4894578Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:51:49.4894797Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:51:49.4894992Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:51:49.4895131Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:51:49.4895447Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:51:49.4895597Z 2025-05-07T19:51:49.4895691Z Selected Source Files: 2025-05-07T19:51:49.4895881Z src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp 2025-05-07T19:51:49.4896055Z src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp 2025-05-07T19:51:49.4896214Z src/embedding_inplace_ops/embedding_inplace_update.cu 2025-05-07T19:51:49.4896288Z 2025-05-07T19:51:49.4896385Z HIPified Source Files: 2025-05-07T19:51:49.4896389Z 2025-05-07T19:51:49.4896460Z 2025-05-07T19:51:49.4896553Z Library Dependencies: 2025-05-07T19:51:49.4896648Z torch 2025-05-07T19:51:49.4896733Z torch_library 2025-05-07T19:51:49.4897030Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:51:49.4897189Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:51:49.4897516Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:51:49.4897853Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:51:49.4898038Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:51:49.4898242Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:51:49.4898309Z 2025-05-07T19:51:49.4898390Z Output Library: 2025-05-07T19:51:49.4898498Z fbgemm_gpu_embedding_inplace_ops 2025-05-07T19:51:49.4898565Z 2025-05-07T19:51:49.4898651Z Destination Directory: 2025-05-07T19:51:49.4898727Z fbgemm_gpu 2025-05-07T19:51:49.4898840Z ================================================================================ 2025-05-07T19:51:49.4898844Z 2025-05-07T19:51:49.4898849Z 2025-05-07T19:51:49.4898853Z 2025-05-07T19:51:49.4898955Z ================================================================================ 2025-05-07T19:51:49.4899075Z GPU CPP Library Target: fbgemm_gpu_py (SHARED) 2025-05-07T19:51:49.4899151Z 2025-05-07T19:51:49.4899223Z CPU_SRCS: 2025-05-07T19:51:49.4899319Z src/memory_utils/memory_utils.cpp 2025-05-07T19:51:49.4899434Z src/memory_utils/memory_utils_ops.cpp 2025-05-07T19:51:49.4899627Z src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp 2025-05-07T19:51:49.4899831Z src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp 2025-05-07T19:51:49.4900081Z src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp 2025-05-07T19:51:49.4900299Z src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp 2025-05-07T19:51:49.4900503Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp 2025-05-07T19:51:49.4900732Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp 2025-05-07T19:51:49.4900888Z src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp 2025-05-07T19:51:49.4901014Z src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp 2025-05-07T19:51:49.4901139Z src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp 2025-05-07T19:51:49.4901261Z src/input_combine_ops/input_combine_cpu.cpp 2025-05-07T19:51:49.4901404Z src/layout_transform_ops/layout_transform_ops_cpu.cpp 2025-05-07T19:51:49.4901509Z src/quantize_ops/quantize_ops_cpu.cpp 2025-05-07T19:51:49.4901610Z src/quantize_ops/quantize_ops_meta.cpp 2025-05-07T19:51:49.4901740Z src/sparse_ops/sparse_async_batched_cumsum.cpp 2025-05-07T19:51:49.4901838Z src/sparse_ops/sparse_ops_cpu.cpp 2025-05-07T19:51:49.4901933Z src/sparse_ops/sparse_ops_meta.cpp 2025-05-07T19:51:49.4902029Z src/tbe/eeg/eeg_models.cpp 2025-05-07T19:51:49.4902117Z src/tbe/eeg/eeg_utils.cpp 2025-05-07T19:51:49.4902215Z src/tbe/eeg/indices_estimator_ops.cpp 2025-05-07T19:51:49.4902309Z src/tbe/eeg/indices_estimator.cpp 2025-05-07T19:51:49.4902416Z src/tbe/eeg/indices_generator_ops.cpp 2025-05-07T19:51:49.4902512Z src/tbe/eeg/indices_generator.cpp 2025-05-07T19:51:49.4902742Z src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp 2025-05-07T19:51:49.4902895Z src/layout_transform_ops/layout_transform_ops_gpu.cpp 2025-05-07T19:51:49.4903101Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp 2025-05-07T19:51:49.4903479Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp 2025-05-07T19:51:49.4903692Z src/quantize_ops/quantize_ops_gpu.cpp 2025-05-07T19:51:49.4903778Z src/sparse_ops/sparse_ops_gpu.cpp 2025-05-07T19:51:49.4903869Z src/metric_ops/metric_ops_host.cpp 2025-05-07T19:51:49.4903972Z src/input_combine_ops/input_combine_gpu.cpp 2025-05-07T19:51:49.4904156Z src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp 2025-05-07T19:51:49.4904232Z src/topology_utils.cpp 2025-05-07T19:51:49.4904298Z 2025-05-07T19:51:49.4904378Z GPU_SRCS: 2025-05-07T19:51:49.4904475Z src/histogram_binning_calibration_ops.cu 2025-05-07T19:51:49.4904568Z src/input_combine_ops/input_combine.cu 2025-05-07T19:51:49.4904758Z src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu 2025-05-07T19:51:49.4904851Z src/memory_utils/memory_utils.cu 2025-05-07T19:51:49.4904942Z src/memory_utils/memory_utils_ops.cu 2025-05-07T19:51:49.4905116Z src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu 2025-05-07T19:51:49.4905296Z src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu 2025-05-07T19:51:49.4905410Z src/jagged_tensor_ops/dense_to_jagged_forward.cu 2025-05-07T19:51:49.4905525Z src/jagged_tensor_ops/jagged_dense_bmm_forward.cu 2025-05-07T19:51:49.4905761Z src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu 2025-05-07T19:51:49.4905920Z src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu 2025-05-07T19:51:49.4906074Z src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu 2025-05-07T19:51:49.4906199Z src/jagged_tensor_ops/jagged_index_add_2d_forward.cu 2025-05-07T19:51:49.4906340Z src/jagged_tensor_ops/jagged_index_select_2d_forward.cu 2025-05-07T19:51:49.4906459Z src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu 2025-05-07T19:51:49.4906572Z src/jagged_tensor_ops/jagged_softmax_backward.cu 2025-05-07T19:51:49.4906692Z src/jagged_tensor_ops/jagged_softmax_forward.cu 2025-05-07T19:51:49.4906794Z src/jagged_tensor_ops/jagged_tensor_ops.cu 2025-05-07T19:51:49.4906938Z src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu 2025-05-07T19:51:49.4907069Z src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu 2025-05-07T19:51:49.4907190Z src/jagged_tensor_ops/jagged_unique_indices.cu 2025-05-07T19:51:49.4907367Z src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu 2025-05-07T19:51:49.4907484Z src/layout_transform_ops/layout_transform_ops.cu 2025-05-07T19:51:49.4907579Z src/metric_ops/metric_ops.cu 2025-05-07T19:51:49.4907776Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu 2025-05-07T19:51:49.4907949Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu 2025-05-07T19:51:49.4908122Z src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu 2025-05-07T19:51:49.4908216Z src/quantize_ops/quantize_bfloat16.cu 2025-05-07T19:51:49.4908315Z src/quantize_ops/quantize_fp8_rowwise.cu 2025-05-07T19:51:49.4908425Z src/quantize_ops/quantize_fused_8bit_rowwise.cu 2025-05-07T19:51:49.4908548Z src/quantize_ops/quantize_fused_nbit_rowwise.cu 2025-05-07T19:51:49.4908638Z src/quantize_ops/quantize_hfp8.cu 2025-05-07T19:51:49.4908727Z src/quantize_ops/quantize_msfp.cu 2025-05-07T19:51:49.4908853Z src/quantize_ops/quantize_padded_fp8_rowwise.cu 2025-05-07T19:51:49.4908944Z src/quantize_ops/quantize_mx.cu 2025-05-07T19:51:49.4909055Z src/sparse_ops/sparse_async_batched_cumsum.cu 2025-05-07T19:51:49.4909183Z src/sparse_ops/sparse_block_bucketize_features.cu 2025-05-07T19:51:49.4909284Z src/sparse_ops/sparse_bucketize_features.cu 2025-05-07T19:51:49.4909401Z src/sparse_ops/sparse_batched_unary_embeddings.cu 2025-05-07T19:51:49.4909526Z src/sparse_ops/sparse_compute_frequency_sequence.cu 2025-05-07T19:51:49.4909663Z src/sparse_ops/sparse_expand_into_jagged_permute.cu 2025-05-07T19:51:49.4909758Z src/sparse_ops/sparse_group_index.cu 2025-05-07T19:51:49.4909848Z src/sparse_ops/sparse_index_add.cu 2025-05-07T19:51:49.4909952Z src/sparse_ops/sparse_index_select.cu 2025-05-07T19:51:49.4910048Z src/sparse_ops/sparse_invert_permute.cu 2025-05-07T19:51:49.4910220Z src/sparse_ops/sparse_pack_segments_backward.cu 2025-05-07T19:51:49.4910331Z src/sparse_ops/sparse_pack_segments_forward.cu 2025-05-07T19:51:49.4910427Z src/sparse_ops/sparse_permute_1d.cu 2025-05-07T19:51:49.4910521Z src/sparse_ops/sparse_permute_2d.cu 2025-05-07T19:51:49.4910611Z src/sparse_ops/sparse_permute102.cu 2025-05-07T19:51:49.4910723Z src/sparse_ops/sparse_permute_embeddings.cu 2025-05-07T19:51:49.4910810Z src/sparse_ops/sparse_range.cu 2025-05-07T19:51:49.4910912Z src/sparse_ops/sparse_reorder_batched_ad.cu 2025-05-07T19:51:49.4911008Z src/sparse_ops/sparse_segment_sum_csr.cu 2025-05-07T19:51:49.4911103Z src/sparse_ops/sparse_zipf.cu 2025-05-07T19:51:49.4911253Z 2025-05-07T19:51:49.4911331Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:51:49.4911336Z 2025-05-07T19:51:49.4911407Z 2025-05-07T19:51:49.4911486Z HIP_SPECIFIC_SRCS: 2025-05-07T19:51:49.4911490Z 2025-05-07T19:51:49.4911555Z 2025-05-07T19:51:49.4911801Z OTHER_SRCS: 2025-05-07T19:51:49.4911809Z 2025-05-07T19:51:49.4911878Z 2025-05-07T19:51:49.4911957Z CC_FLAGS: 2025-05-07T19:51:49.4911962Z 2025-05-07T19:51:49.4912044Z 2025-05-07T19:51:49.4912142Z NVCC_FLAGS: 2025-05-07T19:51:49.4912240Z --expt-relaxed-constexpr 2025-05-07T19:51:49.4912341Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:51:49.4912465Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:51:49.4912571Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:51:49.4912648Z 2025-05-07T19:51:49.4912733Z HIPCC_FLAGS: 2025-05-07T19:51:49.4912737Z 2025-05-07T19:51:49.4912833Z 2025-05-07T19:51:49.4912914Z INCLUDE_DIRS: 2025-05-07T19:51:49.4913024Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:51:49.4913141Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:51:49.4913247Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:51:49.4913353Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:51:49.4913640Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:51:49.4914044Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:51:49.4914195Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:51:49.4914358Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:51:49.4914599Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:51:49.4914798Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:51:49.4914997Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:51:49.4915161Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:51:49.4915466Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:51:49.4915545Z 2025-05-07T19:51:49.4915640Z Selected Source Files: 2025-05-07T19:51:49.4915764Z src/memory_utils/memory_utils.cpp 2025-05-07T19:51:49.4915873Z src/memory_utils/memory_utils_ops.cpp 2025-05-07T19:51:49.4916071Z src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp 2025-05-07T19:51:49.4916286Z src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp 2025-05-07T19:51:49.4916485Z src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp 2025-05-07T19:51:49.4916699Z src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp 2025-05-07T19:51:49.4916922Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp 2025-05-07T19:51:49.4917151Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp 2025-05-07T19:51:49.4917297Z src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp 2025-05-07T19:51:49.4917427Z src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp 2025-05-07T19:51:49.4917561Z src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp 2025-05-07T19:51:49.4917674Z src/input_combine_ops/input_combine_cpu.cpp 2025-05-07T19:51:49.4917822Z src/layout_transform_ops/layout_transform_ops_cpu.cpp 2025-05-07T19:51:49.4917941Z src/quantize_ops/quantize_ops_cpu.cpp 2025-05-07T19:51:49.4918040Z src/quantize_ops/quantize_ops_meta.cpp 2025-05-07T19:51:49.4918210Z src/sparse_ops/sparse_async_batched_cumsum.cpp 2025-05-07T19:51:49.4918318Z src/sparse_ops/sparse_ops_cpu.cpp 2025-05-07T19:51:49.4918419Z src/sparse_ops/sparse_ops_meta.cpp 2025-05-07T19:51:49.4918507Z src/tbe/eeg/eeg_models.cpp 2025-05-07T19:51:49.4918600Z src/tbe/eeg/eeg_utils.cpp 2025-05-07T19:51:49.4918712Z src/tbe/eeg/indices_estimator_ops.cpp 2025-05-07T19:51:49.4918806Z src/tbe/eeg/indices_estimator.cpp 2025-05-07T19:51:49.4918906Z src/tbe/eeg/indices_generator_ops.cpp 2025-05-07T19:51:49.4919013Z src/tbe/eeg/indices_generator.cpp 2025-05-07T19:51:49.4919243Z src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp 2025-05-07T19:51:49.4919389Z src/layout_transform_ops/layout_transform_ops_gpu.cpp 2025-05-07T19:51:49.4919596Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp 2025-05-07T19:51:49.4919836Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp 2025-05-07T19:51:49.4919940Z src/quantize_ops/quantize_ops_gpu.cpp 2025-05-07T19:51:49.4920039Z src/sparse_ops/sparse_ops_gpu.cpp 2025-05-07T19:51:49.4920150Z src/metric_ops/metric_ops_host.cpp 2025-05-07T19:51:49.4920269Z src/input_combine_ops/input_combine_gpu.cpp 2025-05-07T19:51:49.4920461Z src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp 2025-05-07T19:51:49.4920551Z src/topology_utils.cpp 2025-05-07T19:51:49.4920673Z src/histogram_binning_calibration_ops.cu 2025-05-07T19:51:49.4920774Z src/input_combine_ops/input_combine.cu 2025-05-07T19:51:49.4920982Z src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu 2025-05-07T19:51:49.4921089Z src/memory_utils/memory_utils.cu 2025-05-07T19:51:49.4921186Z src/memory_utils/memory_utils_ops.cu 2025-05-07T19:51:49.4921371Z src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu 2025-05-07T19:51:49.4921564Z src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu 2025-05-07T19:51:49.4921694Z src/jagged_tensor_ops/dense_to_jagged_forward.cu 2025-05-07T19:51:49.4921827Z src/jagged_tensor_ops/jagged_dense_bmm_forward.cu 2025-05-07T19:51:49.4922072Z src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu 2025-05-07T19:51:49.4922258Z src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu 2025-05-07T19:51:49.4922480Z src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu 2025-05-07T19:51:49.4922620Z src/jagged_tensor_ops/jagged_index_add_2d_forward.cu 2025-05-07T19:51:49.4922775Z src/jagged_tensor_ops/jagged_index_select_2d_forward.cu 2025-05-07T19:51:49.4922908Z src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu 2025-05-07T19:51:49.4923037Z src/jagged_tensor_ops/jagged_softmax_backward.cu 2025-05-07T19:51:49.4923168Z src/jagged_tensor_ops/jagged_softmax_forward.cu 2025-05-07T19:51:49.4923281Z src/jagged_tensor_ops/jagged_tensor_ops.cu 2025-05-07T19:51:49.4923545Z src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu 2025-05-07T19:51:49.4923689Z src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu 2025-05-07T19:51:49.4923927Z src/jagged_tensor_ops/jagged_unique_indices.cu 2025-05-07T19:51:49.4924063Z src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu 2025-05-07T19:51:49.4924175Z src/layout_transform_ops/layout_transform_ops.cu 2025-05-07T19:51:49.4924270Z src/metric_ops/metric_ops.cu 2025-05-07T19:51:49.4924473Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu 2025-05-07T19:51:49.4924647Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu 2025-05-07T19:51:49.4924819Z src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu 2025-05-07T19:51:49.4924911Z src/quantize_ops/quantize_bfloat16.cu 2025-05-07T19:51:49.4925010Z src/quantize_ops/quantize_fp8_rowwise.cu 2025-05-07T19:51:49.4925125Z src/quantize_ops/quantize_fused_8bit_rowwise.cu 2025-05-07T19:51:49.4925247Z src/quantize_ops/quantize_fused_nbit_rowwise.cu 2025-05-07T19:51:49.4925338Z src/quantize_ops/quantize_hfp8.cu 2025-05-07T19:51:49.4925428Z src/quantize_ops/quantize_msfp.cu 2025-05-07T19:51:49.4925553Z src/quantize_ops/quantize_padded_fp8_rowwise.cu 2025-05-07T19:51:49.4925689Z src/quantize_ops/quantize_mx.cu 2025-05-07T19:51:49.4925801Z src/sparse_ops/sparse_async_batched_cumsum.cu 2025-05-07T19:51:49.4925926Z src/sparse_ops/sparse_block_bucketize_features.cu 2025-05-07T19:51:49.4926042Z src/sparse_ops/sparse_bucketize_features.cu 2025-05-07T19:51:49.4926166Z src/sparse_ops/sparse_batched_unary_embeddings.cu 2025-05-07T19:51:49.4926296Z src/sparse_ops/sparse_compute_frequency_sequence.cu 2025-05-07T19:51:49.4926432Z src/sparse_ops/sparse_expand_into_jagged_permute.cu 2025-05-07T19:51:49.4926526Z src/sparse_ops/sparse_group_index.cu 2025-05-07T19:51:49.4926615Z src/sparse_ops/sparse_index_add.cu 2025-05-07T19:51:49.4926706Z src/sparse_ops/sparse_index_select.cu 2025-05-07T19:51:49.4926818Z src/sparse_ops/sparse_invert_permute.cu 2025-05-07T19:51:49.4926933Z src/sparse_ops/sparse_pack_segments_backward.cu 2025-05-07T19:51:49.4927047Z src/sparse_ops/sparse_pack_segments_forward.cu 2025-05-07T19:51:49.4927153Z src/sparse_ops/sparse_permute_1d.cu 2025-05-07T19:51:49.4927250Z src/sparse_ops/sparse_permute_2d.cu 2025-05-07T19:51:49.4927339Z src/sparse_ops/sparse_permute102.cu 2025-05-07T19:51:49.4927452Z src/sparse_ops/sparse_permute_embeddings.cu 2025-05-07T19:51:49.4927539Z src/sparse_ops/sparse_range.cu 2025-05-07T19:51:49.4927647Z src/sparse_ops/sparse_reorder_batched_ad.cu 2025-05-07T19:51:49.4927745Z src/sparse_ops/sparse_segment_sum_csr.cu 2025-05-07T19:51:49.4927840Z src/sparse_ops/sparse_zipf.cu 2025-05-07T19:51:49.4927907Z 2025-05-07T19:51:49.4927985Z HIPified Source Files: 2025-05-07T19:51:49.4927989Z 2025-05-07T19:51:49.4928062Z 2025-05-07T19:51:49.4928142Z Library Dependencies: 2025-05-07T19:51:49.4928209Z torch 2025-05-07T19:51:49.4928276Z torch_library 2025-05-07T19:51:49.4928570Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:51:49.4928716Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:51:49.4929008Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:51:49.4929332Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:51:49.4929498Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:51:49.4929613Z fbgemm 2025-05-07T19:51:49.4929712Z fbgemm_gpu_sparse_async_cumsum 2025-05-07T19:51:49.4929804Z fbgemm_gpu_embedding_inplace_ops 2025-05-07T19:51:49.4929888Z fbgemm_gpu_tbe_index_select 2025-05-07T19:51:49.4929965Z fbgemm_gpu_tbe_cache 2025-05-07T19:51:49.4930057Z fbgemm_gpu_tbe_optimizers 2025-05-07T19:51:49.4930130Z fbgemm_gpu_tbe_utils 2025-05-07T19:51:49.4930318Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:51:49.4930388Z 2025-05-07T19:51:49.4930464Z Output Library: 2025-05-07T19:51:49.4930535Z fbgemm_gpu_py 2025-05-07T19:51:49.4930606Z 2025-05-07T19:51:49.4930698Z Destination Directory: 2025-05-07T19:51:49.4930767Z fbgemm_gpu 2025-05-07T19:51:49.4930865Z ================================================================================ 2025-05-07T19:51:49.4930873Z 2025-05-07T19:51:49.4930971Z -- Configuring done (7.7s) 2025-05-07T19:51:49.6013925Z -- Generating done (0.1s) 2025-05-07T19:51:49.6033855Z -- Build files have been written to: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build 2025-05-07T19:51:49.6213054Z Change Dir: '/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build' 2025-05-07T19:51:49.6213081Z 2025-05-07T19:51:49.6213989Z Run Build Command(s): /github/home/miniconda/envs/build_binary/bin/ninja -v -j 48 install 2025-05-07T19:51:49.7614000Z [1/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/support.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/support.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/support.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/support.cpp 2025-05-07T19:51:49.7639368Z [2/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/errorhandler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/errorhandler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/errorhandler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/errorhandler.cpp 2025-05-07T19:51:49.7684688Z [3/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/globals.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/globals.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/globals.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/globals.cpp 2025-05-07T19:51:49.7703293Z [4/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonelist.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonelist.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonelist.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonelist.cpp 2025-05-07T19:51:49.7722254Z [5/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonetree.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonetree.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonetree.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonetree.cpp 2025-05-07T19:51:49.7956101Z [6/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/operand.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/operand.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/operand.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/operand.cpp 2025-05-07T19:51:49.8008492Z [7/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonestack.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonestack.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonestack.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonestack.cpp 2025-05-07T19:51:49.8034639Z [8/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/osutils.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/osutils.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/osutils.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/osutils.cpp 2025-05-07T19:51:49.8217129Z [9/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/environment.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/environment.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/environment.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/environment.cpp 2025-05-07T19:51:49.8370880Z [10/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zone.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zone.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zone.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zone.cpp 2025-05-07T19:51:49.8627745Z [11/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/type.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/type.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/type.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/type.cpp 2025-05-07T19:51:49.8647947Z [12/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/target.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/target.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/target.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/target.cpp 2025-05-07T19:51:49.8817596Z [13/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/string.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/string.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/string.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/string.cpp 2025-05-07T19:51:49.8867517Z [14/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64operand.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64operand.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64operand.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64operand.cpp 2025-05-07T19:51:49.8885666Z [15/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/logger.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/logger.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/logger.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/logger.cpp 2025-05-07T19:51:49.8903875Z [16/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonehash.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonehash.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonehash.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonehash.cpp 2025-05-07T19:51:49.8922046Z [17/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/instdb.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/instdb.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/instdb.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/instdb.cpp 2025-05-07T19:51:49.9050568Z [18/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/constpool.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/constpool.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/constpool.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/constpool.cpp 2025-05-07T19:51:49.9203812Z [19/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitruntime.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitruntime.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitruntime.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitruntime.cpp 2025-05-07T19:51:49.9288411Z [20/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonevector.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonevector.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonevector.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonevector.cpp 2025-05-07T19:51:49.9441496Z [21/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/inst.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/inst.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/inst.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/inst.cpp 2025-05-07T19:51:49.9486442Z [22/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64func.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64func.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64func.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64func.cpp 2025-05-07T19:51:49.9520468Z [23/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codewriter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codewriter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codewriter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codewriter.cpp 2025-05-07T19:51:49.9539545Z [24/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rastack.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rastack.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rastack.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rastack.cpp 2025-05-07T19:51:49.9558768Z [25/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitterutils.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitterutils.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitterutils.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitterutils.cpp 2025-05-07T19:51:49.9663509Z [26/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/virtmem.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/virtmem.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/virtmem.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/virtmem.cpp 2025-05-07T19:51:49.9763967Z [27/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/func.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/func.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/func.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/func.cpp 2025-05-07T19:51:49.9814743Z [28/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instdb.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instdb.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instdb.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instdb.cpp 2025-05-07T19:51:49.9870663Z [29/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instapi.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instapi.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instapi.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instapi.cpp 2025-05-07T19:51:49.9968978Z [30/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64formatter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64formatter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64formatter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64formatter.cpp 2025-05-07T19:51:50.0091832Z [31/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/cpuinfo.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/cpuinfo.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/cpuinfo.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/cpuinfo.cpp 2025-05-07T19:51:50.0109184Z [32/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/funcargscontext.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/funcargscontext.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/funcargscontext.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/funcargscontext.cpp 2025-05-07T19:51:50.0175469Z [33/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitter.cpp 2025-05-07T19:51:50.0358686Z [34/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/archtraits.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/archtraits.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/archtraits.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/archtraits.cpp 2025-05-07T19:51:50.0368855Z [35/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emithelper.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emithelper.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emithelper.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emithelper.cpp 2025-05-07T19:51:50.0673646Z [36/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/assembler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/assembler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/assembler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/assembler.cpp 2025-05-07T19:51:50.0736476Z [37/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/armformatter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/armformatter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/armformatter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/armformatter.cpp 2025-05-07T19:51:50.0859942Z [38/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86operand.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86operand.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86operand.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86operand.cpp 2025-05-07T19:51:50.1024325Z [39/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64builder.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64builder.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64builder.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64builder.cpp 2025-05-07T19:51:50.1582569Z [40/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64compiler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64compiler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64compiler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64compiler.cpp 2025-05-07T19:51:50.1676221Z [41/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/formatter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/formatter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/formatter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/formatter.cpp 2025-05-07T19:51:50.1722599Z [42/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitallocator.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitallocator.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitallocator.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitallocator.cpp 2025-05-07T19:51:50.2027856Z [43/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/compiler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/compiler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/compiler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/compiler.cpp 2025-05-07T19:51:50.2204659Z [44/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64emithelper.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64emithelper.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64emithelper.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64emithelper.cpp 2025-05-07T19:51:50.2390854Z [45/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/builder.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/builder.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/builder.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/builder.cpp 2025-05-07T19:51:50.3157782Z [46/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86formatter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86formatter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86formatter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86formatter.cpp 2025-05-07T19:51:50.4205906Z [47/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codeholder.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codeholder.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codeholder.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codeholder.cpp 2025-05-07T19:51:50.4224189Z [48/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instdb.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instdb.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instdb.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instdb.cpp 2025-05-07T19:51:50.4551386Z [49/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/ralocal.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/ralocal.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/ralocal.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/ralocal.cpp 2025-05-07T19:51:50.5787666Z [50/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instapi.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instapi.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instapi.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instapi.cpp 2025-05-07T19:51:50.6132483Z [51/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx2.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx2.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx2.cc.o -c /__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx2.cc 2025-05-07T19:51:50.6840736Z [52/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64rapass.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64rapass.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64rapass.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64rapass.cpp 2025-05-07T19:51:50.9348558Z [53/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -mavx512f -mavx512bw -mavx512dq -mavx512vl -fopenmp -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx512.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx512.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx512.cc.o -c /__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx512.cc 2025-05-07T19:51:50.9819347Z [54/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rapass.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rapass.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rapass.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rapass.cpp 2025-05-07T19:51:51.0982687Z [55/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86func.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86func.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86func.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86func.cpp 2025-05-07T19:51:51.1005298Z [56/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtils.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtils.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtils.cc.o -c /__w/FBGEMM/FBGEMM/src/QuantUtils.cc 2025-05-07T19:51:51.2083630Z [57/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86emithelper.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86emithelper.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86emithelper.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86emithelper.cpp 2025-05-07T19:51:51.2275042Z [58/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86compiler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86compiler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86compiler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86compiler.cpp 2025-05-07T19:51:51.2597065Z [59/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86builder.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86builder.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86builder.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86builder.cpp 2025-05-07T19:51:51.8364247Z [60/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64assembler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64assembler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64assembler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64assembler.cpp 2025-05-07T19:51:52.1842919Z [61/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86rapass.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86rapass.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86rapass.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86rapass.cpp 2025-05-07T19:51:52.4083693Z [62/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/Utils.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/Utils.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/Utils.cc.o -c /__w/FBGEMM/FBGEMM/src/Utils.cc 2025-05-07T19:51:53.3043877Z [63/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86assembler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86assembler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86assembler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86assembler.cpp 2025-05-07T19:51:53.4614676Z [64/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/SparseAdagrad.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/SparseAdagrad.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/SparseAdagrad.cc.o -c /__w/FBGEMM/FBGEMM/src/SparseAdagrad.cc 2025-05-07T19:51:53.8951926Z [65/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,asmjit.so -o asmjit.so CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64assembler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64builder.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64compiler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64emithelper.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64formatter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64func.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instapi.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instdb.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64operand.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64rapass.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/armformatter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/archtraits.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/assembler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/builder.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codeholder.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codewriter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/compiler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/constpool.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/cpuinfo.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emithelper.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitterutils.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/environment.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/errorhandler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/formatter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/func.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/funcargscontext.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/globals.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/inst.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/instdb.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitallocator.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitruntime.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/logger.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/operand.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/osutils.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/ralocal.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rapass.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rastack.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/string.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/support.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/target.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/type.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/virtmem.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zone.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonehash.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonelist.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonestack.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonetree.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonevector.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86assembler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86builder.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86compiler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86emithelper.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86formatter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86func.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instapi.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instdb.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86operand.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86rapass.cpp.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed && : 2025-05-07T19:51:55.8134616Z [66/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RefImplementations.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RefImplementations.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RefImplementations.cc.o -c /__w/FBGEMM/FBGEMM/src/RefImplementations.cc 2025-05-07T19:51:57.5675087Z [67/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cpp 2025-05-07T19:51:57.5905924Z [68/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cpp 2025-05-07T19:51:57.6062144Z [69/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cpp 2025-05-07T19:51:57.7315185Z [70/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cpp 2025-05-07T19:51:57.8450053Z [71/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RowWiseSparseAdagradFused.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RowWiseSparseAdagradFused.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RowWiseSparseAdagradFused.cc.o -c /__w/FBGEMM/FBGEMM/src/RowWiseSparseAdagradFused.cc 2025-05-07T19:51:59.2732239Z [72/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cpp 2025-05-07T19:51:59.4104175Z [73/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAutovec.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAutovec.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAutovec.cc.o -c /__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAutovec.cc 2025-05-07T19:51:59.6914407Z [74/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/inference/embedding_forward_quantized_host.cpp 2025-05-07T19:51:59.8389335Z [75/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_weighted_codegen_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_weighted_codegen_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_weighted_codegen_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_weighted_codegen_cpu.cpp 2025-05-07T19:52:01.8196200Z [76/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp 2025-05-07T19:52:02.9030424Z [77/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_config_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_config.dir/src/config/feature_gates.cpp.o -MF CMakeFiles/fbgemm_gpu_config.dir/src/config/feature_gates.cpp.o.d -o CMakeFiles/fbgemm_gpu_config.dir/src/config/feature_gates.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/config/feature_gates.cpp 2025-05-07T19:52:03.4269775Z [78/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,fbgemm_gpu_config.so -o fbgemm_gpu_config.so CMakeFiles/fbgemm_gpu_config.dir/src/config/feature_gates.cpp.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed && : 2025-05-07T19:52:04.9940696Z [79/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/split_embeddings_utils_cpu.cpp 2025-05-07T19:52:06.1242060Z [80/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/split_embeddings_utils_meta.cpp 2025-05-07T19:52:06.2247979Z [81/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/split_embeddings_utils.cpp 2025-05-07T19:52:06.5532210Z [82/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtilsAvx2.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtilsAvx2.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtilsAvx2.cc.o -c /__w/FBGEMM/FBGEMM/src/QuantUtilsAvx2.cc 2025-05-07T19:52:09.3682329Z [83/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/inference/embedding_forward_quantized_host_cpu.cpp 2025-05-07T19:52:10.8243418Z [84/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_sparse_async_cumsum_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cpp.o -MF CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cpp.o.d -o CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_cumsum.cpp 2025-05-07T19:52:11.7599537Z [85/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/pt2/pt2_autograd_utils.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/pt2/pt2_autograd_utils.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/pt2/pt2_autograd_utils.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/pt2/pt2_autograd_utils.cpp 2025-05-07T19:52:13.1992186Z [86/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/utils/embedding_bounds_check_host_cpu.cpp 2025-05-07T19:52:14.3200864Z [87/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/utils/embedding_bounds_check_host.cpp 2025-05-07T19:52:17.2433933Z [88/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMNBit.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMNBit.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMNBit.cc.o -c /__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMNBit.cc 2025-05-07T19:52:19.6344596Z [89/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_optimizers_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_optimizer_rowwise_adagrad_split.cpp 2025-05-07T19:52:21.1632888Z [90/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/forward/embedding_forward_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/forward/embedding_forward_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/forward/embedding_forward_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/forward/embedding_forward_split_cpu.cpp 2025-05-07T19:52:22.4530534Z [91/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_codegen_meta.cpp 2025-05-07T19:52:23.7775725Z [92/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_codegen_meta.cpp 2025-05-07T19:52:26.0271521Z [93/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_dense_weighted_codegen_meta.cpp 2025-05-07T19:52:27.7205805Z [94/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_pt2_cpu_wrapper.cpp 2025-05-07T19:52:28.7813542Z [95/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_dense_unweighted_codegen_meta.cpp 2025-05-07T19:52:30.2575123Z [96/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_ssd_weighted_codegen_meta.cpp 2025-05-07T19:52:31.2866699Z [97/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_split_lookup.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_split_lookup.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/inference/embedding_forward_quantized_split_lookup.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_split_lookup.cu.o 2025-05-07T19:52:31.4888332Z [98/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cu.o 2025-05-07T19:52:31.7952849Z [99/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_ssd_unweighted_codegen_meta.cpp 2025-05-07T19:52:32.0025969Z [100/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate.cu.o 2025-05-07T19:52:32.4281006Z [101/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/reset_weight_momentum.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/reset_weight_momentum.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/reset_weight_momentum.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/reset_weight_momentum.cu.o 2025-05-07T19:52:32.5157244Z [102/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cu.o 2025-05-07T19:52:32.5672701Z [103/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cu.o 2025-05-07T19:52:32.7744486Z [104/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate.cu.o 2025-05-07T19:52:32.9718705Z [105/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp 2025-05-07T19:52:32.9888109Z [106/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cu.o 2025-05-07T19:52:35.4447395Z [107/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/generate_vbe_metadata.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/generate_vbe_metadata.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/generate_vbe_metadata.cu -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/generate_vbe_metadata.cu.o 2025-05-07T19:52:35.5123391Z [108/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp 2025-05-07T19:52:35.9421777Z [109/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/get_infos_metadata.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/get_infos_metadata.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/get_infos_metadata.cu -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/get_infos_metadata.cu.o 2025-05-07T19:52:37.7210918Z [110/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:52:38.1671858Z [111/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:52:42.4605738Z [112/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_find.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_find.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_find.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_find.cu.o 2025-05-07T19:52:45.2763032Z [113/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v1.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v1.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/utils/embedding_bounds_check_v1.cu -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v1.cu.o 2025-05-07T19:52:46.2180618Z [114/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v2.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v2.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/utils/embedding_bounds_check_v2.cu -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v2.cu.o 2025-05-07T19:52:46.8291801Z [115/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_find.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_find.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_find.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_find.cu.o 2025-05-07T19:52:47.3915698Z [116/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp 2025-05-07T19:52:49.0228787Z [117/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_pt2_cuda_wrapper.cpp 2025-05-07T19:52:51.1796406Z [118/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_optimizers_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu.o 2025-05-07T19:52:55.3219870Z [119/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_sparse_async_cumsum_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cu.o -MF CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_cumsum.cu -o CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cu.o 2025-05-07T19:52:58.5048991Z [120/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDM.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDM.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDM.cc.o -c /__w/FBGEMM/FBGEMM/src/EmbeddingSpMDM.cc 2025-05-07T19:52:59.1935850Z [121/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,fbgemm.so -o fbgemm.so CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDM.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAutovec.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMNBit.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtils.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RefImplementations.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RowWiseSparseAdagradFused.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/SparseAdagrad.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/Utils.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx2.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtilsAvx2.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx512.cc.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so asmjit.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so && : 2025-05-07T19:52:59.7786172Z [122/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,fbgemm_gpu_tbe_common.so -o fbgemm_gpu_tbe_common.so CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/forward/embedding_forward_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/pt2/pt2_autograd_utils.cpp.o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host.cpp.o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v1.cu.o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v2.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -L/github/home/miniconda/envs/build_binary/lib/stubs -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so fbgemm.so fbgemm_gpu_config.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so -lcudadevrt -lcudart_static -lrt -lpthread -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && : 2025-05-07T19:53:02.6139428Z [123/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cu.o 2025-05-07T19:53:03.2685021Z [124/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,fbgemm_gpu_tbe_cache.so -o fbgemm_gpu_tbe_cache.so CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cpp.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cpp.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cpp.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cpp.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cpp.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_find.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_find.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/reset_weight_momentum.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -L/github/home/miniconda/envs/build_binary/lib/stubs -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed -lcudadevrt -lcudart_static -lrt -lpthread -ldl && : 2025-05-07T19:53:11.5264533Z [125/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_optimizers_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu.o 2025-05-07T19:53:12.0634916Z [126/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/transpose_embedding_input.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/transpose_embedding_input.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/transpose_embedding_input.cu -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/transpose_embedding_input.cu.o 2025-05-07T19:53:12.1016786Z [127/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu.o 2025-05-07T19:53:12.1047753Z [128/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,fbgemm_gpu_tbe_optimizers.so -o fbgemm_gpu_tbe_optimizers.so CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split.cpp.o CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -L/github/home/miniconda/envs/build_binary/lib/stubs -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed -lcudadevrt -lcudart_static -lrt -lpthread -ldl && : 2025-05-07T19:53:12.7131716Z [129/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu.o 2025-05-07T19:53:12.7152183Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:12.7153744Z 2025-05-07T19:53:12.7155161Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:12.7156805Z 2025-05-07T19:53:12.7158237Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:12.7159789Z 2025-05-07T19:53:12.7161283Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:12.7162931Z 2025-05-07T19:53:12.7164365Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:12.7166321Z 2025-05-07T19:53:12.7167745Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:12.7169348Z 2025-05-07T19:53:12.7170738Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:12.7172248Z 2025-05-07T19:53:12.7173589Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:12.7175162Z 2025-05-07T19:53:12.7176578Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:12.7178195Z 2025-05-07T19:53:16.1697775Z [130/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu.o 2025-05-07T19:53:16.1715614Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:16.1717233Z 2025-05-07T19:53:16.1718446Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:16.1719839Z 2025-05-07T19:53:16.1721086Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:16.1722526Z 2025-05-07T19:53:16.1723764Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:16.1725218Z 2025-05-07T19:53:16.1726746Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:16.1728511Z 2025-05-07T19:53:16.1730051Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:16.1731791Z 2025-05-07T19:53:16.1733341Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:16.1735077Z 2025-05-07T19:53:16.1736601Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:16.1738360Z 2025-05-07T19:53:16.1739876Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:16.1741591Z 2025-05-07T19:53:16.1743277Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:16.1744807Z 2025-05-07T19:53:16.1746007Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:16.1747473Z 2025-05-07T19:53:16.1748856Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:16.1750513Z 2025-05-07T19:53:16.3958550Z [131/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_dense_unweighted_nobag_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel.cu.o 2025-05-07T19:53:17.0329479Z [132/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu.o 2025-05-07T19:53:17.0351725Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:17.0353402Z 2025-05-07T19:53:17.0354868Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:17.0356497Z 2025-05-07T19:53:17.0357789Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:17.0359198Z 2025-05-07T19:53:17.0360701Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:17.0362291Z 2025-05-07T19:53:17.0363691Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:17.0365281Z 2025-05-07T19:53:17.0366696Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:17.0368300Z 2025-05-07T19:53:17.0369655Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:17.0371335Z 2025-05-07T19:53:17.0372829Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:17.0374730Z 2025-05-07T19:53:17.0376266Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:17.0377978Z 2025-05-07T19:53:17.0379492Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:17.0381211Z 2025-05-07T19:53:17.0382679Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:17.0384298Z 2025-05-07T19:53:17.0385569Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:17.0387251Z 2025-05-07T19:53:18.4169892Z [133/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu.o 2025-05-07T19:53:20.4503596Z [134/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu.o 2025-05-07T19:53:26.7841864Z [135/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_ssd_unweighted_nobag_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel.cu.o 2025-05-07T19:53:27.6586872Z [136/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_nobag_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel.cu.o 2025-05-07T19:53:27.7139593Z [137/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_dense_weighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_cuda.cu.o 2025-05-07T19:53:28.0544862Z [138/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/radix_sort_pairs.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/radix_sort_pairs.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/radix_sort_pairs.cu -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/radix_sort_pairs.cu.o 2025-05-07T19:53:28.6955396Z [139/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,fbgemm_gpu_tbe_utils.so -o fbgemm_gpu_tbe_utils.so CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils.cpp.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/generate_vbe_metadata.cu.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/get_infos_metadata.cu.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/radix_sort_pairs.cu.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/transpose_embedding_input.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -L/github/home/miniconda/envs/build_binary/lib/stubs -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed -lcudadevrt -lcudart_static -lrt -lpthread -ldl && : 2025-05-07T19:53:29.2516001Z [140/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,fbgemm_gpu_sparse_async_cumsum.so -o fbgemm_gpu_sparse_async_cumsum.so CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cpp.o CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -L/github/home/miniconda/envs/build_binary/lib/stubs -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so -lcudadevrt -lcudart_static -lrt -lpthread -ldl && : 2025-05-07T19:53:34.0576271Z [141/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_dense_unweighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_cuda.cu.o 2025-05-07T19:53:44.8933024Z [142/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_cuda.cu.o 2025-05-07T19:53:50.1651596Z [143/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu.o 2025-05-07T19:53:51.6370042Z [144/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_cuda.cu.o 2025-05-07T19:53:59.8239722Z [145/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu.o 2025-05-07T19:53:59.8259675Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:59.8261187Z 2025-05-07T19:53:59.8262587Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:59.8264181Z 2025-05-07T19:53:59.8265496Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:59.8267355Z 2025-05-07T19:53:59.8268638Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:59.8270184Z 2025-05-07T19:53:59.8271608Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:59.8273065Z 2025-05-07T19:53:59.8274464Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:59.8276028Z 2025-05-07T19:53:59.8277393Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:59.8278925Z 2025-05-07T19:53:59.8280256Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:59.8281613Z 2025-05-07T19:53:59.8282835Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:59.8284185Z 2025-05-07T19:54:03.4853389Z [146/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu.o 2025-05-07T19:54:07.5765612Z [147/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu.o 2025-05-07T19:54:07.5786635Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:07.5788255Z 2025-05-07T19:54:07.5789677Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:07.5791736Z 2025-05-07T19:54:07.5793044Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:07.5794511Z 2025-05-07T19:54:07.5795862Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:07.5797728Z 2025-05-07T19:54:07.5799056Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:07.5800577Z 2025-05-07T19:54:07.5801977Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:07.5803571Z 2025-05-07T19:54:07.5804978Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:07.5806544Z 2025-05-07T19:54:07.5807905Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:07.5809409Z 2025-05-07T19:54:07.5810982Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:07.5812691Z 2025-05-07T19:54:09.1026859Z [148/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_v2_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_v2_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_v2_kernel.cu.o 2025-05-07T19:54:09.1048779Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1053474Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1056735Z (955): here 2025-05-07T19:54:09.1056946Z 2025-05-07T19:54:09.1058201Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1062474Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1065569Z (1007): here 2025-05-07T19:54:09.1065786Z 2025-05-07T19:54:09.1067052Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1071681Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1074911Z (1059): here 2025-05-07T19:54:09.1075124Z 2025-05-07T19:54:09.1076657Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1081149Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1084454Z (1111): here 2025-05-07T19:54:09.1084673Z 2025-05-07T19:54:09.1085943Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1090370Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1093945Z (1163): here 2025-05-07T19:54:09.1094152Z 2025-05-07T19:54:09.1095400Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1100068Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1103326Z (1215): here 2025-05-07T19:54:09.1103531Z 2025-05-07T19:54:09.1104751Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1108893Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1112056Z (1267): here 2025-05-07T19:54:09.1112268Z 2025-05-07T19:54:09.1113432Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1117727Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1121072Z (1319): here 2025-05-07T19:54:09.1121282Z 2025-05-07T19:54:09.1122849Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1127353Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1130596Z (1371): here 2025-05-07T19:54:09.1130756Z 2025-05-07T19:54:09.1131924Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1135952Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1139087Z (1423): here 2025-05-07T19:54:09.1139294Z 2025-05-07T19:54:09.1140329Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1144199Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1147413Z (1475): here 2025-05-07T19:54:09.1147626Z 2025-05-07T19:54:09.1148736Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1152992Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1155925Z (1527): here 2025-05-07T19:54:09.1156166Z 2025-05-07T19:54:09.1157287Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1161345Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1164311Z (1579): here 2025-05-07T19:54:09.1164509Z 2025-05-07T19:54:09.1165635Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1169819Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1172949Z (1631): here 2025-05-07T19:54:09.1173150Z 2025-05-07T19:54:09.1174277Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1178368Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1181401Z (1683): here 2025-05-07T19:54:09.1181606Z 2025-05-07T19:54:09.1182818Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1187102Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1190805Z (1735): here 2025-05-07T19:54:09.1191018Z 2025-05-07T19:54:09.1192175Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1196451Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1199779Z (1787): here 2025-05-07T19:54:09.1199988Z 2025-05-07T19:54:09.1201267Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1205370Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1208615Z (1839): here 2025-05-07T19:54:09.1208834Z 2025-05-07T19:54:09.1210106Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1214893Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1218312Z (1891): here 2025-05-07T19:54:09.1218511Z 2025-05-07T19:54:09.1219680Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1223765Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1226819Z (1943): here 2025-05-07T19:54:09.1227051Z 2025-05-07T19:54:09.1228335Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1232540Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1235753Z (1995): here 2025-05-07T19:54:09.1235955Z 2025-05-07T19:54:09.1237091Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1241204Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1244238Z (2047): here 2025-05-07T19:54:09.1244444Z 2025-05-07T19:54:09.1245599Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1249670Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1252640Z (2099): here 2025-05-07T19:54:09.1252873Z 2025-05-07T19:54:09.1253988Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1258253Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1261397Z (2151): here 2025-05-07T19:54:09.1261606Z 2025-05-07T19:54:09.1262680Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1266854Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1269881Z (955): here 2025-05-07T19:54:09.1270078Z 2025-05-07T19:54:09.1271460Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1275729Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1279094Z (1007): here 2025-05-07T19:54:09.1279280Z 2025-05-07T19:54:09.1280402Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1284705Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1288056Z (1059): here 2025-05-07T19:54:09.1288270Z 2025-05-07T19:54:09.1289567Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1293837Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1297066Z (1111): here 2025-05-07T19:54:09.1297283Z 2025-05-07T19:54:09.1298566Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1303274Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1306581Z (1163): here 2025-05-07T19:54:09.1306797Z 2025-05-07T19:54:09.1307979Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1312113Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1315136Z (1215): here 2025-05-07T19:54:09.1315351Z 2025-05-07T19:54:09.1316640Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1320573Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1323770Z (1267): here 2025-05-07T19:54:09.1323971Z 2025-05-07T19:54:09.1325129Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1329201Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1332174Z (1319): here 2025-05-07T19:54:09.1332387Z 2025-05-07T19:54:09.1333549Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1337561Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1340482Z (1371): here 2025-05-07T19:54:09.1340703Z 2025-05-07T19:54:09.1341819Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1346118Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1349227Z (1423): here 2025-05-07T19:54:09.1349430Z 2025-05-07T19:54:09.1350567Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1354951Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1357849Z (1475): here 2025-05-07T19:54:09.1358063Z 2025-05-07T19:54:09.1359271Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1363602Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1367012Z (1527): here 2025-05-07T19:54:09.1367187Z 2025-05-07T19:54:09.1368321Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1372627Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1375942Z (1579): here 2025-05-07T19:54:09.1376170Z 2025-05-07T19:54:09.1377446Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1381484Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1384755Z (1631): here 2025-05-07T19:54:09.1385012Z 2025-05-07T19:54:09.1386288Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1391484Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1394790Z (1683): here 2025-05-07T19:54:09.1394997Z 2025-05-07T19:54:09.1396162Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1400295Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1403528Z (1735): here 2025-05-07T19:54:09.1403771Z 2025-05-07T19:54:09.1405077Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1408869Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1412293Z (1787): here 2025-05-07T19:54:09.1412518Z 2025-05-07T19:54:09.1413736Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1417869Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1420825Z (1839): here 2025-05-07T19:54:09.1421063Z 2025-05-07T19:54:09.1422239Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1426360Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1429375Z (1891): here 2025-05-07T19:54:09.1429579Z 2025-05-07T19:54:09.1430695Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1435246Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1438297Z (1943): here 2025-05-07T19:54:09.1438517Z 2025-05-07T19:54:09.1439690Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1443876Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1447101Z (1995): here 2025-05-07T19:54:09.1447331Z 2025-05-07T19:54:09.1448572Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1452931Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1456252Z (2047): here 2025-05-07T19:54:09.1456496Z 2025-05-07T19:54:09.1457745Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1462222Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1465615Z (2099): here 2025-05-07T19:54:09.1465834Z 2025-05-07T19:54:09.1466958Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1471259Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1474786Z (2151): here 2025-05-07T19:54:09.1475001Z 2025-05-07T19:54:09.1476264Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1480994Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1484029Z (955): here 2025-05-07T19:54:09.1484243Z 2025-05-07T19:54:09.1485431Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1489380Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1492968Z (1007): here 2025-05-07T19:54:09.1493191Z 2025-05-07T19:54:09.1494247Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1497950Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1501030Z (1059): here 2025-05-07T19:54:09.1501284Z 2025-05-07T19:54:09.1502479Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1506790Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1509847Z (1111): here 2025-05-07T19:54:09.1510057Z 2025-05-07T19:54:09.1511200Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1515371Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1518305Z (1163): here 2025-05-07T19:54:09.1518496Z 2025-05-07T19:54:09.1519621Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1523797Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1526720Z (1215): here 2025-05-07T19:54:09.1527246Z 2025-05-07T19:54:09.1528456Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1532488Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1535668Z (1267): here 2025-05-07T19:54:09.1535895Z 2025-05-07T19:54:09.1537049Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1541339Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1544392Z (1319): here 2025-05-07T19:54:09.1544596Z 2025-05-07T19:54:09.1545838Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1550434Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1553899Z (1371): here 2025-05-07T19:54:09.1554096Z 2025-05-07T19:54:09.1555198Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1559443Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1562738Z (1423): here 2025-05-07T19:54:09.1562942Z 2025-05-07T19:54:09.1564215Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1568753Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1571799Z (1475): here 2025-05-07T19:54:09.1572023Z 2025-05-07T19:54:09.1573418Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1577425Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1580719Z (1527): here 2025-05-07T19:54:09.1580900Z 2025-05-07T19:54:09.1581941Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1585831Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1588814Z (1579): here 2025-05-07T19:54:09.1589032Z 2025-05-07T19:54:09.1590204Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1594705Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1597679Z (1631): here 2025-05-07T19:54:09.1597876Z 2025-05-07T19:54:09.1599029Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1603075Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1606048Z (1683): here 2025-05-07T19:54:09.1606242Z 2025-05-07T19:54:09.1607366Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1611575Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1614505Z (1735): here 2025-05-07T19:54:09.1614721Z 2025-05-07T19:54:09.1616247Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1620340Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1623539Z (1787): here 2025-05-07T19:54:09.1623750Z 2025-05-07T19:54:09.1624914Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1629202Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1632372Z (1839): here 2025-05-07T19:54:09.1632587Z 2025-05-07T19:54:09.1633856Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1638295Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1641865Z (1891): here 2025-05-07T19:54:09.1642060Z 2025-05-07T19:54:09.1643116Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1647459Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1650749Z (1943): here 2025-05-07T19:54:09.1650982Z 2025-05-07T19:54:09.1652271Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1656802Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1659834Z (1995): here 2025-05-07T19:54:09.1660038Z 2025-05-07T19:54:09.1661199Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1665538Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1668688Z (2047): here 2025-05-07T19:54:09.1668870Z 2025-05-07T19:54:09.1669940Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1674057Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1677168Z (2099): here 2025-05-07T19:54:09.1677370Z 2025-05-07T19:54:09.1678528Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:09.1682516Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:09.1685809Z (2151): here 2025-05-07T19:54:09.1686026Z 2025-05-07T19:54:09.1705465Z [149/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu.o 2025-05-07T19:54:09.1727359Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:09.1728947Z 2025-05-07T19:54:09.1730329Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:09.1732083Z 2025-05-07T19:54:09.1733558Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:09.1735323Z 2025-05-07T19:54:09.1736783Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:09.1739006Z 2025-05-07T19:54:09.1740579Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:09.1742358Z 2025-05-07T19:54:09.1743823Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:09.1745391Z 2025-05-07T19:54:09.1746829Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:09.1748412Z 2025-05-07T19:54:09.1749755Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:09.1751677Z 2025-05-07T19:54:09.1753255Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:09.1754840Z 2025-05-07T19:54:09.1756154Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:09.1757659Z 2025-05-07T19:54:09.1759002Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:09.1760593Z 2025-05-07T19:54:09.1762034Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:09.1763701Z 2025-05-07T19:54:09.1863916Z [150/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu.o 2025-05-07T19:54:09.1884618Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:09.1886313Z 2025-05-07T19:54:09.1887759Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:09.1889428Z 2025-05-07T19:54:09.1891144Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:09.1892858Z 2025-05-07T19:54:09.1894329Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:09.1896016Z 2025-05-07T19:54:09.1897457Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:09.1899131Z 2025-05-07T19:54:09.1900967Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:09.1902673Z 2025-05-07T19:54:09.1904212Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:09.1905827Z 2025-05-07T19:54:09.1907284Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:09.1908901Z 2025-05-07T19:54:09.1910287Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:09.1912083Z 2025-05-07T19:54:09.1913566Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:09.1915147Z 2025-05-07T19:54:09.1916577Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:09.1918223Z 2025-05-07T19:54:09.1919686Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:09.1921696Z 2025-05-07T19:54:11.3356426Z [151/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_ssd_weighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_cuda.cu.o 2025-05-07T19:54:13.7348417Z [152/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu.o 2025-05-07T19:54:13.9203705Z [153/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adagrad_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adagrad_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adagrad_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_split_adagrad_pt2_autograd.cpp 2025-05-07T19:54:15.0559419Z [154/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_dense_unweighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_kernel.cu.o 2025-05-07T19:54:18.3765778Z [155/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_dense_weighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_kernel.cu.o 2025-05-07T19:54:18.7247028Z [156/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_ssd_unweighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_cuda.cu.o 2025-05-07T19:54:19.4728484Z [157/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu.o 2025-05-07T19:54:19.4751604Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:19.4753388Z 2025-05-07T19:54:19.4754908Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:19.4756633Z 2025-05-07T19:54:19.4758167Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:19.4759906Z 2025-05-07T19:54:19.4761408Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:19.4763158Z 2025-05-07T19:54:19.4764664Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:19.4766428Z 2025-05-07T19:54:19.4771006Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:19.4772734Z 2025-05-07T19:54:19.4774210Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:19.4775898Z 2025-05-07T19:54:19.4777425Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:19.4779149Z 2025-05-07T19:54:19.4780586Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:19.4782243Z 2025-05-07T19:54:19.4783777Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:19.4785409Z 2025-05-07T19:54:19.4786907Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:19.4788613Z 2025-05-07T19:54:19.4789970Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:19.4792186Z 2025-05-07T19:54:19.5901338Z [158/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu.o 2025-05-07T19:54:21.6044945Z [159/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu.o 2025-05-07T19:54:21.6065767Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:21.6067337Z 2025-05-07T19:54:21.6068677Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:21.6070210Z 2025-05-07T19:54:21.6071673Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:21.6073174Z 2025-05-07T19:54:21.6074542Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:21.6076062Z 2025-05-07T19:54:21.6077756Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:21.6079299Z 2025-05-07T19:54:21.6080642Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:21.6082177Z 2025-05-07T19:54:21.6083535Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:21.6085078Z 2025-05-07T19:54:21.6086442Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:21.6087830Z 2025-05-07T19:54:21.6088946Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:21.6090997Z 2025-05-07T19:54:21.6092568Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:21.6094377Z 2025-05-07T19:54:21.6095966Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:21.6097995Z 2025-05-07T19:54:21.6099549Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:21.6101328Z 2025-05-07T19:54:26.1601313Z [160/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu.o 2025-05-07T19:54:26.1624016Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(219): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:26.1625883Z 2025-05-07T19:54:26.1627457Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(236): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:26.1629250Z 2025-05-07T19:54:26.1630819Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(246): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:26.1632695Z 2025-05-07T19:54:26.1634265Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(219): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:26.1636410Z 2025-05-07T19:54:26.1637868Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(236): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:26.1639623Z 2025-05-07T19:54:26.1641052Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(246): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:26.1642759Z 2025-05-07T19:54:26.1644397Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(219): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:26.1645967Z 2025-05-07T19:54:26.1647325Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(236): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:26.1649091Z 2025-05-07T19:54:26.1650624Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(246): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:26.1652320Z 2025-05-07T19:54:26.5539480Z [161/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel_small.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel_small.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_nobag_kernel_small.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel_small.cu.o 2025-05-07T19:54:26.8228983Z [162/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_sgd_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_sgd_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_sgd_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_split_approx_sgd_pt2_autograd.cpp 2025-05-07T19:54:28.5774711Z [163/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_sgd_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_sgd_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_sgd_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_split_sgd_pt2_autograd.cpp 2025-05-07T19:54:30.1615519Z [164/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:54:30.1771040Z [165/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:54:30.1915513Z [166/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:54:30.2066096Z [167/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:54:30.2227955Z [168/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:54:30.2377736Z [169/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:54:30.2527499Z [170/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp 2025-05-07T19:54:30.2675350Z [171/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:54:30.2822353Z [172/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:54:30.2973286Z [173/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:54:30.3121730Z [174/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:54:30.3271501Z [175/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:54:30.3420544Z [176/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:54:30.3569750Z [177/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:54:30.3879319Z [178/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:54:31.0466400Z [179/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu.o 2025-05-07T19:54:31.1685299Z [180/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lars_sgd_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lars_sgd_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lars_sgd_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_split_lars_sgd_pt2_autograd.cpp 2025-05-07T19:54:31.2243123Z [181/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:54:31.2410496Z [182/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:54:33.2949527Z [183/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp 2025-05-07T19:54:33.4660849Z [184/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lamb_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lamb_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lamb_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_split_lamb_pt2_autograd.cpp 2025-05-07T19:54:33.6326108Z [185/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:54:33.8284986Z [186/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp 2025-05-07T19:54:34.1189254Z [187/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu.o 2025-05-07T19:54:34.1340305Z [188/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:54:34.1492080Z [189/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:54:34.1639342Z [190/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:54:34.1792314Z [191/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:54:34.1942225Z [192/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:54:34.2091055Z [193/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:54:34.6103262Z [194/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_none_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_none_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_none_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_split_none_pt2_autograd.cpp 2025-05-07T19:54:34.8514434Z [195/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu.o 2025-05-07T19:54:36.1207152Z [196/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp 2025-05-07T19:54:38.1776790Z [197/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adam_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adam_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adam_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_split_adam_pt2_autograd.cpp 2025-05-07T19:54:40.3504259Z [198/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu.o 2025-05-07T19:54:41.0887066Z [199/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:54:41.1921699Z [200/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:54:41.4600887Z [201/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu.o 2025-05-07T19:54:41.4624275Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:41.4626094Z 2025-05-07T19:54:41.4627951Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:41.4629765Z 2025-05-07T19:54:41.4631449Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:41.4633262Z 2025-05-07T19:54:41.4634838Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:41.4636639Z 2025-05-07T19:54:41.4638243Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:41.4640043Z 2025-05-07T19:54:41.4641612Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:41.4643431Z 2025-05-07T19:54:41.4645014Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:41.4646475Z 2025-05-07T19:54:41.4647917Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:41.4649641Z 2025-05-07T19:54:41.4650736Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:41.4652042Z 2025-05-07T19:54:42.2555083Z [202/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:54:44.1399550Z [203/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu.o 2025-05-07T19:54:45.3398927Z [204/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:54:45.6091575Z [205/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu.o 2025-05-07T19:54:47.4634644Z [206/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:54:47.8697130Z [207/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_dense_weighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_kernel.cu.o 2025-05-07T19:54:48.2309505Z [208/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_cpu.cpp 2025-05-07T19:54:48.2986185Z [209/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:54:49.2557295Z [210/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp 2025-05-07T19:54:49.4634951Z [211/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:54:49.5322032Z [212/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu.o 2025-05-07T19:54:49.5343552Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(269): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:49.5345184Z 2025-05-07T19:54:49.5346636Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(274): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:49.5348088Z 2025-05-07T19:54:49.5349364Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(291): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:49.5351008Z 2025-05-07T19:54:49.5352495Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(303): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:49.5354015Z 2025-05-07T19:54:49.5355396Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(269): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:49.5356948Z 2025-05-07T19:54:49.5358713Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(274): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:49.5360052Z 2025-05-07T19:54:49.5361409Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(291): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:49.5362997Z 2025-05-07T19:54:49.5364367Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(303): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:49.5365945Z 2025-05-07T19:54:49.5367237Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(269): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:49.5368667Z 2025-05-07T19:54:49.5370006Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(274): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:49.5371451Z 2025-05-07T19:54:49.5372696Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(291): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:49.5374327Z 2025-05-07T19:54:49.5375634Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(303): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:49.5377322Z 2025-05-07T19:54:49.9267871Z [213/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:54:51.3372284Z [214/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/codegen/training/backward/embedding_backward_dense_host_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/codegen/training/backward/embedding_backward_dense_host_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/codegen/training/backward/embedding_backward_dense_host_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/backward/embedding_backward_dense_host_cpu.cpp 2025-05-07T19:54:51.4878552Z [215/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:54:51.8368038Z [216/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:54:52.0783889Z [217/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:54:53.0059447Z [218/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:54:53.2330797Z [219/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu.o 2025-05-07T19:54:53.2353275Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:53.2354891Z 2025-05-07T19:54:53.2356192Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:53.2357734Z 2025-05-07T19:54:53.2359224Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:53.2361220Z 2025-05-07T19:54:53.2362672Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:53.2364440Z 2025-05-07T19:54:53.2365986Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:53.2367737Z 2025-05-07T19:54:53.2369257Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:53.2370993Z 2025-05-07T19:54:53.2372534Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:53.2374274Z 2025-05-07T19:54:53.2375774Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:53.2377492Z 2025-05-07T19:54:53.2378994Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:53.2380714Z 2025-05-07T19:54:53.2382227Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:53.2383932Z 2025-05-07T19:54:53.2385467Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:53.2387212Z 2025-05-07T19:54:53.2388875Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:53.2390850Z 2025-05-07T19:54:53.4783813Z [220/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu.o 2025-05-07T19:54:53.4806298Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(253): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:53.4807935Z 2025-05-07T19:54:53.4809297Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(258): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:53.4810904Z 2025-05-07T19:54:53.4812429Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:53.4814145Z 2025-05-07T19:54:53.4815605Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(287): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:53.4817352Z 2025-05-07T19:54:53.4818944Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(253): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:53.4820596Z 2025-05-07T19:54:53.4822067Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(258): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:53.4823732Z 2025-05-07T19:54:53.4824993Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:53.4826678Z 2025-05-07T19:54:53.4827999Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(287): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:53.4829544Z 2025-05-07T19:54:53.4830895Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(253): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:53.4832557Z 2025-05-07T19:54:53.4833908Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(258): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:53.4835414Z 2025-05-07T19:54:53.4836799Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:53.4838582Z 2025-05-07T19:54:53.4839962Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(287): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:53.4841468Z 2025-05-07T19:54:53.6506705Z [221/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu.o 2025-05-07T19:54:53.6529369Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:53.6531020Z 2025-05-07T19:54:53.6532430Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:53.6534009Z 2025-05-07T19:54:53.6535496Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:53.6537271Z 2025-05-07T19:54:53.6538793Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:53.6540477Z 2025-05-07T19:54:53.6542253Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:53.6543982Z 2025-05-07T19:54:53.6545335Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:53.6546898Z 2025-05-07T19:54:53.6548388Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:53.6550026Z 2025-05-07T19:54:53.6551555Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:53.6553016Z 2025-05-07T19:54:53.6554399Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:53.6556064Z 2025-05-07T19:54:53.6557563Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:53.6559126Z 2025-05-07T19:54:53.6560505Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:53.6562138Z 2025-05-07T19:54:53.6563657Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:53.6565380Z 2025-05-07T19:54:53.7486160Z [222/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:54:54.1585598Z [223/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lamb_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lamb_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lamb_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_lamb_cpu.cpp 2025-05-07T19:54:54.3139328Z [224/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lars_sgd_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lars_sgd_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lars_sgd_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_lars_sgd_cpu.cpp 2025-05-07T19:54:54.6557661Z [225/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adagrad_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adagrad_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adagrad_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_adagrad_cpu.cpp 2025-05-07T19:54:54.9125943Z [226/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adam_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adam_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adam_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_adam_cpu.cpp 2025-05-07T19:54:55.0328502Z [227/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp 2025-05-07T19:54:56.3734242Z [228/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_kernel.cu.o 2025-05-07T19:54:56.5935989Z [229/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_rowwise_adagrad_cpu.cpp 2025-05-07T19:54:57.1871137Z [230/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp 2025-05-07T19:54:57.8833146Z [231/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu.o 2025-05-07T19:54:58.2217279Z [232/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_none_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_none_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_none_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_none_cpu.cpp 2025-05-07T19:54:58.5628388Z [233/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:54:58.6618563Z [234/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_sgd_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_sgd_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_sgd_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_sgd_cpu.cpp 2025-05-07T19:54:59.3382352Z [235/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu.o 2025-05-07T19:55:00.0805774Z [236/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:55:00.2469382Z [237/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_kernel.cu.o 2025-05-07T19:55:00.4442096Z [238/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_sgd_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_sgd_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_sgd_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_approx_sgd_cpu.cpp 2025-05-07T19:55:00.6819222Z [239/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_v2_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_v2_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_v2_kernel.cu.o 2025-05-07T19:55:00.6843734Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.6848511Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.6851984Z (946): here 2025-05-07T19:55:00.6852202Z 2025-05-07T19:55:00.6853551Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.6858301Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.6862074Z (996): here 2025-05-07T19:55:00.6862304Z 2025-05-07T19:55:00.6863685Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.6868436Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.6872120Z (1046): here 2025-05-07T19:55:00.6872354Z 2025-05-07T19:55:00.6873764Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.6878484Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.6881960Z (1096): here 2025-05-07T19:55:00.6882182Z 2025-05-07T19:55:00.6883552Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.6889765Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.6893482Z (1146): here 2025-05-07T19:55:00.6893726Z 2025-05-07T19:55:00.6895088Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.6899805Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.6903231Z (1196): here 2025-05-07T19:55:00.6903451Z 2025-05-07T19:55:00.6904808Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.6909506Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.6913096Z (1246): here 2025-05-07T19:55:00.6913317Z 2025-05-07T19:55:00.6914923Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.6919618Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.6923031Z (1296): here 2025-05-07T19:55:00.6923246Z 2025-05-07T19:55:00.6924639Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.6929184Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.6931688Z (1346): here 2025-05-07T19:55:00.6931879Z 2025-05-07T19:55:00.6932899Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.6936715Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.6939304Z (1396): here 2025-05-07T19:55:00.6939495Z 2025-05-07T19:55:00.6940542Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.6944122Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.6946789Z (1446): here 2025-05-07T19:55:00.6946973Z 2025-05-07T19:55:00.6948000Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.6951816Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.6954469Z (1496): here 2025-05-07T19:55:00.6954655Z 2025-05-07T19:55:00.6955885Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.6959484Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.6962128Z (1546): here 2025-05-07T19:55:00.6962311Z 2025-05-07T19:55:00.6963386Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.6967009Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.6969615Z (1596): here 2025-05-07T19:55:00.6969805Z 2025-05-07T19:55:00.6970841Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.6974412Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.6977123Z (1646): here 2025-05-07T19:55:00.6977308Z 2025-05-07T19:55:00.6978385Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.6982071Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.6984739Z (1696): here 2025-05-07T19:55:00.6984914Z 2025-05-07T19:55:00.6985989Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.6989617Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.6992762Z (1746): here 2025-05-07T19:55:00.6992955Z 2025-05-07T19:55:00.6994022Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.6997957Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.7000666Z (1796): here 2025-05-07T19:55:00.7000862Z 2025-05-07T19:55:00.7001925Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.7005573Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.7008248Z (1846): here 2025-05-07T19:55:00.7008439Z 2025-05-07T19:55:00.7009474Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.7013077Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.7015969Z (1896): here 2025-05-07T19:55:00.7016146Z 2025-05-07T19:55:00.7017207Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.7020822Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.7023497Z (1946): here 2025-05-07T19:55:00.7023691Z 2025-05-07T19:55:00.7024763Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.7028474Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.7031072Z (1996): here 2025-05-07T19:55:00.7031412Z 2025-05-07T19:55:00.7032451Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.7036565Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.7039266Z (2046): here 2025-05-07T19:55:00.7039448Z 2025-05-07T19:55:00.7040489Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.7044148Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.7046841Z (2096): here 2025-05-07T19:55:00.7047016Z 2025-05-07T19:55:00.7048058Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.7051619Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.7054375Z (946): here 2025-05-07T19:55:00.7054555Z 2025-05-07T19:55:00.7055648Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.7059256Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.7061852Z (996): here 2025-05-07T19:55:00.7062037Z 2025-05-07T19:55:00.7063091Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.7066696Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.7069373Z (1046): here 2025-05-07T19:55:00.7069567Z 2025-05-07T19:55:00.7070606Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.7074493Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.7077137Z (1096): here 2025-05-07T19:55:00.7077326Z 2025-05-07T19:55:00.7078398Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.7081957Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.7084612Z (1146): here 2025-05-07T19:55:00.7084799Z 2025-05-07T19:55:00.7085857Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.7089439Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.7092422Z (1196): here 2025-05-07T19:55:00.7092603Z 2025-05-07T19:55:00.7093663Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.7097108Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.7099717Z (1246): here 2025-05-07T19:55:00.7099911Z 2025-05-07T19:55:00.7100935Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.7104724Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.7107504Z (1296): here 2025-05-07T19:55:00.7107690Z 2025-05-07T19:55:00.7108819Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.7112817Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.7115522Z (1346): here 2025-05-07T19:55:00.7115710Z 2025-05-07T19:55:00.7116828Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.7120450Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.7123091Z (1396): here 2025-05-07T19:55:00.7123269Z 2025-05-07T19:55:00.7124327Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.7127940Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.7130623Z (1446): here 2025-05-07T19:55:00.7130817Z 2025-05-07T19:55:00.7131872Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.7135694Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.7138374Z (1496): here 2025-05-07T19:55:00.7138578Z 2025-05-07T19:55:00.7139630Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.7143181Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.7145835Z (1546): here 2025-05-07T19:55:00.7146002Z 2025-05-07T19:55:00.7147096Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.7150738Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.7153525Z (1596): here 2025-05-07T19:55:00.7153718Z 2025-05-07T19:55:00.7154941Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.7158606Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.7161489Z (1646): here 2025-05-07T19:55:00.7161688Z 2025-05-07T19:55:00.7162855Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.7166755Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.7169461Z (1696): here 2025-05-07T19:55:00.7169648Z 2025-05-07T19:55:00.7170721Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.7174520Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.7177161Z (1746): here 2025-05-07T19:55:00.7177341Z 2025-05-07T19:55:00.7178403Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.7182014Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.7184710Z (1796): here 2025-05-07T19:55:00.7184890Z 2025-05-07T19:55:00.7185902Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.7189543Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.7192578Z (1846): here 2025-05-07T19:55:00.7192764Z 2025-05-07T19:55:00.7194029Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.7197555Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.7200194Z (1896): here 2025-05-07T19:55:00.7200398Z 2025-05-07T19:55:00.7201483Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.7205183Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.7207913Z (1946): here 2025-05-07T19:55:00.7208111Z 2025-05-07T19:55:00.7209143Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.7212767Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.7215665Z (1996): here 2025-05-07T19:55:00.7215848Z 2025-05-07T19:55:00.7216919Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.7220641Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.7223130Z (2046): here 2025-05-07T19:55:00.7223308Z 2025-05-07T19:55:00.7224299Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.7227680Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.7230168Z (2096): here 2025-05-07T19:55:00.7230343Z 2025-05-07T19:55:00.7231622Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.7234948Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.7237371Z (946): here 2025-05-07T19:55:00.7237551Z 2025-05-07T19:55:00.7238528Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.7241877Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.7244340Z (996): here 2025-05-07T19:55:00.7244499Z 2025-05-07T19:55:00.7245466Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.7248811Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.7251357Z (1046): here 2025-05-07T19:55:00.7251532Z 2025-05-07T19:55:00.7252506Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.7255867Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.7258300Z (1096): here 2025-05-07T19:55:00.7258465Z 2025-05-07T19:55:00.7259471Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.7262775Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.7265218Z (1146): here 2025-05-07T19:55:00.7265384Z 2025-05-07T19:55:00.7266376Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.7269855Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.7272639Z (1196): here 2025-05-07T19:55:00.7272828Z 2025-05-07T19:55:00.7273872Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.7277483Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.7280135Z (1246): here 2025-05-07T19:55:00.7280319Z 2025-05-07T19:55:00.7281390Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.7285018Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.7287855Z (1296): here 2025-05-07T19:55:00.7288031Z 2025-05-07T19:55:00.7289071Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.7292868Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.7295454Z (1346): here 2025-05-07T19:55:00.7295632Z 2025-05-07T19:55:00.7296664Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.7300215Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.7302789Z (1396): here 2025-05-07T19:55:00.7302977Z 2025-05-07T19:55:00.7304044Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.7307974Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.7310781Z (1446): here 2025-05-07T19:55:00.7310983Z 2025-05-07T19:55:00.7312209Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.7316026Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.7318746Z (1496): here 2025-05-07T19:55:00.7318942Z 2025-05-07T19:55:00.7319992Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.7323582Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.7326521Z (1546): here 2025-05-07T19:55:00.7326719Z 2025-05-07T19:55:00.7327779Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.7331403Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.7334081Z (1596): here 2025-05-07T19:55:00.7334262Z 2025-05-07T19:55:00.7335369Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.7339053Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.7341836Z (1646): here 2025-05-07T19:55:00.7342040Z 2025-05-07T19:55:00.7343114Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.7347023Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.7349705Z (1696): here 2025-05-07T19:55:00.7349892Z 2025-05-07T19:55:00.7350946Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.7354752Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.7357437Z (1746): here 2025-05-07T19:55:00.7357621Z 2025-05-07T19:55:00.7358657Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.7362250Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.7364974Z (1796): here 2025-05-07T19:55:00.7365162Z 2025-05-07T19:55:00.7366219Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.7369906Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.7372540Z (1846): here 2025-05-07T19:55:00.7372733Z 2025-05-07T19:55:00.7373817Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.7377450Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.7380159Z (1896): here 2025-05-07T19:55:00.7380358Z 2025-05-07T19:55:00.7381410Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.7385078Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.7387726Z (1946): here 2025-05-07T19:55:00.7387899Z 2025-05-07T19:55:00.7391052Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.7394836Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.7397455Z (1996): here 2025-05-07T19:55:00.7397649Z 2025-05-07T19:55:00.7398676Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.7402303Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.7404862Z (2046): here 2025-05-07T19:55:00.7405039Z 2025-05-07T19:55:00.7406102Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:00.7409963Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:00.7412556Z (2096): here 2025-05-07T19:55:00.7412745Z 2025-05-07T19:55:00.7908133Z [240/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp 2025-05-07T19:55:01.2044414Z [241/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:55:01.3032293Z [242/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:55:02.2604786Z [243/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:55:02.4484898Z [244/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp 2025-05-07T19:55:03.5626721Z [245/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adagrad_split_cpu.cpp 2025-05-07T19:55:03.8993125Z [246/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:55:04.5917263Z [247/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward_pt2.so -o fbgemm_gpu_tbe_training_backward_pt2.so CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adagrad_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_sgd_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adam_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lamb_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lars_sgd_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_none_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_sgd_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so fbgemm_gpu_tbe_cache.so fbgemm_gpu_tbe_common.so fbgemm_gpu_sparse_async_cumsum.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed fbgemm.so fbgemm_gpu_config.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && : 2025-05-07T19:55:04.8811549Z [248/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_cpu.cpp 2025-05-07T19:55:05.4132301Z [249/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_cpu.cpp 2025-05-07T19:55:10.6651337Z [250/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_dense_unweighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_kernel.cu.o 2025-05-07T19:55:30.0800201Z [251/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_grad_embedding_ops.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_grad_embedding_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_grad_embedding_ops.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_grad_embedding_ops.cu.o 2025-05-07T19:55:56.6763832Z [252/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu.o 2025-05-07T19:55:56.6786336Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:56.6787986Z 2025-05-07T19:55:56.6789552Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:56.6791741Z 2025-05-07T19:55:56.6793384Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:56.6795079Z 2025-05-07T19:55:56.6796799Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:56.6798617Z 2025-05-07T19:55:56.6800211Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:56.6802422Z 2025-05-07T19:55:56.6804021Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:56.6805868Z 2025-05-07T19:55:56.6807465Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:56.6809270Z 2025-05-07T19:55:56.6810790Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:56.6812355Z 2025-05-07T19:55:56.6813803Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:56.6815596Z 2025-05-07T19:56:04.6250931Z [253/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu.o 2025-05-07T19:56:05.6321655Z [254/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:56:05.9447999Z [255/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_gwd_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_kernel.cu.o 2025-05-07T19:56:05.9469105Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_gwd_kernel.cu(240): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:56:05.9470645Z 2025-05-07T19:56:05.9472034Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_gwd_kernel.cu(240): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:56:05.9473480Z 2025-05-07T19:56:06.0279397Z [256/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu.o 2025-05-07T19:56:07.1423256Z [257/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_warp.cu.o 2025-05-07T19:56:10.5901642Z [258/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu.o 2025-05-07T19:56:12.0414951Z [259/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:56:13.8509128Z [260/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu.o 2025-05-07T19:56:18.1376497Z [261/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu.o 2025-05-07T19:56:18.6983102Z [262/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu.o 2025-05-07T19:56:18.7522156Z [263/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:56:19.7702346Z [264/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu.o 2025-05-07T19:56:20.6866929Z [265/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:56:21.2910036Z [266/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu.o 2025-05-07T19:56:21.4338836Z [267/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu.o 2025-05-07T19:56:22.1780916Z [268/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_gwd_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_kernel.cu.o 2025-05-07T19:56:22.1801820Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_gwd_kernel.cu(231): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:56:22.1803133Z 2025-05-07T19:56:22.1804393Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_gwd_kernel.cu(231): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:56:22.1805845Z 2025-05-07T19:56:22.3073309Z [269/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:56:22.4166550Z [270/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu.o 2025-05-07T19:56:22.5948929Z [271/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_cta.cu.o 2025-05-07T19:56:22.6838474Z [272/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu.o 2025-05-07T19:56:23.7638649Z [273/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adagrad_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_cuda.cu.o 2025-05-07T19:56:24.5513194Z [274/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:56:25.2833692Z [275/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adagrad_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_cuda.cu.o 2025-05-07T19:56:25.5132390Z [276/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu.o 2025-05-07T19:56:25.5155161Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:25.5156900Z 2025-05-07T19:56:25.5158580Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:25.5160259Z 2025-05-07T19:56:25.5161738Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:25.5163408Z 2025-05-07T19:56:25.5164874Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:25.5166484Z 2025-05-07T19:56:25.5167910Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:25.5169535Z 2025-05-07T19:56:25.5170936Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:25.5172575Z 2025-05-07T19:56:25.5173728Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:25.5175251Z 2025-05-07T19:56:25.5176487Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:25.5178271Z 2025-05-07T19:56:25.5179716Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:25.5181385Z 2025-05-07T19:56:25.5182864Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:25.5184555Z 2025-05-07T19:56:25.5185965Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:25.5187443Z 2025-05-07T19:56:25.5188828Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:25.5190818Z 2025-05-07T19:56:28.1601914Z [277/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu.o 2025-05-07T19:56:28.3389075Z [278/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu.o 2025-05-07T19:56:28.3412289Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:28.3413992Z 2025-05-07T19:56:28.3415295Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:28.3416913Z 2025-05-07T19:56:28.3418189Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:28.3419740Z 2025-05-07T19:56:28.3421162Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:28.3422701Z 2025-05-07T19:56:28.3423996Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:28.3425532Z 2025-05-07T19:56:28.3426903Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:28.3428574Z 2025-05-07T19:56:28.3430053Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:28.3432101Z 2025-05-07T19:56:28.3433655Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:28.3435347Z 2025-05-07T19:56:28.3436828Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:28.3438481Z 2025-05-07T19:56:28.3440041Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:28.3441701Z 2025-05-07T19:56:28.3443171Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:28.3444809Z 2025-05-07T19:56:28.3446313Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:28.3447978Z 2025-05-07T19:56:28.4164453Z [279/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu.o 2025-05-07T19:56:28.7333165Z [280/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu.o 2025-05-07T19:56:29.0437693Z [281/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu.o 2025-05-07T19:56:29.3010700Z [282/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,fbgemm_gpu_tbe_inference.so -o fbgemm_gpu_tbe_inference.so CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_weighted_codegen_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host.cpp.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_split_lookup.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -L/github/home/miniconda/envs/build_binary/lib/stubs -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so fbgemm.so fbgemm_gpu_tbe_cache.so fbgemm_gpu_config.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed asmjit.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so -lcudadevrt -lcudart_static -lrt -lpthread -ldl && : 2025-05-07T19:56:29.5779358Z [283/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:56:29.8846357Z [284/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_cuda.cu.o 2025-05-07T19:56:31.6178116Z [285/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:56:31.7571582Z [286/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_indice_weights_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_indice_weights_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_indice_weights_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_indice_weights_codegen_cuda.cu.o 2025-05-07T19:56:34.9675428Z [287/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu.o 2025-05-07T19:56:34.9691566Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(288): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:56:34.9692870Z 2025-05-07T19:56:34.9694015Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(289): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:56:34.9695273Z 2025-05-07T19:56:34.9696326Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(136): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:56:34.9697593Z 2025-05-07T19:56:34.9698576Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(137): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:56:34.9699700Z 2025-05-07T19:56:34.9700675Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(774): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:56:34.9701790Z 2025-05-07T19:56:34.9702764Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(775): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:56:34.9704213Z 2025-05-07T19:56:34.9705202Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(627): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:56:34.9706319Z 2025-05-07T19:56:34.9707315Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(628): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:56:34.9708441Z 2025-05-07T19:56:34.9720334Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(288): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:56:34.9721588Z 2025-05-07T19:56:34.9722579Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(289): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:56:34.9723722Z 2025-05-07T19:56:34.9724724Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(136): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:56:34.9725842Z 2025-05-07T19:56:34.9726826Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(137): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:56:34.9727967Z 2025-05-07T19:56:34.9728936Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(774): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:56:34.9730047Z 2025-05-07T19:56:34.9731074Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(775): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:56:34.9732236Z 2025-05-07T19:56:34.9733465Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(627): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:56:34.9734676Z 2025-05-07T19:56:34.9735650Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(628): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:56:34.9736790Z 2025-05-07T19:56:36.8477493Z [288/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu.o 2025-05-07T19:56:37.2168208Z [289/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu.o 2025-05-07T19:56:37.5678522Z [290/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:56:48.2207685Z [291/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:56:51.1214277Z [292/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_indice_weights_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_indice_weights_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_indice_weights_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_indice_weights_codegen_cuda.cu.o 2025-05-07T19:56:57.7499151Z [293/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_warp.cu.o 2025-05-07T19:56:58.2451250Z [294/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_cta.cu.o 2025-05-07T19:56:59.5317429Z [295/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:57:05.4911446Z [296/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:57:06.7308067Z [297/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_warp.cu.o 2025-05-07T19:57:06.7479718Z [298/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu.o 2025-05-07T19:57:07.2413792Z [299/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:57:08.6489901Z [300/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_ssd_unweighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_kernel.cu.o 2025-05-07T19:57:09.0914497Z [301/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_cta.cu.o 2025-05-07T19:57:09.5339670Z [302/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lamb_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_warp.cu.o 2025-05-07T19:57:10.1454268Z [303/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu.o 2025-05-07T19:57:10.1746157Z [304/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:57:11.5812324Z [305/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu.o 2025-05-07T19:57:13.3346381Z [306/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lamb_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_cta.cu.o 2025-05-07T19:57:13.4243887Z [307/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:57:13.4401339Z [308/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu.o 2025-05-07T19:57:14.1835866Z [309/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:57:14.4411833Z [310/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu.o 2025-05-07T19:57:14.7900457Z [311/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_cuda.cu.o 2025-05-07T19:57:14.9732774Z [312/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu.o 2025-05-07T19:57:17.2089059Z [313/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:57:17.6588121Z [314/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu.o 2025-05-07T19:57:18.7206462Z [315/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:57:22.8445417Z [316/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu.o 2025-05-07T19:57:24.2859432Z [317/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_ssd_weighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_kernel.cu.o 2025-05-07T19:57:26.4565315Z [318/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_cuda.cu.o 2025-05-07T19:57:27.4005537Z [319/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_cuda.cu.o 2025-05-07T19:57:27.4960335Z [320/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:57:28.1519962Z [321/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lamb_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_cuda.cu.o 2025-05-07T19:57:28.3789511Z [322/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lamb_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_cuda.cu.o 2025-05-07T19:57:30.7029873Z [323/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:57:30.8662325Z [324/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lars_sgd_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_cuda.cu.o 2025-05-07T19:57:34.1647843Z [325/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu.o 2025-05-07T19:57:54.7637203Z [326/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_kernel.cu.o 2025-05-07T19:58:03.0292503Z [327/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_kernel.cu.o 2025-05-07T19:58:03.8650696Z [328/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_none_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_warp.cu.o 2025-05-07T19:58:05.7496320Z [329/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:58:06.9277533Z [330/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_none_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_warp.cu.o 2025-05-07T19:58:19.1572881Z [331/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_none_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_cta.cu.o 2025-05-07T19:58:19.7418662Z [332/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_ssd_unweighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_kernel.cu.o 2025-05-07T19:58:21.0160969Z [333/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:58:21.3380722Z [334/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_none_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_cta.cu.o 2025-05-07T19:58:22.2646736Z [335/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_ssd_weighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_kernel.cu.o 2025-05-07T19:58:26.2787648Z [336/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu.o 2025-05-07T19:58:28.6083842Z [337/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_none_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:58:29.0162361Z [338/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp 2025-05-07T19:58:29.0701769Z [339/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu.o 2025-05-07T19:58:30.2772745Z [340/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp 2025-05-07T19:58:30.5679253Z [341/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp 2025-05-07T19:58:31.5313632Z [342/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp 2025-05-07T19:58:31.8970169Z [343/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_none_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_cuda.cu.o 2025-05-07T19:58:31.9127144Z [344/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_none_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_cuda.cu.o 2025-05-07T19:58:32.8363383Z [345/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:58:33.5654343Z [346/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu.o 2025-05-07T19:58:33.5835991Z [347/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:58:35.3458983Z [348/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp 2025-05-07T19:58:37.7455030Z [349/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp 2025-05-07T19:58:37.7673837Z [350/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp 2025-05-07T19:58:37.8307767Z [351/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp 2025-05-07T19:58:39.0439432Z [352/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_weighted_vbe_meta.cpp 2025-05-07T19:58:39.3291182Z [353/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp 2025-05-07T19:58:40.3907395Z [354/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu.o 2025-05-07T19:58:40.5734525Z [355/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_weighted_vbe_meta.cpp 2025-05-07T19:58:41.5186598Z [356/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp 2025-05-07T19:58:44.9327247Z [357/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu.o 2025-05-07T19:58:50.2537969Z [358/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:58:50.5850279Z [359/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu.o 2025-05-07T19:58:52.3856552Z [360/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu.o 2025-05-07T19:58:53.6627985Z [361/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:58:53.7201175Z [362/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu.o 2025-05-07T19:58:55.4172700Z [363/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu.o 2025-05-07T19:58:57.2272855Z [364/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu.o 2025-05-07T19:58:57.9714679Z [365/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu.o 2025-05-07T19:58:58.1079763Z [366/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu.o 2025-05-07T19:58:59.7911906Z [367/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu.o 2025-05-07T19:59:00.8627024Z [368/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:59:03.1625402Z [369/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu.o 2025-05-07T19:59:09.2034832Z [370/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu.o 2025-05-07T19:59:10.5347876Z [371/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu.o 2025-05-07T19:59:14.4864807Z [372/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu.o 2025-05-07T19:59:15.0646861Z [373/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:59:18.7849642Z [374/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu.o 2025-05-07T19:59:19.2996231Z [375/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:59:21.3546943Z [376/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu.o 2025-05-07T19:59:23.3674799Z [377/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu.o 2025-05-07T19:59:26.4516629Z [378/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu.o 2025-05-07T19:59:26.7442074Z [379/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu.o 2025-05-07T19:59:35.4102146Z [380/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu.o 2025-05-07T19:59:40.7982032Z [381/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu.o 2025-05-07T19:59:44.0789855Z [382/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_weighted_meta.cpp 2025-05-07T19:59:45.6099145Z [383/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu.o 2025-05-07T19:59:47.8415904Z [384/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu.o 2025-05-07T19:59:49.5969579Z [385/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp 2025-05-07T19:59:52.1673870Z [386/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu.o 2025-05-07T19:59:52.6696286Z [387/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_unweighted_meta.cpp 2025-05-07T19:59:55.0339654Z [388/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu.o 2025-05-07T19:59:55.6084537Z [389/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu.o 2025-05-07T19:59:56.3704424Z [390/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu.o 2025-05-07T19:59:59.3416927Z [391/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:00:00.6100319Z [392/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:00:02.1022286Z [393/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu.o 2025-05-07T20:00:03.4066095Z [394/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu.o 2025-05-07T20:00:04.1180924Z [395/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:00:04.3717928Z [396/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu.o 2025-05-07T20:00:04.3739969Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu(236): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:00:04.3741553Z 2025-05-07T20:00:04.3742898Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu(236): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:00:04.3744394Z 2025-05-07T20:00:05.4540025Z [397/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu.o 2025-05-07T20:00:07.1086743Z [398/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_split_dense.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_split_dense.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_split_dense.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_dense.cpp 2025-05-07T20:00:07.4606589Z [399/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu.o 2025-05-07T20:00:08.2437251Z [400/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:00:08.5544178Z [401/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:00:08.6623625Z [402/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu.o 2025-05-07T20:00:08.6644988Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu(245): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:00:08.6646548Z 2025-05-07T20:00:08.6647850Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu(245): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:00:08.6649353Z 2025-05-07T20:00:09.4981372Z [403/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:00:09.9948095Z [404/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_forward.so -o fbgemm_gpu_tbe_training_forward.so CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_v2_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_v2_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel_small.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -L/github/home/miniconda/envs/build_binary/lib/stubs -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so fbgemm_gpu_tbe_common.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so -lcudadevrt -lcudart_static -lrt -lpthread -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && : 2025-05-07T20:00:10.4318370Z [405/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:00:11.3040572Z [406/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:00:12.2828778Z [407/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_cuda.cu.o 2025-05-07T20:00:12.8158279Z [408/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:00:12.9622227Z [409/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:00:13.4296547Z [410/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:00:14.2797333Z [411/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:00:15.0257085Z [412/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:00:17.2120678Z [413/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:00:18.2682261Z [414/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:00:19.9444458Z [415/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu.o 2025-05-07T20:00:20.6255779Z [416/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu.o 2025-05-07T20:00:21.5753214Z [417/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu.o 2025-05-07T20:00:23.0485599Z [418/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adagrad_split_weighted_meta.cpp 2025-05-07T20:00:23.0671032Z [419/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_sgd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_sgd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_sgd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_approx_sgd.cpp 2025-05-07T20:00:23.1218616Z [420/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu.o 2025-05-07T20:00:23.3105089Z [421/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:00:23.3577577Z [422/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adagrad.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adagrad.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adagrad.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_adagrad.cpp 2025-05-07T20:00:23.5911497Z [423/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adagrad_split_unweighted_meta.cpp 2025-05-07T20:00:23.6406451Z [424/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad.cpp 2025-05-07T20:00:24.5892408Z [425/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp 2025-05-07T20:00:25.4689766Z [426/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T20:00:25.9913780Z [427/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T20:00:26.0509538Z [428/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_weighted_adagrad.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_weighted_adagrad.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_weighted_adagrad.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_rowwise_weighted_adagrad.cpp 2025-05-07T20:00:26.2508831Z [429/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp 2025-05-07T20:00:26.2882133Z [430/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_cuda.cu.o 2025-05-07T20:00:26.8336568Z [431/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu.o 2025-05-07T20:00:27.0936288Z [432/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_sgd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_sgd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_sgd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_sgd.cpp 2025-05-07T20:00:27.7282520Z [433/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lamb.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lamb.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lamb.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_lamb.cpp 2025-05-07T20:00:27.9224110Z [434/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp 2025-05-07T20:00:28.1399603Z [435/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lars_sgd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lars_sgd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lars_sgd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_lars_sgd.cpp 2025-05-07T20:00:28.4792215Z [436/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_weighted_meta.cpp 2025-05-07T20:00:28.5610403Z [437/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:00:28.6467519Z [438/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_none.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_none.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_none.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_none.cpp 2025-05-07T20:00:29.2222799Z [439/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_unweighted_meta.cpp 2025-05-07T20:00:29.3430975Z [440/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_adam.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_adam.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_adam.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_partial_rowwise_adam.cpp 2025-05-07T20:00:29.8330347Z [441/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:00:30.0861609Z [442/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_lamb.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_lamb.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_lamb.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_partial_rowwise_lamb.cpp 2025-05-07T20:00:30.4525957Z [443/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_weighted_meta.cpp 2025-05-07T20:00:31.8811066Z [444/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu.o 2025-05-07T20:00:31.9261040Z [445/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lamb_split_weighted_meta.cpp 2025-05-07T20:00:31.9653334Z [446/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lamb_split_unweighted_meta.cpp 2025-05-07T20:00:31.9818185Z [447/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_unweighted_meta.cpp 2025-05-07T20:00:32.0165555Z [448/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lars_sgd_split_weighted_meta.cpp 2025-05-07T20:00:32.0884560Z [449/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp 2025-05-07T20:00:32.3896573Z [450/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp 2025-05-07T20:00:32.4324391Z [451/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp 2025-05-07T20:00:32.4376560Z [452/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_ops.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_ops.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_ops.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/index_select/batch_index_select_dim0_ops.cpp 2025-05-07T20:00:32.8568729Z [453/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adam.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adam.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adam.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_adam.cpp 2025-05-07T20:00:33.5808142Z [454/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp 2025-05-07T20:00:34.3720947Z [455/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp 2025-05-07T20:00:34.8972067Z [456/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_none_split_weighted_meta.cpp 2025-05-07T20:00:34.9190476Z [457/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_none_split_unweighted_meta.cpp 2025-05-07T20:00:35.1255888Z [458/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp 2025-05-07T20:00:35.1427893Z [459/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp 2025-05-07T20:00:36.4379007Z [460/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_rowwise_adagrad.cpp 2025-05-07T20:00:38.8179007Z [461/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/memory_utils/memory_utils.cpp 2025-05-07T20:00:39.4211258Z [462/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp 2025-05-07T20:00:39.5673650Z [463/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_ssd_rowwise_adagrad.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_ssd_rowwise_adagrad.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_ssd_rowwise_adagrad.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_rowwise_adagrad.cpp 2025-05-07T20:00:39.9375380Z [464/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_models.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_models.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_models.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/eeg_models.cpp 2025-05-07T20:00:40.1943071Z [465/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward_split_host.so -o fbgemm_gpu_tbe_training_backward_split_host.so CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adagrad.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_sgd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adam.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lamb.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lars_sgd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_adam.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_lamb.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_none.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_sgd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_weighted_adagrad.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_ssd_rowwise_adagrad.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so fbgemm_gpu_config.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so && : 2025-05-07T20:00:40.7059933Z [466/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/memory_utils/memory_utils_ops.cpp 2025-05-07T20:00:41.5112928Z [467/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp 2025-05-07T20:00:42.0762117Z [468/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T20:00:42.5392818Z [469/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu.o 2025-05-07T20:00:43.0910028Z [470/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp 2025-05-07T20:00:43.7862025Z [471/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/layout_transform_ops/layout_transform_ops_cpu.cpp 2025-05-07T20:00:44.2671999Z [472/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp 2025-05-07T20:00:44.3821454Z [473/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:00:44.8838038Z [474/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp 2025-05-07T20:00:45.0703750Z [475/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp 2025-05-07T20:00:45.9719224Z [476/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.cpp 2025-05-07T20:00:46.3140379Z [477/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_ops_meta.cpp 2025-05-07T20:00:46.7674720Z [478/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_host.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_host.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_host.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/index_select/batch_index_select_dim0_host.cpp 2025-05-07T20:00:46.8812288Z [479/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_ops_cpu.cpp 2025-05-07T20:00:47.1115903Z [480/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp 2025-05-07T20:00:47.2517373Z [481/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_warp.cu.o 2025-05-07T20:00:48.5618256Z [482/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/input_combine_ops/input_combine_cpu.cpp 2025-05-07T20:00:48.8417190Z [483/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp 2025-05-07T20:00:49.1560143Z [484/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/topology_utils.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/topology_utils.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/topology_utils.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/topology_utils.cpp 2025-05-07T20:00:49.1795946Z [485/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_embedding_inplace_ops_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp 2025-05-07T20:00:49.1908951Z [486/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_embedding_inplace_ops_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp 2025-05-07T20:00:49.6787732Z [487/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp 2025-05-07T20:00:49.9328185Z [488/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:00:50.1164624Z [489/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T20:00:50.5030245Z [490/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_ops_meta.cpp 2025-05-07T20:00:53.2164066Z [491/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu.o 2025-05-07T20:00:53.2764588Z [492/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops_host.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops_host.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops_host.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/metric_ops/metric_ops_host.cpp 2025-05-07T20:00:53.7700317Z [493/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_warp.cu.o 2025-05-07T20:00:53.8938502Z [494/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/layout_transform_ops/layout_transform_ops_gpu.cpp 2025-05-07T20:00:54.8568489Z [495/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/input_combine_ops/input_combine_gpu.cpp 2025-05-07T20:00:55.1378265Z [496/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_ops_gpu.cpp 2025-05-07T20:00:57.0079565Z [497/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp 2025-05-07T20:00:58.7286112Z [498/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_utils.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_utils.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_utils.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/eeg_utils.cpp 2025-05-07T20:00:59.3461256Z [499/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator_ops.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator_ops.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator_ops.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/indices_estimator_ops.cpp 2025-05-07T20:00:59.4718598Z [500/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_cuda.cu.o 2025-05-07T20:00:59.5753759Z [501/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_cta.cu.o 2025-05-07T20:00:59.7803002Z [502/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/indices_estimator.cpp 2025-05-07T20:01:00.1319674Z [503/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp 2025-05-07T20:01:00.8854487Z [504/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator_ops.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator_ops.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator_ops.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/indices_generator_ops.cpp 2025-05-07T20:01:01.3467889Z [505/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu.o 2025-05-07T20:01:01.4138430Z [506/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/indices_generator.cpp 2025-05-07T20:01:01.4314938Z [507/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp 2025-05-07T20:01:02.0118482Z [508/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp 2025-05-07T20:01:02.1823606Z [509/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp 2025-05-07T20:01:03.5549210Z [510/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_ops_gpu.cpp 2025-05-07T20:01:04.6340690Z [511/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T20:01:06.3395810Z [512/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_cuda.cu.o 2025-05-07T20:01:07.7954351Z [513/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_cta.cu.o 2025-05-07T20:01:08.6160857Z [514/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_embedding_backward_split_grad_index_select.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_embedding_backward_split_grad_index_select.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_grad_index_select.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_embedding_backward_split_grad_index_select.cu.o 2025-05-07T20:01:11.8629546Z [515/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_batch_index_select_dim0_forward_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_codegen_cuda.cu.o 2025-05-07T20:01:14.3932016Z [516/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_ops_cpu.cpp 2025-05-07T20:01:14.4099159Z [517/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_batch_index_select_dim0_forward_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel.cu.o 2025-05-07T20:01:19.3742944Z [518/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel_small.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel_small.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_batch_index_select_dim0_forward_kernel_small.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel_small.cu.o 2025-05-07T20:01:19.6132655Z [519/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/input_combine_ops/input_combine.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine.cu.o 2025-05-07T20:01:22.8829944Z [520/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/memory_utils/memory_utils.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cu.o 2025-05-07T20:01:26.7057150Z [521/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_batch_index_select_dim0_backward_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_warp.cu.o 2025-05-07T20:01:27.9503737Z [522/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/memory_utils/memory_utils_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cu.o 2025-05-07T20:01:36.8184901Z [523/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_batch_index_select_dim0_backward_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_codegen_cuda.cu.o 2025-05-07T20:01:37.4636943Z [524/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/histogram_binning_calibration_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/histogram_binning_calibration_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/histogram_binning_calibration_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/histogram_binning_calibration_ops.cu.o 2025-05-07T20:01:37.9057553Z [525/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu.o 2025-05-07T20:01:39.0124479Z [526/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_batch_index_select_dim0_backward_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_cta.cu.o 2025-05-07T20:01:39.6437988Z [527/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,fbgemm_gpu_tbe_index_select.so -o fbgemm_gpu_tbe_index_select.so CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_ops.cpp.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_host.cpp.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel_small.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_embedding_backward_split_grad_index_select.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -L/github/home/miniconda/envs/build_binary/lib/stubs -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so fbgemm_gpu_sparse_async_cumsum.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so -lcudadevrt -lcudart_static -lrt -lpthread -ldl && : 2025-05-07T20:01:41.1326096Z [528/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp 2025-05-07T20:01:42.8301952Z [529/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu.o 2025-05-07T20:01:43.2307017Z [530/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu.o 2025-05-07T20:01:44.0354128Z [531/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/metric_ops/metric_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops.cu.o 2025-05-07T20:01:44.4206048Z [532/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_bmm_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_bmm_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_dense_bmm_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_bmm_forward.cu.o 2025-05-07T20:01:47.2806480Z [533/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_add_2d_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_add_2d_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_index_add_2d_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_add_2d_forward.cu.o 2025-05-07T20:01:47.2978953Z [534/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_select_2d_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_select_2d_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_index_select_2d_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_select_2d_forward.cu.o 2025-05-07T20:01:47.8965272Z [535/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu.o 2025-05-07T20:01:47.9142290Z [536/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu.o 2025-05-07T20:01:47.9162304Z [537/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so 2025-05-07T20:01:47.9164248Z ################################################################################ 2025-05-07T20:01:47.9164869Z [CMAKE] Running post-build script ... 2025-05-07T20:01:47.9165687Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so 2025-05-07T20:01:47.9166538Z Removing all RPATHs ... 2025-05-07T20:01:47.9166950Z ################################################################################ 2025-05-07T20:01:48.0011304Z [538/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_backward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_backward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_softmax_backward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_backward.cu.o 2025-05-07T20:01:48.0474073Z [539/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so 1 2025-05-07T20:01:48.0476212Z ################################################################################ 2025-05-07T20:01:48.0476813Z [CMAKE] Running post-build script ... 2025-05-07T20:01:48.0477515Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so 2025-05-07T20:01:48.0478369Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:01:48.0479109Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:01:48.0479959Z ################################################################################ 2025-05-07T20:01:48.0568605Z [540/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:01:48.0570982Z ################################################################################ 2025-05-07T20:01:48.0571668Z [CMAKE] Running post-build script ... 2025-05-07T20:01:48.0572680Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:01:48.0573702Z Removing all RPATHs ... 2025-05-07T20:01:48.0574270Z ################################################################################ 2025-05-07T20:01:48.0938166Z [541/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_tensor_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops.cu.o 2025-05-07T20:01:48.0956547Z [542/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:01:48.0958611Z ################################################################################ 2025-05-07T20:01:48.0959212Z [CMAKE] Running post-build script ... 2025-05-07T20:01:48.0960100Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:01:48.0961099Z Removing all RPATHs ... 2025-05-07T20:01:48.0961512Z ################################################################################ 2025-05-07T20:01:48.1262262Z [543/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 1 2025-05-07T20:01:48.1264528Z ################################################################################ 2025-05-07T20:01:48.1265065Z [CMAKE] Running post-build script ... 2025-05-07T20:01:48.1266082Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:01:48.1267151Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:01:48.1267704Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:01:48.1268303Z ################################################################################ 2025-05-07T20:01:48.1345556Z [544/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so 1 2025-05-07T20:01:48.1347637Z ################################################################################ 2025-05-07T20:01:48.1348265Z [CMAKE] Running post-build script ... 2025-05-07T20:01:48.1349160Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:01:48.1350095Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:01:48.1350686Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:01:48.1351494Z ################################################################################ 2025-05-07T20:01:48.2158081Z [545/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:01:48.2160492Z ################################################################################ 2025-05-07T20:01:48.2161102Z [CMAKE] Running post-build script ... 2025-05-07T20:01:48.2161989Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:01:48.2162872Z Removing all RPATHs ... 2025-05-07T20:01:48.2163276Z ################################################################################ 2025-05-07T20:01:48.2165338Z [546/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:01:48.2167392Z ################################################################################ 2025-05-07T20:01:48.2168160Z [CMAKE] Running post-build script ... 2025-05-07T20:01:48.2169107Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:01:48.2170086Z Removing all RPATHs ... 2025-05-07T20:01:48.2170556Z ################################################################################ 2025-05-07T20:01:48.2983206Z [547/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 1 2025-05-07T20:01:48.2985464Z ################################################################################ 2025-05-07T20:01:48.2986042Z [CMAKE] Running post-build script ... 2025-05-07T20:01:48.2987183Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:01:48.2988334Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:01:48.2988956Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:01:48.2989655Z ################################################################################ 2025-05-07T20:01:48.3257191Z [548/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 1 2025-05-07T20:01:48.3259524Z ################################################################################ 2025-05-07T20:01:48.3260143Z [CMAKE] Running post-build script ... 2025-05-07T20:01:48.3261190Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:01:48.3262250Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:01:48.3262856Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:01:48.3263526Z ################################################################################ 2025-05-07T20:01:48.7558729Z [549/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_index_select.so 1 2025-05-07T20:01:48.7561074Z ################################################################################ 2025-05-07T20:01:48.7561684Z [CMAKE] Running post-build script ... 2025-05-07T20:01:48.7562666Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:01:48.7563660Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:01:48.7564286Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:01:48.7564982Z ################################################################################ 2025-05-07T20:01:48.9566064Z [550/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_softmax_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_forward.cu.o 2025-05-07T20:01:49.6661570Z [551/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/layout_transform_ops/layout_transform_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops.cu.o 2025-05-07T20:01:49.6837023Z [552/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu.o 2025-05-07T20:01:49.6939283Z [553/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_mx.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_mx.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_mx.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_mx.cu.o 2025-05-07T20:01:49.8583783Z [554/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/dense_to_jagged_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/dense_to_jagged_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/dense_to_jagged_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/dense_to_jagged_forward.cu.o 2025-05-07T20:01:51.1850670Z [555/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu.o 2025-05-07T20:01:51.8050882Z [556/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_forward.so 1 2025-05-07T20:01:51.8052721Z ################################################################################ 2025-05-07T20:01:51.8053194Z [CMAKE] Running post-build script ... 2025-05-07T20:01:51.8054048Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:01:51.8054903Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:01:51.8055395Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:01:51.8055970Z ################################################################################ 2025-05-07T20:01:51.9284318Z [557/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu.o 2025-05-07T20:01:52.4361766Z [558/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fp8_rowwise.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fp8_rowwise.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fp8_rowwise.cu.o 2025-05-07T20:01:52.4696545Z [559/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_inference.so 1 2025-05-07T20:01:52.4698698Z ################################################################################ 2025-05-07T20:01:52.4699600Z [CMAKE] Running post-build script ... 2025-05-07T20:01:52.4700534Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:01:52.4701526Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:01:52.4702148Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:01:52.4702866Z ################################################################################ 2025-05-07T20:01:52.8214369Z [560/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_nbit_rowwise.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_nbit_rowwise.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_nbit_rowwise.cu.o 2025-05-07T20:01:54.1027043Z [561/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu.o 2025-05-07T20:01:54.4580724Z [562/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_8bit_rowwise.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_8bit_rowwise.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_8bit_rowwise.cu.o 2025-05-07T20:01:54.8414517Z [563/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu.o 2025-05-07T20:01:54.8518123Z [564/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_bfloat16.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_bfloat16.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_bfloat16.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_bfloat16.cu.o 2025-05-07T20:01:55.9917915Z [565/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_padded_fp8_rowwise.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_padded_fp8_rowwise.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_padded_fp8_rowwise.cu.o 2025-05-07T20:01:56.1385502Z [566/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_unique_indices.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_unique_indices.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_unique_indices.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_unique_indices.cu.o 2025-05-07T20:01:57.5619520Z [567/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_msfp.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_msfp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_msfp.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_msfp.cu.o 2025-05-07T20:01:57.8112662Z [568/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_hfp8.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_hfp8.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_hfp8.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_hfp8.cu.o 2025-05-07T20:02:04.8669048Z [569/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cu.o 2025-05-07T20:02:04.8686548Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.cu(16): warning #177-D: variable "kMaxThreads" was declared but never referenced 2025-05-07T20:02:04.8687639Z 2025-05-07T20:02:04.8688487Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.cu(16): warning #177-D: variable "kMaxThreads" was declared but never referenced 2025-05-07T20:02:04.8689715Z 2025-05-07T20:02:07.3886609Z [570/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_embedding_inplace_ops_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update.cu.o -MF CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/embedding_inplace_ops/embedding_inplace_update.cu -o CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update.cu.o 2025-05-07T20:02:07.4678891Z [571/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_batched_unary_embeddings.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_batched_unary_embeddings.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_batched_unary_embeddings.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_batched_unary_embeddings.cu.o 2025-05-07T20:02:07.5525040Z [572/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_bucketize_features.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_bucketize_features.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_bucketize_features.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_bucketize_features.cu.o 2025-05-07T20:02:07.9242699Z [573/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,fbgemm_gpu_embedding_inplace_ops.so -o fbgemm_gpu_embedding_inplace_ops.so CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp.o CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp.o CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -L/github/home/miniconda/envs/build_binary/lib/stubs -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed -lcudadevrt -lcudart_static -lrt -lpthread -ldl && : 2025-05-07T20:02:07.9342663Z [574/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:02:07.9343922Z ################################################################################ 2025-05-07T20:02:07.9344285Z [CMAKE] Running post-build script ... 2025-05-07T20:02:07.9344899Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:02:07.9345525Z Removing all RPATHs ... 2025-05-07T20:02:07.9345815Z ################################################################################ 2025-05-07T20:02:09.6904340Z [575/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_compute_frequency_sequence.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_compute_frequency_sequence.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_compute_frequency_sequence.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_compute_frequency_sequence.cu.o 2025-05-07T20:02:11.4350190Z [576/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_expand_into_jagged_permute.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_expand_into_jagged_permute.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_expand_into_jagged_permute.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_expand_into_jagged_permute.cu.o 2025-05-07T20:02:12.3726494Z [577/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu.o 2025-05-07T20:02:14.3333657Z [578/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_group_index.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_group_index.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_group_index.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_group_index.cu.o 2025-05-07T20:02:15.6784965Z [579/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu.o 2025-05-07T20:02:15.7546640Z [580/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_block_bucketize_features.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_block_bucketize_features.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_block_bucketize_features.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_block_bucketize_features.cu.o 2025-05-07T20:02:18.6795694Z [581/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_add.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_add.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_index_add.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_add.cu.o 2025-05-07T20:02:20.0351091Z [582/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_select.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_select.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_index_select.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_select.cu.o 2025-05-07T20:02:20.6871998Z [583/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T20:02:22.3773596Z [584/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu.o 2025-05-07T20:02:26.1741104Z [585/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu.o 2025-05-07T20:02:27.3119019Z [586/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_invert_permute.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_invert_permute.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_invert_permute.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_invert_permute.cu.o 2025-05-07T20:02:27.7373422Z [587/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward.so -o fbgemm_gpu_tbe_training_backward.so CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/codegen/training/backward/embedding_backward_dense_host_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adagrad_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_sgd_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adam_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lamb_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lars_sgd_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_none_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_sgd_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_grad_embedding_ops.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_indice_weights_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_indice_weights_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -L/github/home/miniconda/envs/build_binary/lib/stubs -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so fbgemm_gpu_tbe_cache.so fbgemm_gpu_tbe_common.so fbgemm_gpu_sparse_async_cumsum.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed fbgemm.so fbgemm_gpu_config.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so -lcudadevrt -lcudart_static -lrt -lpthread -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && : 2025-05-07T20:02:28.4623397Z [588/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward_gwd.so -o fbgemm_gpu_tbe_training_backward_gwd.so CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so fbgemm_gpu_tbe_training_backward.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so -lcudadevrt -lcudart_static -lrt -lpthread -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build -L"/github/home/miniconda/envs/build_binary/lib/stubs" -L"/github/home/miniconda/envs/build_binary/lib" && : 2025-05-07T20:02:28.4995192Z [589/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward.so 1 2025-05-07T20:02:28.4997422Z ################################################################################ 2025-05-07T20:02:28.4998031Z [CMAKE] Running post-build script ... 2025-05-07T20:02:28.4999068Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:02:28.5000139Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:02:28.5000748Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:02:28.5001497Z ################################################################################ 2025-05-07T20:02:28.6250180Z [590/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward_vbe.so -o fbgemm_gpu_tbe_training_backward_vbe.so CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -L/github/home/miniconda/envs/build_binary/lib/stubs -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so fbgemm_gpu_tbe_training_backward.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so -lcudadevrt -lcudart_static -lrt -lpthread -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && : 2025-05-07T20:02:29.6563086Z [591/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_backward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_backward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_pack_segments_backward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_backward.cu.o 2025-05-07T20:02:29.8745923Z [592/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 1 2025-05-07T20:02:29.8748153Z ################################################################################ 2025-05-07T20:02:29.8748732Z [CMAKE] Running post-build script ... 2025-05-07T20:02:29.8749772Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:02:29.8750812Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:02:29.8751509Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:02:29.8752202Z ################################################################################ 2025-05-07T20:02:32.2794053Z [593/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_pack_segments_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_forward.cu.o 2025-05-07T20:02:32.8861995Z [594/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute102.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute102.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_permute102.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute102.cu.o 2025-05-07T20:02:33.3559756Z [595/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 1 2025-05-07T20:02:33.3561097Z ################################################################################ 2025-05-07T20:02:33.3561453Z [CMAKE] Running post-build script ... 2025-05-07T20:02:33.3562317Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:02:33.3562946Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:02:33.3563312Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:02:33.3563724Z ################################################################################ 2025-05-07T20:02:33.3808669Z [596/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_1d.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_1d.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_permute_1d.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_1d.cu.o 2025-05-07T20:02:33.7510632Z [597/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_embeddings.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_embeddings.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_permute_embeddings.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_embeddings.cu.o 2025-05-07T20:02:33.8170295Z [598/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward_dense.so -o fbgemm_gpu_tbe_training_backward_dense.so CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_split_dense.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_warp.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -L/github/home/miniconda/envs/build_binary/lib/stubs -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so fbgemm_gpu_tbe_training_backward.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so -lcudadevrt -lcudart_static -lrt -lpthread -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && : 2025-05-07T20:02:34.2132893Z [599/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 1 2025-05-07T20:02:34.2134603Z ################################################################################ 2025-05-07T20:02:34.2134960Z [CMAKE] Running post-build script ... 2025-05-07T20:02:34.2135580Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:02:34.2136210Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:02:34.2136576Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:02:34.2136974Z ################################################################################ 2025-05-07T20:02:34.6221324Z [600/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_range.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_range.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_range.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_range.cu.o 2025-05-07T20:02:35.6491034Z [601/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_2d.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_2d.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_permute_2d.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_2d.cu.o 2025-05-07T20:02:36.6315730Z [602/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_zipf.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_zipf.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_zipf.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_zipf.cu.o 2025-05-07T20:02:37.0660300Z [603/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_reorder_batched_ad.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_reorder_batched_ad.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_reorder_batched_ad.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_reorder_batched_ad.cu.o 2025-05-07T20:02:38.0033167Z [604/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_segment_sum_csr.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_segment_sum_csr.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_segment_sum_csr.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_segment_sum_csr.cu.o 2025-05-07T20:02:49.3015970Z [605/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu.o 2025-05-07T20:02:50.0140561Z [606/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,fbgemm_gpu_py.so -o fbgemm_gpu_py.so CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_meta.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_meta.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_models.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_utils.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator_ops.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator_ops.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops_host.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/topology_utils.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/histogram_binning_calibration_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/dense_to_jagged_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_bmm_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_add_2d_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_select_2d_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_backward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_unique_indices.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_bfloat16.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fp8_rowwise.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_8bit_rowwise.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_nbit_rowwise.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_hfp8.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_msfp.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_padded_fp8_rowwise.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_mx.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_block_bucketize_features.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_bucketize_features.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_batched_unary_embeddings.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_compute_frequency_sequence.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_expand_into_jagged_permute.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_group_index.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_add.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_select.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_invert_permute.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_backward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_1d.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_2d.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute102.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_embeddings.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_range.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_reorder_batched_ad.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_segment_sum_csr.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_zipf.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -L/github/home/miniconda/envs/build_binary/lib/stubs -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so fbgemm.so fbgemm_gpu_embedding_inplace_ops.so fbgemm_gpu_tbe_index_select.so fbgemm_gpu_tbe_cache.so fbgemm_gpu_tbe_optimizers.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed fbgemm_gpu_sparse_async_cumsum.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so -lcudadevrt -lcudart_static -lrt -lpthread -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && : 2025-05-07T20:02:50.1142154Z [607/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_py.so 1 2025-05-07T20:02:50.1143392Z ################################################################################ 2025-05-07T20:02:50.1143744Z [CMAKE] Running post-build script ... 2025-05-07T20:02:50.1144518Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:02:50.1145054Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:02:50.1145423Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:02:50.1145819Z ################################################################################ 2025-05-07T20:02:50.1146807Z [607/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/cmake/data/bin/cmake -P cmake_install.cmake 2025-05-07T20:02:50.1186824Z -- Install configuration: "Release" 2025-05-07T20:02:50.1187508Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/asmjit.so 2025-05-07T20:02:50.1209629Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm.so 2025-05-07T20:02:50.1210535Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_cache.so 2025-05-07T20:02:50.1230549Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_inference.so 2025-05-07T20:02:50.1231621Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_config.so 2025-05-07T20:02:50.1248027Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_utils.so 2025-05-07T20:02:50.1263988Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:02:50.1265002Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_common.so 2025-05-07T20:02:50.1265926Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:02:50.1282040Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:02:50.1283091Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:02:50.1284336Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:02:50.1285571Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:02:50.1286797Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:02:50.1287893Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:02:50.1289077Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:02:50.1290183Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:02:50.1292399Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adagrad.py 2025-05-07T20:02:50.1293640Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad.py 2025-05-07T20:02:50.1294852Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_sgd.py 2025-05-07T20:02:50.1296000Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adam.py 2025-05-07T20:02:50.1297164Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lamb.py 2025-05-07T20:02:50.1298396Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lars_sgd.py 2025-05-07T20:02:50.1299641Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_adam.py 2025-05-07T20:02:50.1300936Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_lamb.py 2025-05-07T20:02:50.1302170Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_none.py 2025-05-07T20:02:50.1303419Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_with_counter.py 2025-05-07T20:02:50.1304801Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_ssd.py 2025-05-07T20:02:50.1306013Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/__init__.py 2025-05-07T20:02:50.1307150Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args.py 2025-05-07T20:02:50.1308322Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args_ssd.py 2025-05-07T20:02:50.1309592Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_optimizer_codegen/split_embedding_optimizer_rowwise_adagrad.py 2025-05-07T20:02:50.1310844Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_optimizer_codegen/optimizer_args.py 2025-05-07T20:02:50.1312009Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:02:50.1318519Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_py.so 2025-05-07T20:02:50.1371772Z 2025-05-07T20:02:50.1409458Z 2025-05-07T20:02:50.1410639Z copying fbgemm_gpu/__init__.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/__init__.py 2025-05-07T20:02:50.1413301Z copying fbgemm_gpu/batched_unary_embeddings_ops.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/batched_unary_embeddings_ops.py 2025-05-07T20:02:50.1415827Z copying fbgemm_gpu/enums.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/enums.py 2025-05-07T20:02:50.1416669Z copying fbgemm_gpu/metrics.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/metrics.py 2025-05-07T20:02:50.1417677Z copying fbgemm_gpu/permute_pooled_embedding_modules.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules.py 2025-05-07T20:02:50.1419002Z copying fbgemm_gpu/permute_pooled_embedding_modules_split.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules_split.py 2025-05-07T20:02:50.1419978Z copying fbgemm_gpu/quantize_comm.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/quantize_comm.py 2025-05-07T20:02:50.1420773Z copying fbgemm_gpu/quantize_utils.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/quantize_utils.py 2025-05-07T20:02:50.1421588Z copying fbgemm_gpu/runtime_monitor.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/runtime_monitor.py 2025-05-07T20:02:50.1422369Z copying fbgemm_gpu/sparse_ops.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sparse_ops.py 2025-05-07T20:02:50.1423213Z copying fbgemm_gpu/split_embedding_configs.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_configs.py 2025-05-07T20:02:50.1424312Z copying fbgemm_gpu/split_embedding_inference_converter.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_inference_converter.py 2025-05-07T20:02:50.1425387Z copying fbgemm_gpu/split_embedding_optimizer_ops.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_optimizer_ops.py 2025-05-07T20:02:50.1426354Z copying fbgemm_gpu/split_embedding_utils.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_utils.py 2025-05-07T20:02:50.1427350Z copying fbgemm_gpu/split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops.py 2025-05-07T20:02:50.1428514Z copying fbgemm_gpu/split_table_batched_embeddings_ops_common.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_common.py 2025-05-07T20:02:50.1429763Z copying fbgemm_gpu/split_table_batched_embeddings_ops_inference.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_inference.py 2025-05-07T20:02:50.1431077Z copying fbgemm_gpu/split_table_batched_embeddings_ops_training.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training.py 2025-05-07T20:02:50.1432692Z copying fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py 2025-05-07T20:02:50.1433991Z copying fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py 2025-05-07T20:02:50.1435035Z copying fbgemm_gpu/tbe_input_multiplexer.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe_input_multiplexer.py 2025-05-07T20:02:50.1435819Z copying fbgemm_gpu/uvm.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/uvm.py 2025-05-07T20:02:50.1436456Z creating directory _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/config 2025-05-07T20:02:50.1437153Z copying fbgemm_gpu/config/__init__.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/config/__init__.py 2025-05-07T20:02:50.1438009Z copying fbgemm_gpu/config/feature_list.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/config/feature_list.py 2025-05-07T20:02:50.1438735Z creating directory _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs 2025-05-07T20:02:50.1439409Z copying fbgemm_gpu/docs/__init__.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/__init__.py 2025-05-07T20:02:50.1440179Z copying fbgemm_gpu/docs/common.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/common.py 2025-05-07T20:02:50.1440952Z copying fbgemm_gpu/docs/examples.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/examples.py 2025-05-07T20:02:50.1441822Z copying fbgemm_gpu/docs/jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/jagged_tensor_ops.py 2025-05-07T20:02:50.1442822Z copying fbgemm_gpu/docs/merge_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/merge_pooled_embedding_ops.py 2025-05-07T20:02:50.1444121Z copying fbgemm_gpu/docs/permute_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/permute_pooled_embedding_ops.py 2025-05-07T20:02:50.1445087Z copying fbgemm_gpu/docs/quantize_ops.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/quantize_ops.py 2025-05-07T20:02:50.1445899Z copying fbgemm_gpu/docs/sparse_ops.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/sparse_ops.py 2025-05-07T20:02:50.1446702Z copying fbgemm_gpu/docs/version.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/version.py 2025-05-07T20:02:50.1447370Z creating directory _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/quantize 2025-05-07T20:02:50.1448083Z copying fbgemm_gpu/quantize/__init__.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/quantize/__init__.py 2025-05-07T20:02:50.1448984Z copying fbgemm_gpu/quantize/quantize_ops.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/quantize/quantize_ops.py 2025-05-07T20:02:50.1449707Z creating directory _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll 2025-05-07T20:02:50.1450357Z copying fbgemm_gpu/sll/__init__.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/__init__.py 2025-05-07T20:02:50.1451005Z creating directory _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe 2025-05-07T20:02:50.1451646Z copying fbgemm_gpu/tbe/__init__.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/__init__.py 2025-05-07T20:02:50.1452299Z creating directory _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton 2025-05-07T20:02:50.1452969Z copying fbgemm_gpu/triton/__init__.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/__init__.py 2025-05-07T20:02:50.1453757Z copying fbgemm_gpu/triton/common.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/common.py 2025-05-07T20:02:50.1454545Z copying fbgemm_gpu/triton/quantize.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/quantize.py 2025-05-07T20:02:50.1455427Z copying fbgemm_gpu/triton/quantize_ref.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/quantize_ref.py 2025-05-07T20:02:50.1456149Z creating directory _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/utils 2025-05-07T20:02:50.1456803Z copying fbgemm_gpu/utils/__init__.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/utils/__init__.py 2025-05-07T20:02:50.1457600Z copying fbgemm_gpu/utils/filestore.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/utils/filestore.py 2025-05-07T20:02:50.1458416Z copying fbgemm_gpu/utils/loader.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/utils/loader.py 2025-05-07T20:02:50.1459218Z copying fbgemm_gpu/utils/torch_library.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/utils/torch_library.py 2025-05-07T20:02:50.1459948Z creating directory _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/cpu 2025-05-07T20:02:50.1460633Z copying fbgemm_gpu/sll/cpu/__init__.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/cpu/__init__.py 2025-05-07T20:02:50.1461435Z copying fbgemm_gpu/sll/cpu/cpu_sll.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/cpu/cpu_sll.py 2025-05-07T20:02:50.1462137Z creating directory _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/meta 2025-05-07T20:02:50.1462824Z copying fbgemm_gpu/sll/meta/__init__.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/meta/__init__.py 2025-05-07T20:02:50.1463647Z copying fbgemm_gpu/sll/meta/meta_sll.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/meta/meta_sll.py 2025-05-07T20:02:50.1464353Z creating directory _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton 2025-05-07T20:02:50.1465091Z copying fbgemm_gpu/sll/triton/__init__.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/__init__.py 2025-05-07T20:02:50.1465961Z copying fbgemm_gpu/sll/triton/common.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/common.py 2025-05-07T20:02:50.1467012Z copying fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py 2025-05-07T20:02:50.1468350Z copying fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py 2025-05-07T20:02:50.1469452Z copying fbgemm_gpu/sll/triton/triton_jagged_bmm.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm.py 2025-05-07T20:02:50.1470535Z copying fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py 2025-05-07T20:02:50.1472017Z copying fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py 2025-05-07T20:02:50.1473472Z copying fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py 2025-05-07T20:02:50.1474914Z copying fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py 2025-05-07T20:02:50.1476275Z copying fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py 2025-05-07T20:02:50.1477653Z copying fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py 2025-05-07T20:02:50.1478940Z copying fbgemm_gpu/sll/triton/triton_jagged_softmax.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_softmax.py 2025-05-07T20:02:50.1480234Z copying fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py 2025-05-07T20:02:50.1481243Z creating directory _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench 2025-05-07T20:02:50.1481987Z copying fbgemm_gpu/tbe/bench/__init__.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/__init__.py 2025-05-07T20:02:50.1482907Z copying fbgemm_gpu/tbe/bench/bench_config.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/bench_config.py 2025-05-07T20:02:50.1483932Z copying fbgemm_gpu/tbe/bench/bench_runs.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/bench_runs.py 2025-05-07T20:02:50.1484794Z copying fbgemm_gpu/tbe/bench/eeg_cli.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/eeg_cli.py 2025-05-07T20:02:50.1485781Z copying fbgemm_gpu/tbe/bench/embedding_ops_common_config.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/embedding_ops_common_config.py 2025-05-07T20:02:50.1486928Z copying fbgemm_gpu/tbe/bench/eval_compression.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/eval_compression.py 2025-05-07T20:02:50.1487856Z copying fbgemm_gpu/tbe/bench/reporter.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/reporter.py 2025-05-07T20:02:50.1488773Z copying fbgemm_gpu/tbe/bench/tbe_data_config.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config.py 2025-05-07T20:02:50.1489784Z copying fbgemm_gpu/tbe/bench/tbe_data_config_loader.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_loader.py 2025-05-07T20:02:50.1491519Z copying fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py 2025-05-07T20:02:50.1492643Z copying fbgemm_gpu/tbe/bench/utils.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/utils.py 2025-05-07T20:02:50.1493378Z creating directory _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/cache 2025-05-07T20:02:50.1494103Z copying fbgemm_gpu/tbe/cache/__init__.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/cache/__init__.py 2025-05-07T20:02:50.1495231Z copying fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py 2025-05-07T20:02:50.1496128Z creating directory _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd 2025-05-07T20:02:50.1496824Z copying fbgemm_gpu/tbe/ssd/__init__.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/__init__.py 2025-05-07T20:02:50.1497647Z copying fbgemm_gpu/tbe/ssd/common.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/common.py 2025-05-07T20:02:50.1498482Z copying fbgemm_gpu/tbe/ssd/inference.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/inference.py 2025-05-07T20:02:50.1499424Z copying fbgemm_gpu/tbe/ssd/training.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/training.py 2025-05-07T20:02:50.1500197Z creating directory _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/utils 2025-05-07T20:02:50.1500942Z copying fbgemm_gpu/tbe/utils/__init__.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/utils/__init__.py 2025-05-07T20:02:50.1501838Z copying fbgemm_gpu/tbe/utils/common.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/utils/common.py 2025-05-07T20:02:50.1502721Z copying fbgemm_gpu/tbe/utils/offsets.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/utils/offsets.py 2025-05-07T20:02:50.1503769Z copying fbgemm_gpu/tbe/utils/quantize.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/utils/quantize.py 2025-05-07T20:02:50.1504681Z copying fbgemm_gpu/tbe/utils/requests.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/utils/requests.py 2025-05-07T20:02:50.1505431Z creating directory _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/stats 2025-05-07T20:02:50.1506236Z copying fbgemm_gpu/tbe/stats/__init__.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/stats/__init__.py 2025-05-07T20:02:50.1507215Z copying fbgemm_gpu/tbe/stats/bench_params_reporter.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/stats/bench_params_reporter.py 2025-05-07T20:02:50.1508102Z creating directory _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:02:50.1508914Z copying fbgemm_gpu/tbe/ssd/utils/__init__.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/utils/__init__.py 2025-05-07T20:02:50.1510031Z copying fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py 2025-05-07T20:02:50.1511041Z creating directory _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/jagged 2025-05-07T20:02:50.1512114Z copying fbgemm_gpu/triton/jagged/__init__.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/jagged/__init__.py 2025-05-07T20:02:50.1513201Z copying fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py 2025-05-07T20:02:50.1513903Z 2025-05-07T20:02:50.1624410Z INFO:root:running bdist_wheel 2025-05-07T20:02:50.1668489Z INFO:root:running build 2025-05-07T20:02:50.1669330Z INFO:root:running build_py 2025-05-07T20:02:50.1672379Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:50.1675260Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:50.1678116Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/batched_unary_embeddings_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:50.1679426Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/enums.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:50.1680782Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/metrics.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:50.1682113Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:50.1683558Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules_split.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:50.1685017Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/quantize_comm.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:50.1687084Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/quantize_utils.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:50.1688644Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/runtime_monitor.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:50.1690172Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sparse_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:50.1692682Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_configs.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:50.1694449Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_inference_converter.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:50.1696013Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_optimizer_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:50.1697498Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_utils.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:50.1699115Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:50.1701490Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_common.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:50.1703112Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_inference.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:50.1705886Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:50.1708890Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:50.1710376Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:50.1712089Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe_input_multiplexer.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:50.1713503Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/uvm.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:50.1716084Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/config 2025-05-07T20:02:50.1717283Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/config/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/config 2025-05-07T20:02:50.1718953Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/config/feature_list.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/config 2025-05-07T20:02:50.1721385Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:02:50.1722494Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:02:50.1724116Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/common.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:02:50.1725644Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/examples.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:02:50.1727126Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:02:50.1728709Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/merge_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:02:50.1730196Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/permute_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:02:50.1731608Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/quantize_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:02:50.1733087Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/sparse_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:02:50.1735159Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/version.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:02:50.1737130Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/quantize 2025-05-07T20:02:50.1738223Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/quantize/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/quantize 2025-05-07T20:02:50.1739900Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/quantize/quantize_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/quantize 2025-05-07T20:02:50.1741889Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll 2025-05-07T20:02:50.1742970Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll 2025-05-07T20:02:50.1745162Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe 2025-05-07T20:02:50.1746196Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe 2025-05-07T20:02:50.1748403Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton 2025-05-07T20:02:50.1749480Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton 2025-05-07T20:02:50.1751137Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/common.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton 2025-05-07T20:02:50.1753040Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/quantize.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton 2025-05-07T20:02:50.1754763Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/quantize_ref.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton 2025-05-07T20:02:50.1756906Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/utils 2025-05-07T20:02:50.1757960Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/utils/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/utils 2025-05-07T20:02:50.1759657Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/utils/filestore.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/utils 2025-05-07T20:02:50.1761356Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/utils/loader.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/utils 2025-05-07T20:02:50.1762855Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/utils/torch_library.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/utils 2025-05-07T20:02:50.1764890Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/cpu 2025-05-07T20:02:50.1765889Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/cpu/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/cpu 2025-05-07T20:02:50.1767530Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/cpu/cpu_sll.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/cpu 2025-05-07T20:02:50.1769764Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/meta 2025-05-07T20:02:50.1770828Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/meta/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/meta 2025-05-07T20:02:50.1772450Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/meta/meta_sll.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/meta 2025-05-07T20:02:50.1775707Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:02:50.1776782Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:02:50.1778542Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/common.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:02:50.1780224Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:02:50.1781714Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:02:50.1783262Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:02:50.1784920Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:02:50.1786955Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:02:50.1788624Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:02:50.1790185Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:02:50.1792346Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:02:50.1794217Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:02:50.1795918Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_softmax.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:02:50.1797531Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:02:50.1799910Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:02:50.1801025Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:02:50.1802816Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/bench_config.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:02:50.1804430Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/bench_runs.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:02:50.1805994Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/eeg_cli.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:02:50.1807551Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/embedding_ops_common_config.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:02:50.1809442Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/eval_compression.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:02:50.1810918Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/reporter.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:02:50.1812399Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:02:50.1814510Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_loader.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:02:50.1816028Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:02:50.1817488Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/utils.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:02:50.1819654Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/cache 2025-05-07T20:02:50.1820686Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/cache/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/cache 2025-05-07T20:02:50.1822583Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/cache 2025-05-07T20:02:50.1824646Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd 2025-05-07T20:02:50.1825994Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd 2025-05-07T20:02:50.1829690Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/common.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd 2025-05-07T20:02:50.1831110Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/inference.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd 2025-05-07T20:02:50.1832578Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/training.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd 2025-05-07T20:02:50.1833673Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/utils 2025-05-07T20:02:50.1834923Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/utils/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/utils 2025-05-07T20:02:50.1836412Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/utils/common.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/utils 2025-05-07T20:02:50.1837948Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/utils/offsets.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/utils 2025-05-07T20:02:50.1839506Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/utils/quantize.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/utils 2025-05-07T20:02:50.1840993Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/utils/requests.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/utils 2025-05-07T20:02:50.1843697Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/stats 2025-05-07T20:02:50.1844810Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/stats/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/stats 2025-05-07T20:02:50.1846498Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/stats/bench_params_reporter.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/stats 2025-05-07T20:02:50.1848496Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:02:50.1849622Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/utils/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:02:50.1851376Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:02:50.1853151Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton/jagged 2025-05-07T20:02:50.1854348Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/jagged/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton/jagged 2025-05-07T20:02:50.1856076Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton/jagged 2025-05-07T20:02:50.1895768Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/asmjit.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:50.1945556Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:50.2235291Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_cache.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:50.2724059Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_inference.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:51.2229103Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_config.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:51.2233217Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_utils.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:51.2865854Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_sparse_async_cumsum.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:51.2933671Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_common.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:51.3063422Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_optimizers.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:51.3412676Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_forward.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:52.8684274Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_pt2.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:52.9268423Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:57.0187292Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_gwd.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:57.6264172Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_vbe.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:59.0304680Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_dense.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:59.2715851Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_split_host.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:59.3004204Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_index_select.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:59.4427313Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:02:59.4428863Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adagrad.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:02:59.4432877Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:02:59.4444201Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_sgd.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:02:59.4449365Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adam.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:02:59.4458328Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lamb.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:02:59.4466787Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lars_sgd.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:02:59.4473105Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_adam.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:02:59.4487635Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_lamb.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:02:59.4492598Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_none.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:02:59.4514126Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_with_counter.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:02:59.4516478Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_ssd.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:02:59.4521440Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:02:59.4531039Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:02:59.4537591Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args_ssd.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:02:59.4554088Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:02:59.4558561Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_optimizer_codegen/split_embedding_optimizer_rowwise_adagrad.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:02:59.4568916Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_optimizer_codegen/optimizer_args.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:02:59.4571163Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_embedding_inplace_ops.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:59.4598209Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_py.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:59.7640443Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:59.7644330Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/batched_unary_embeddings_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:59.7646696Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/enums.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:59.7647879Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/metrics.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:59.7649374Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:59.7650768Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules_split.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:59.7652094Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/quantize_comm.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:59.7653305Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/quantize_utils.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:59.7654555Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/runtime_monitor.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:59.7655785Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sparse_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:59.7657022Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_configs.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:59.7658389Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_inference_converter.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:59.7659771Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_optimizer_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:59.7661080Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_utils.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:59.7662431Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:59.7663975Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_common.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:59.7665425Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_inference.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:59.7666915Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:59.7679701Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:59.7681229Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:59.7682619Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe_input_multiplexer.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:59.7683827Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/uvm.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:59.7685042Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/config/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/config 2025-05-07T20:02:59.7686484Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/config/feature_list.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/config 2025-05-07T20:02:59.7687785Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:02:59.7689050Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/common.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:02:59.7690354Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/examples.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:02:59.7692116Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:02:59.7693547Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/merge_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:02:59.7695045Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/permute_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:02:59.7696457Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/quantize_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:02:59.7697904Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/sparse_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:02:59.7699188Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/version.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:02:59.7700478Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/quantize/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/quantize 2025-05-07T20:02:59.7701977Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/quantize/quantize_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/quantize 2025-05-07T20:02:59.7703289Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll 2025-05-07T20:02:59.7704515Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe 2025-05-07T20:02:59.7705781Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton 2025-05-07T20:02:59.7707139Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/common.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton 2025-05-07T20:02:59.7708439Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/quantize.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton 2025-05-07T20:02:59.7709769Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/quantize_ref.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton 2025-05-07T20:02:59.7711147Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/utils/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/utils 2025-05-07T20:02:59.7712660Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/utils/filestore.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/utils 2025-05-07T20:02:59.7714067Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/utils/loader.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/utils 2025-05-07T20:02:59.7715409Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/utils/torch_library.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/utils 2025-05-07T20:02:59.7716732Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/cpu/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/cpu 2025-05-07T20:02:59.7718170Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/cpu/cpu_sll.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/cpu 2025-05-07T20:02:59.7719489Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/meta/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/meta 2025-05-07T20:02:59.7720794Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/meta/meta_sll.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/meta 2025-05-07T20:02:59.7722142Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:02:59.7723499Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/common.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:02:59.7724961Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:02:59.7726529Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:02:59.7728092Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:02:59.7729561Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:02:59.7731132Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:02:59.7732782Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:02:59.7734455Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:02:59.7736078Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:02:59.7737723Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:02:59.7739283Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_softmax.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:02:59.7740917Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:02:59.7742427Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:02:59.7743790Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/bench_config.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:02:59.7745203Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/bench_runs.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:02:59.7746560Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/eeg_cli.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:02:59.7747976Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/embedding_ops_common_config.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:02:59.7749486Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/eval_compression.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:02:59.7750902Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/reporter.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:02:59.7752572Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:02:59.7754083Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_loader.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:02:59.7755696Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:02:59.7757143Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/utils.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:02:59.7758522Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/cache/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/cache 2025-05-07T20:02:59.7759982Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/cache 2025-05-07T20:02:59.7761436Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd 2025-05-07T20:02:59.7762791Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/common.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd 2025-05-07T20:02:59.7764276Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/inference.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd 2025-05-07T20:02:59.7765597Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/training.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd 2025-05-07T20:02:59.7766949Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/utils/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/utils 2025-05-07T20:02:59.7768341Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/utils/common.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/utils 2025-05-07T20:02:59.7769687Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/utils/offsets.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/utils 2025-05-07T20:02:59.7771064Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/utils/quantize.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/utils 2025-05-07T20:02:59.7772457Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/utils/requests.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/utils 2025-05-07T20:02:59.7773785Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/stats/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/stats 2025-05-07T20:02:59.7775206Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/stats/bench_params_reporter.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/stats 2025-05-07T20:02:59.7776662Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/utils/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:02:59.7778351Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:02:59.7779846Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/jagged/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton/jagged 2025-05-07T20:02:59.7781343Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton/jagged 2025-05-07T20:02:59.7791772Z INFO:skbuild:copied 90 files 2025-05-07T20:02:59.7792553Z INFO:root:running build_ext 2025-05-07T20:02:59.7794968Z INFO:root:installing to _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel 2025-05-07T20:02:59.7796333Z INFO:root:running install 2025-05-07T20:02:59.7852255Z INFO:root:running install_lib 2025-05-07T20:02:59.7853613Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel 2025-05-07T20:02:59.7855520Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu 2025-05-07T20:02:59.7856874Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/config 2025-05-07T20:02:59.7858004Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/config/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/config 2025-05-07T20:02:59.7859715Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/config/feature_list.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/config 2025-05-07T20:02:59.7860850Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/docs 2025-05-07T20:02:59.7861911Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:02:59.7863358Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs/common.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:02:59.7864833Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs/examples.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:02:59.7866422Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs/jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:02:59.7867997Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs/merge_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:02:59.7869610Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs/permute_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:02:59.7871281Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs/quantize_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:02:59.7873027Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs/sparse_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:02:59.7874577Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs/version.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:02:59.7875703Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/quantize 2025-05-07T20:02:59.7876878Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/quantize/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/quantize 2025-05-07T20:02:59.7878455Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/quantize/quantize_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/quantize 2025-05-07T20:02:59.7879606Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/sll 2025-05-07T20:02:59.7880446Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/sll/cpu 2025-05-07T20:02:59.7881606Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/cpu/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/cpu 2025-05-07T20:02:59.7883143Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/cpu/cpu_sll.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/cpu 2025-05-07T20:02:59.7884418Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/sll/meta 2025-05-07T20:02:59.7885559Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/meta/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/meta 2025-05-07T20:02:59.7887089Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/meta/meta_sll.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/meta 2025-05-07T20:02:59.7888242Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/sll/triton 2025-05-07T20:02:59.7889407Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:02:59.7891307Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton/common.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:02:59.7893033Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:02:59.7894933Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:02:59.7896651Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton/triton_jagged_bmm.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:02:59.7898384Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:02:59.7900187Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:02:59.7902056Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:02:59.7904036Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:02:59.7905815Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:02:59.7907620Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:02:59.7909449Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton/triton_jagged_softmax.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:02:59.7911259Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:02:59.7913117Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll 2025-05-07T20:02:59.7914239Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe 2025-05-07T20:02:59.7914997Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/bench 2025-05-07T20:02:59.7916184Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:02:59.7917766Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench/bench_config.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:02:59.7919377Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench/bench_runs.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:02:59.7920972Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench/eeg_cli.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:02:59.7922683Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench/embedding_ops_common_config.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:02:59.7924491Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench/eval_compression.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:02:59.7926067Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench/reporter.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:02:59.7927621Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench/tbe_data_config.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:02:59.7929239Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench/tbe_data_config_loader.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:02:59.7930902Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:02:59.7932486Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench/utils.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:02:59.7933628Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/cache 2025-05-07T20:02:59.7934767Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/cache/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/cache 2025-05-07T20:02:59.7936424Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/cache 2025-05-07T20:02:59.7937654Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/ssd 2025-05-07T20:02:59.7938418Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:02:59.7939605Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd/utils/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:02:59.7941339Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:02:59.7943001Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd 2025-05-07T20:02:59.7944480Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd/common.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd 2025-05-07T20:02:59.7945995Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd/inference.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd 2025-05-07T20:02:59.7947520Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd/training.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd 2025-05-07T20:02:59.7948686Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/utils 2025-05-07T20:02:59.7949832Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/utils/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/utils 2025-05-07T20:02:59.7951420Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/utils/common.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/utils 2025-05-07T20:02:59.7953164Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/utils/offsets.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/utils 2025-05-07T20:02:59.7954758Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/utils/quantize.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/utils 2025-05-07T20:02:59.7956360Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/utils/requests.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/utils 2025-05-07T20:02:59.7957534Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/stats 2025-05-07T20:02:59.7958710Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/stats/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/stats 2025-05-07T20:02:59.7960339Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/stats/bench_params_reporter.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/stats 2025-05-07T20:02:59.7961903Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe 2025-05-07T20:02:59.7963090Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/triton 2025-05-07T20:02:59.7963973Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/triton/jagged 2025-05-07T20:02:59.7965168Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton/jagged/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton/jagged 2025-05-07T20:02:59.7966834Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton/jagged 2025-05-07T20:02:59.7968456Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton 2025-05-07T20:02:59.7969964Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton/common.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton 2025-05-07T20:02:59.7971460Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton/quantize.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton 2025-05-07T20:02:59.7972969Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton/quantize_ref.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton 2025-05-07T20:02:59.7974072Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/utils 2025-05-07T20:02:59.7975207Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/utils/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/utils 2025-05-07T20:02:59.7976662Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/utils/filestore.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/utils 2025-05-07T20:02:59.7978303Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/utils/loader.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/utils 2025-05-07T20:02:59.7979787Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/utils/torch_library.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/utils 2025-05-07T20:02:59.7981201Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/asmjit.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:02:59.7982558Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:02:59.7985629Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm_gpu_tbe_cache.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:02:59.8057833Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm_gpu_tbe_inference.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:02:59.9397435Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm_gpu_config.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:02:59.9401917Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm_gpu_tbe_utils.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:02:59.9453238Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm_gpu_sparse_async_cumsum.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:02:59.9456926Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm_gpu_tbe_common.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:02:59.9472457Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm_gpu_tbe_optimizers.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:02:59.9500573Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm_gpu_tbe_training_forward.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:00.0706655Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_pt2.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:00.0754803Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm_gpu_tbe_training_backward.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:00.3860884Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_gwd.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:00.4342236Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_vbe.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:00.5429981Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_dense.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:00.5617495Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_split_host.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:00.5639718Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm_gpu_tbe_index_select.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:00.5753184Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:00.5755746Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adagrad.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:00.5757855Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:00.5759946Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_sgd.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:00.5761972Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adam.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:00.5764248Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lamb.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:00.5766294Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lars_sgd.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:00.5768422Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_adam.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:00.5770709Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_lamb.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:00.5772809Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_none.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:00.5774921Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_with_counter.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:00.5777100Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_ssd.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:00.5779227Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:00.5781224Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:00.5783367Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args_ssd.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:00.5784805Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:03:00.5786289Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_optimizer_codegen/split_embedding_optimizer_rowwise_adagrad.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:03:00.5788261Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_optimizer_codegen/optimizer_args.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:03:00.5789942Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm_gpu_embedding_inplace_ops.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:00.5792036Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm_gpu_py.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:00.6024874Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:00.6027859Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/batched_unary_embeddings_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:00.6029301Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/enums.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:00.6030862Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/metrics.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:00.6032684Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/permute_pooled_embedding_modules.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:00.6034319Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/permute_pooled_embedding_modules_split.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:00.6035859Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/quantize_comm.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:00.6037312Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/quantize_utils.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:00.6038965Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/runtime_monitor.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:00.6040393Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sparse_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:00.6041840Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_configs.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:00.6043393Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_inference_converter.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:00.6044948Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_optimizer_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:00.6046466Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_utils.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:00.6047991Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:00.6049652Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_table_batched_embeddings_ops_common.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:00.6051207Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_table_batched_embeddings_ops_inference.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:00.6052873Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_table_batched_embeddings_ops_training.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:00.6054467Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:00.6056043Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:00.6057507Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe_input_multiplexer.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:00.6058881Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/uvm.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:00.6059655Z INFO:skbuild:copied 125 files 2025-05-07T20:03:00.6059942Z INFO:root:running install_egg_info 2025-05-07T20:03:00.6096396Z INFO:root:running egg_info 2025-05-07T20:03:00.6134955Z INFO:root:writing fbgemm_gpu_nightly.egg-info/PKG-INFO 2025-05-07T20:03:00.6136622Z INFO:root:writing dependency_links to fbgemm_gpu_nightly.egg-info/dependency_links.txt 2025-05-07T20:03:00.6137388Z INFO:root:writing requirements to fbgemm_gpu_nightly.egg-info/requires.txt 2025-05-07T20:03:00.6138024Z INFO:root:writing top-level names to fbgemm_gpu_nightly.egg-info/top_level.txt 2025-05-07T20:03:00.6230816Z INFO:root:reading manifest file 'fbgemm_gpu_nightly.egg-info/SOURCES.txt' 2025-05-07T20:03:00.6261505Z INFO:root:writing manifest file 'fbgemm_gpu_nightly.egg-info/SOURCES.txt' 2025-05-07T20:03:00.6264215Z INFO:root:Copying fbgemm_gpu_nightly.egg-info to _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu_nightly-2025.5.7-py3.13.egg-info 2025-05-07T20:03:00.6266648Z INFO:root:running install_scripts 2025-05-07T20:03:00.6267506Z INFO:skbuild:copied 0 files 2025-05-07T20:03:03.2678223Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu_nightly-2025.5.7.dist-info/WHEEL 2025-05-07T20:03:03.2682173Z INFO:wheel:creating '/__w/FBGEMM/FBGEMM/fbgemm_gpu/dist/.tmp-oam94gw2/fbgemm_gpu_nightly-2025.5.7-cp313-cp313-manylinux_2_28_x86_64.whl' and adding '_skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel' to it 2025-05-07T20:03:03.2683916Z INFO:wheel:adding 'fbgemm_gpu/__init__.py' 2025-05-07T20:03:03.2954255Z INFO:wheel:adding 'fbgemm_gpu/asmjit.so' 2025-05-07T20:03:03.2969732Z INFO:wheel:adding 'fbgemm_gpu/batched_unary_embeddings_ops.py' 2025-05-07T20:03:03.2971017Z INFO:wheel:adding 'fbgemm_gpu/enums.py' 2025-05-07T20:03:03.4642608Z INFO:wheel:adding 'fbgemm_gpu/fbgemm.so' 2025-05-07T20:03:03.4777802Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_config.so' 2025-05-07T20:03:03.4891946Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_embedding_inplace_ops.so' 2025-05-07T20:03:04.5233389Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_py.so' 2025-05-07T20:03:04.6375392Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_sparse_async_cumsum.so' 2025-05-07T20:03:05.0108278Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_cache.so' 2025-05-07T20:03:05.0782320Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_common.so' 2025-05-07T20:03:05.3993590Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_index_select.so' 2025-05-07T20:03:14.0186034Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_inference.so' 2025-05-07T20:03:14.6284994Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_optimizers.so' 2025-05-07T20:03:29.4386700Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward.so' 2025-05-07T20:03:31.0255685Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward_dense.so' 2025-05-07T20:03:33.0515105Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward_gwd.so' 2025-05-07T20:03:33.5133706Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward_pt2.so' 2025-05-07T20:03:33.6923513Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward_split_host.so' 2025-05-07T20:03:38.4481782Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward_vbe.so' 2025-05-07T20:03:44.6424516Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_forward.so' 2025-05-07T20:03:45.4355179Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_utils.so' 2025-05-07T20:03:45.4533423Z INFO:wheel:adding 'fbgemm_gpu/metrics.py' 2025-05-07T20:03:45.4537989Z INFO:wheel:adding 'fbgemm_gpu/permute_pooled_embedding_modules.py' 2025-05-07T20:03:45.4538843Z INFO:wheel:adding 'fbgemm_gpu/permute_pooled_embedding_modules_split.py' 2025-05-07T20:03:45.4539493Z INFO:wheel:adding 'fbgemm_gpu/quantize_comm.py' 2025-05-07T20:03:45.4540360Z INFO:wheel:adding 'fbgemm_gpu/quantize_utils.py' 2025-05-07T20:03:45.4543337Z INFO:wheel:adding 'fbgemm_gpu/runtime_monitor.py' 2025-05-07T20:03:45.4554711Z INFO:wheel:adding 'fbgemm_gpu/sparse_ops.py' 2025-05-07T20:03:45.4558210Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_configs.py' 2025-05-07T20:03:45.4560372Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_inference_converter.py' 2025-05-07T20:03:45.4561779Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_optimizer_ops.py' 2025-05-07T20:03:45.4563094Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_utils.py' 2025-05-07T20:03:45.4564776Z INFO:wheel:adding 'fbgemm_gpu/split_table_batched_embeddings_ops.py' 2025-05-07T20:03:45.4568566Z INFO:wheel:adding 'fbgemm_gpu/split_table_batched_embeddings_ops_common.py' 2025-05-07T20:03:45.4589949Z INFO:wheel:adding 'fbgemm_gpu/split_table_batched_embeddings_ops_inference.py' 2025-05-07T20:03:45.4634070Z INFO:wheel:adding 'fbgemm_gpu/split_table_batched_embeddings_ops_training.py' 2025-05-07T20:03:45.4637058Z INFO:wheel:adding 'fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py' 2025-05-07T20:03:45.4638824Z INFO:wheel:adding 'fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py' 2025-05-07T20:03:45.4639688Z INFO:wheel:adding 'fbgemm_gpu/tbe_input_multiplexer.py' 2025-05-07T20:03:45.4641291Z INFO:wheel:adding 'fbgemm_gpu/uvm.py' 2025-05-07T20:03:45.4643674Z INFO:wheel:adding 'fbgemm_gpu/config/__init__.py' 2025-05-07T20:03:45.4645436Z INFO:wheel:adding 'fbgemm_gpu/config/feature_list.py' 2025-05-07T20:03:45.4646709Z INFO:wheel:adding 'fbgemm_gpu/docs/__init__.py' 2025-05-07T20:03:45.4647762Z INFO:wheel:adding 'fbgemm_gpu/docs/common.py' 2025-05-07T20:03:45.4649205Z INFO:wheel:adding 'fbgemm_gpu/docs/examples.py' 2025-05-07T20:03:45.4651733Z INFO:wheel:adding 'fbgemm_gpu/docs/jagged_tensor_ops.py' 2025-05-07T20:03:45.4653249Z INFO:wheel:adding 'fbgemm_gpu/docs/merge_pooled_embedding_ops.py' 2025-05-07T20:03:45.4655945Z INFO:wheel:adding 'fbgemm_gpu/docs/permute_pooled_embedding_ops.py' 2025-05-07T20:03:45.4657249Z INFO:wheel:adding 'fbgemm_gpu/docs/quantize_ops.py' 2025-05-07T20:03:45.4662841Z INFO:wheel:adding 'fbgemm_gpu/docs/sparse_ops.py' 2025-05-07T20:03:45.4664358Z INFO:wheel:adding 'fbgemm_gpu/docs/version.py' 2025-05-07T20:03:45.4665902Z INFO:wheel:adding 'fbgemm_gpu/quantize/__init__.py' 2025-05-07T20:03:45.4667680Z INFO:wheel:adding 'fbgemm_gpu/quantize/quantize_ops.py' 2025-05-07T20:03:45.4669023Z INFO:wheel:adding 'fbgemm_gpu/sll/__init__.py' 2025-05-07T20:03:45.4671079Z INFO:wheel:adding 'fbgemm_gpu/sll/cpu/__init__.py' 2025-05-07T20:03:45.4678214Z INFO:wheel:adding 'fbgemm_gpu/sll/cpu/cpu_sll.py' 2025-05-07T20:03:45.4680368Z INFO:wheel:adding 'fbgemm_gpu/sll/meta/__init__.py' 2025-05-07T20:03:45.4682539Z INFO:wheel:adding 'fbgemm_gpu/sll/meta/meta_sll.py' 2025-05-07T20:03:45.4684783Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/__init__.py' 2025-05-07T20:03:45.4686178Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/common.py' 2025-05-07T20:03:45.4688262Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py' 2025-05-07T20:03:45.4690468Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py' 2025-05-07T20:03:45.4694305Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_bmm.py' 2025-05-07T20:03:45.4699105Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py' 2025-05-07T20:03:45.4699701Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py' 2025-05-07T20:03:45.4701722Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py' 2025-05-07T20:03:45.4707708Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py' 2025-05-07T20:03:45.4712994Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py' 2025-05-07T20:03:45.4714892Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py' 2025-05-07T20:03:45.4718350Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_softmax.py' 2025-05-07T20:03:45.4723859Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py' 2025-05-07T20:03:45.4726134Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/__init__.py' 2025-05-07T20:03:45.4728975Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adagrad.py' 2025-05-07T20:03:45.4731941Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adam.py' 2025-05-07T20:03:45.4733901Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args.py' 2025-05-07T20:03:45.4736506Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args_ssd.py' 2025-05-07T20:03:45.4739000Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lamb.py' 2025-05-07T20:03:45.4741983Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lars_sgd.py' 2025-05-07T20:03:45.4744714Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_none.py' 2025-05-07T20:03:45.4747822Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_adam.py' 2025-05-07T20:03:45.4750175Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_lamb.py' 2025-05-07T20:03:45.4753201Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad.py' 2025-05-07T20:03:45.4756255Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_ssd.py' 2025-05-07T20:03:45.4759505Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_with_counter.py' 2025-05-07T20:03:45.4762224Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_sgd.py' 2025-05-07T20:03:45.4764113Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_optimizer_codegen/optimizer_args.py' 2025-05-07T20:03:45.4767158Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_optimizer_codegen/split_embedding_optimizer_rowwise_adagrad.py' 2025-05-07T20:03:45.4768957Z INFO:wheel:adding 'fbgemm_gpu/tbe/__init__.py' 2025-05-07T20:03:45.4770049Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/__init__.py' 2025-05-07T20:03:45.4771732Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/bench_config.py' 2025-05-07T20:03:45.4776655Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/bench_runs.py' 2025-05-07T20:03:45.4778632Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/eeg_cli.py' 2025-05-07T20:03:45.4781386Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/embedding_ops_common_config.py' 2025-05-07T20:03:45.4783538Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/eval_compression.py' 2025-05-07T20:03:45.4784840Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/reporter.py' 2025-05-07T20:03:45.4787299Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/tbe_data_config.py' 2025-05-07T20:03:45.4789518Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/tbe_data_config_loader.py' 2025-05-07T20:03:45.4792348Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py' 2025-05-07T20:03:45.4793818Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/utils.py' 2025-05-07T20:03:45.4795253Z INFO:wheel:adding 'fbgemm_gpu/tbe/cache/__init__.py' 2025-05-07T20:03:45.4796759Z INFO:wheel:adding 'fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py' 2025-05-07T20:03:45.4798372Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/__init__.py' 2025-05-07T20:03:45.4799213Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/common.py' 2025-05-07T20:03:45.4804986Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/inference.py' 2025-05-07T20:03:45.4830508Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/training.py' 2025-05-07T20:03:45.4833172Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/utils/__init__.py' 2025-05-07T20:03:45.4836606Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py' 2025-05-07T20:03:45.4838029Z INFO:wheel:adding 'fbgemm_gpu/tbe/stats/__init__.py' 2025-05-07T20:03:45.4840080Z INFO:wheel:adding 'fbgemm_gpu/tbe/stats/bench_params_reporter.py' 2025-05-07T20:03:45.4841332Z INFO:wheel:adding 'fbgemm_gpu/tbe/utils/__init__.py' 2025-05-07T20:03:45.4842752Z INFO:wheel:adding 'fbgemm_gpu/tbe/utils/common.py' 2025-05-07T20:03:45.4844444Z INFO:wheel:adding 'fbgemm_gpu/tbe/utils/offsets.py' 2025-05-07T20:03:45.4846555Z INFO:wheel:adding 'fbgemm_gpu/tbe/utils/quantize.py' 2025-05-07T20:03:45.4852420Z INFO:wheel:adding 'fbgemm_gpu/tbe/utils/requests.py' 2025-05-07T20:03:45.4854061Z INFO:wheel:adding 'fbgemm_gpu/triton/__init__.py' 2025-05-07T20:03:45.4855768Z INFO:wheel:adding 'fbgemm_gpu/triton/common.py' 2025-05-07T20:03:45.4863474Z INFO:wheel:adding 'fbgemm_gpu/triton/quantize.py' 2025-05-07T20:03:45.4867791Z INFO:wheel:adding 'fbgemm_gpu/triton/quantize_ref.py' 2025-05-07T20:03:45.4868956Z INFO:wheel:adding 'fbgemm_gpu/triton/jagged/__init__.py' 2025-05-07T20:03:45.4877599Z INFO:wheel:adding 'fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py' 2025-05-07T20:03:45.4879277Z INFO:wheel:adding 'fbgemm_gpu/utils/__init__.py' 2025-05-07T20:03:45.4881055Z INFO:wheel:adding 'fbgemm_gpu/utils/filestore.py' 2025-05-07T20:03:45.4882620Z INFO:wheel:adding 'fbgemm_gpu/utils/loader.py' 2025-05-07T20:03:45.4884645Z INFO:wheel:adding 'fbgemm_gpu/utils/torch_library.py' 2025-05-07T20:03:45.4887578Z INFO:wheel:adding 'fbgemm_gpu_nightly-2025.5.7.dist-info/METADATA' 2025-05-07T20:03:45.4914462Z INFO:wheel:adding 'fbgemm_gpu_nightly-2025.5.7.dist-info/WHEEL' 2025-05-07T20:03:45.4915293Z INFO:wheel:adding 'fbgemm_gpu_nightly-2025.5.7.dist-info/top_level.txt' 2025-05-07T20:03:45.4921249Z INFO:wheel:adding 'fbgemm_gpu_nightly-2025.5.7.dist-info/RECORD' 2025-05-07T20:03:45.4923482Z INFO:root:removing _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel 2025-05-07T20:03:45.7763921Z ╒════════════════════════════╤════════════════════════════════════════════════╕ 2025-05-07T20:03:45.7765430Z │ │ Version │ 2025-05-07T20:03:45.7766475Z ╞════════════════════════════╪════════════════════════════════════════════════╡ 2025-05-07T20:03:45.7766984Z │ PyTorch │ 2.8.0.dev20250507+cu118 │ 2025-05-07T20:03:45.7767523Z ├────────────────────────────┼────────────────────────────────────────────────┤ 2025-05-07T20:03:45.7768041Z │ CUDA (Declared by PyTorch) │ 11.8 │ 2025-05-07T20:03:45.7768807Z ├────────────────────────────┼────────────────────────────────────────────────┤ 2025-05-07T20:03:45.7769317Z │ CUDA (Actual) │ nvcc: NVIDIA (R) Cuda compiler driver │ 2025-05-07T20:03:45.7769850Z │ │ Copyright (c) 2005-2022 NVIDIA Corporation │ 2025-05-07T20:03:45.7770434Z │ │ Built on Wed_Sep_21_10:33:58_PDT_2022 │ 2025-05-07T20:03:45.7770904Z │ │ Cuda compilation tools, release 11.8, V11.8.89 │ 2025-05-07T20:03:45.7771364Z │ │ Build cuda_11.8.r11.8/compiler.31833905_0 │ 2025-05-07T20:03:45.7771885Z ╘════════════════════════════╧════════════════════════════════════════════════╛ 2025-05-07T20:03:46.0260013Z Successfully built fbgemm_gpu_nightly-2025.5.7-cp313-cp313-manylinux_2_28_x86_64.whl 2025-05-07T20:03:46.1008009Z 2025-05-07T20:03:46.1161785Z ################################################################################ 2025-05-07T20:03:46.1163631Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so 2025-05-07T20:03:46.1164886Z [CHECK] Listing out library size: 2025-05-07T20:03:46.1166022Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so 2025-05-07T20:03:46.1166926Z 2025-05-07T20:03:46.1173884Z 1 ./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so 2025-05-07T20:03:46.1174686Z 2025-05-07T20:03:46.1176208Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so 2025-05-07T20:03:46.1177144Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:46.1179617Z 2025-05-07T20:03:46.1242007Z GLIBC_2.2.5 2025-05-07T20:03:46.1242666Z GLIBC_2.14 2025-05-07T20:03:46.1243349Z 2025-05-07T20:03:46.1243363Z 2025-05-07T20:03:46.1244378Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so 2025-05-07T20:03:46.1246201Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:46.1246737Z 2025-05-07T20:03:46.1309206Z 2025-05-07T20:03:46.1309225Z 2025-05-07T20:03:46.1336830Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so > /tmp/tmp.lF8QTUCWNI.symbols.txt 2025-05-07T20:03:46.1337274Z 2025-05-07T20:03:46.1369738Z 2025-05-07T20:03:46.1404067Z [CHECK] Total Number of symbols: 803 2025-05-07T20:03:46.1420774Z [CHECK] Number of fbgemm symbols: 0 2025-05-07T20:03:46.1436748Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so > /tmp/tmp.v55bZkxcfE.usymbols.txt 2025-05-07T20:03:46.1437174Z 2025-05-07T20:03:46.1462144Z 2025-05-07T20:03:46.1492353Z [CHECK] Listing out undefined symbols (49 total): 2025-05-07T20:03:46.1510704Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:46.1511213Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:03:46.1511707Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:03:46.1512051Z U __errno_location@GLIBC_2.2.5 2025-05-07T20:03:46.1512444Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:46.1512756Z U __popcountdi2@GCC_3.4 2025-05-07T20:03:46.1513061Z U abort@GLIBC_2.2.5 2025-05-07T20:03:46.1513333Z U close@GLIBC_2.2.5 2025-05-07T20:03:46.1513618Z U fputs@GLIBC_2.2.5 2025-05-07T20:03:46.1513885Z U free@GLIBC_2.2.5 2025-05-07T20:03:46.1514179Z U ftruncate64@GLIBC_2.2.5 2025-05-07T20:03:46.1514474Z U fwrite@GLIBC_2.2.5 2025-05-07T20:03:46.1514766Z U getenv@GLIBC_2.2.5 2025-05-07T20:03:46.1515071Z U getpagesize@GLIBC_2.2.5 2025-05-07T20:03:46.1515370Z U madvise@GLIBC_2.2.5 2025-05-07T20:03:46.1515665Z U malloc@GLIBC_2.2.5 2025-05-07T20:03:46.1515943Z U memcmp@GLIBC_2.2.5 2025-05-07T20:03:46.1516234Z U memcpy@GLIBC_2.14 2025-05-07T20:03:46.1516511Z U memmove@GLIBC_2.2.5 2025-05-07T20:03:46.1516811Z U memset@GLIBC_2.2.5 2025-05-07T20:03:46.1517089Z U mmap@GLIBC_2.2.5 2025-05-07T20:03:46.1517381Z U mprotect@GLIBC_2.2.5 2025-05-07T20:03:46.1517681Z U munmap@GLIBC_2.2.5 2025-05-07T20:03:46.1517957Z U open64@GLIBC_2.2.5 2025-05-07T20:03:46.1518308Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:03:46.1518698Z U pthread_mutex_destroy@GLIBC_2.2.5 2025-05-07T20:03:46.1519053Z U pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:03:46.1519382Z U pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:03:46.1519717Z U read@GLIBC_2.2.5 2025-05-07T20:03:46.1519997Z U realloc@GLIBC_2.2.5 2025-05-07T20:03:46.1520294Z U shm_open 2025-05-07T20:03:46.1520559Z U shm_unlink 2025-05-07T20:03:46.1521044Z U snprintf@GLIBC_2.2.5 2025-05-07T20:03:46.1521366Z U stderr@GLIBC_2.2.5 2025-05-07T20:03:46.1521652Z U strcmp@GLIBC_2.2.5 2025-05-07T20:03:46.1521954Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:46.1522234Z U strtol@GLIBC_2.2.5 2025-05-07T20:03:46.1522537Z U syscall@GLIBC_2.2.5 2025-05-07T20:03:46.1522823Z U sysconf@GLIBC_2.2.5 2025-05-07T20:03:46.1523124Z U uname@GLIBC_2.2.5 2025-05-07T20:03:46.1523415Z U unlink@GLIBC_2.2.5 2025-05-07T20:03:46.1523700Z U vsnprintf@GLIBC_2.2.5 2025-05-07T20:03:46.1524090Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:46.1524681Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:46.1525099Z U vtable for __cxxabiv1::__vmi_class_type_info@CXXABI_1.3 2025-05-07T20:03:46.1525455Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:46.1525775Z w _ITM_registerTMCloneTable 2025-05-07T20:03:46.1526083Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:46.1526360Z w __gmon_start__ 2025-05-07T20:03:46.1526680Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:46.1527050Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so 2025-05-07T20:03:46.1527299Z 2025-05-07T20:03:46.1555920Z linux-vdso.so.1 (0x00007ffdca7dd000) 2025-05-07T20:03:46.1557364Z libtorch_cpu.so => not found 2025-05-07T20:03:46.1557756Z libtorch_cuda.so => not found 2025-05-07T20:03:46.1558214Z libtorch.so => not found 2025-05-07T20:03:46.1558669Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f36de1fd000) 2025-05-07T20:03:46.1559098Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f36de1cf000) 2025-05-07T20:03:46.1559495Z libc.so.6 => /lib64/libc.so.6 (0x00007f36ddfc5000) 2025-05-07T20:03:46.1559848Z libm.so.6 => /lib64/libm.so.6 (0x00007f36ddeea000) 2025-05-07T20:03:46.1560224Z /lib64/ld-linux-x86-64.so.2 (0x00007f36de4e0000) 2025-05-07T20:03:46.1562207Z 2025-05-07T20:03:46.1562345Z [CHECK] Displaying ELF information: 2025-05-07T20:03:46.1562789Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so 2025-05-07T20:03:46.1563351Z 2025-05-07T20:03:46.1597626Z 2025-05-07T20:03:46.1598453Z Dynamic section at offset 0x78e78 contains 33 entries: 2025-05-07T20:03:46.1599598Z Tag Type Name/Value 2025-05-07T20:03:46.1600824Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:46.1601830Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:46.1602357Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:46.1602885Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:46.1603409Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:46.1603929Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:46.1604429Z 0x000000000000000e (SONAME) Library soname: [asmjit.so] 2025-05-07T20:03:46.1604837Z 0x000000000000000c (INIT) 0x1a000 2025-05-07T20:03:46.1605180Z 0x000000000000000d (FINI) 0x5af0c 2025-05-07T20:03:46.1605504Z 0x0000000000000019 (INIT_ARRAY) 0x780a0 2025-05-07T20:03:46.1605891Z 0x000000000000001b (INIT_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:46.1606247Z 0x000000000000001a (FINI_ARRAY) 0x780a8 2025-05-07T20:03:46.1606583Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:46.1606929Z 0x0000000000000004 (HASH) 0x200 2025-05-07T20:03:46.1607253Z 0x000000006ffffef5 (GNU_HASH) 0x1e18 2025-05-07T20:03:46.1607596Z 0x0000000000000005 (STRTAB) 0x86e0 2025-05-07T20:03:46.1607917Z 0x0000000000000006 (SYMTAB) 0x3b80 2025-05-07T20:03:46.1608274Z 0x000000000000000a (STRSZ) 45342 (bytes) 2025-05-07T20:03:46.1608852Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:46.1609225Z 0x0000000000000003 (PLTGOT) 0x790d8 2025-05-07T20:03:46.1609594Z 0x0000000000000002 (PLTRELSZ) 8064 (bytes) 2025-05-07T20:03:46.1609936Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:46.1610272Z 0x0000000000000017 (JMPREL) 0x17220 2025-05-07T20:03:46.1610594Z 0x0000000000000007 (RELA) 0x13ed8 2025-05-07T20:03:46.1610948Z 0x0000000000000008 (RELASZ) 13128 (bytes) 2025-05-07T20:03:46.1611303Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:46.1611639Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:46.1612010Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:46.1612420Z 0x000000006ffffffe (VERNEED) 0x13e48 2025-05-07T20:03:46.1612765Z 0x000000006fffffff (VERNEEDNUM) 3 2025-05-07T20:03:46.1613086Z 0x000000006ffffff0 (VERSYM) 0x137fe 2025-05-07T20:03:46.1613429Z 0x000000006ffffff9 (RELACOUNT) 3 2025-05-07T20:03:46.1613732Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:46.1613966Z 2025-05-07T20:03:46.1614084Z ################################################################################ 2025-05-07T20:03:46.1614312Z 2025-05-07T20:03:46.1614316Z 2025-05-07T20:03:46.1614448Z ################################################################################ 2025-05-07T20:03:46.1614920Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:03:46.1615397Z [CHECK] Listing out library size: 2025-05-07T20:03:46.1615827Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:03:46.1616197Z 2025-05-07T20:03:46.1620411Z 1 ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:03:46.1620700Z 2025-05-07T20:03:46.1621083Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:03:46.1622019Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:46.1622607Z 2025-05-07T20:03:46.1675040Z GLIBC_2.2.5 2025-05-07T20:03:46.1675682Z GLIBC_2.14 2025-05-07T20:03:46.1682959Z 2025-05-07T20:03:46.1683082Z 2025-05-07T20:03:46.1683680Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:03:46.1684704Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:46.1685314Z 2025-05-07T20:03:46.1742912Z GLIBCXX_3.4 2025-05-07T20:03:46.1743310Z GLIBCXX_3.4.9 2025-05-07T20:03:46.1743606Z GLIBCXX_3.4.21 2025-05-07T20:03:46.1743753Z 2025-05-07T20:03:46.1743830Z 2025-05-07T20:03:46.1769680Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so > /tmp/tmp.hbUOhjklii.symbols.txt 2025-05-07T20:03:46.1770186Z 2025-05-07T20:03:46.1788257Z 2025-05-07T20:03:46.1819505Z [CHECK] Total Number of symbols: 108 2025-05-07T20:03:46.1833017Z [CHECK] Number of fbgemm symbols: 4 2025-05-07T20:03:46.1853198Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so > /tmp/tmp.7lnEW7XuKE.usymbols.txt 2025-05-07T20:03:46.1853700Z 2025-05-07T20:03:46.1871007Z 2025-05-07T20:03:46.1895952Z [CHECK] Listing out undefined symbols (61 total): 2025-05-07T20:03:46.1913878Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:46.1914465Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:46.1914787Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:03:46.1915114Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:46.1915446Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:03:46.1915757Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:03:46.1916301Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:03:46.1916623Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:03:46.1916947Z U __errno_location@GLIBC_2.2.5 2025-05-07T20:03:46.1917268Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:46.1917600Z U c10::BoolType::get() 2025-05-07T20:03:46.1917909Z U c10::StringType::get() 2025-05-07T20:03:46.1918233Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:03:46.1918995Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:46.1920351Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:46.1921130Z U getenv@GLIBC_2.2.5 2025-05-07T20:03:46.1921437Z U memcmp@GLIBC_2.2.5 2025-05-07T20:03:46.1921716Z U memcpy@GLIBC_2.14 2025-05-07T20:03:46.1922013Z U memset@GLIBC_2.2.5 2025-05-07T20:03:46.1922347Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:03:46.1922742Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:46.1923160Z U std::_Rb_tree_decrement(std::_Rb_tree_node_base*)@GLIBCXX_3.4 2025-05-07T20:03:46.1923828Z U std::_Rb_tree_insert_and_rebalance(bool, std::_Rb_tree_node_base*, std::_Rb_tree_node_base*, std::_Rb_tree_node_base&)@GLIBCXX_3.4 2025-05-07T20:03:46.1924730Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:46.1925638Z U std::__cxx11::basic_string, std::allocator >::_M_append(char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:46.1926910Z U std::__cxx11::basic_string, std::allocator >::_M_assign(std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:46.1927871Z U std::__cxx11::basic_string, std::allocator >::_M_construct(unsigned long, char)@GLIBCXX_3.4.21 2025-05-07T20:03:46.1928713Z U std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:46.1929660Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:46.1930574Z U std::__cxx11::basic_string, std::allocator >::reserve(unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:46.1931157Z U std::__throw_invalid_argument(char const*)@GLIBCXX_3.4 2025-05-07T20:03:46.1931533Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:46.1931903Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:46.1932255Z U std::__throw_out_of_range(char const*)@GLIBCXX_3.4 2025-05-07T20:03:46.1932812Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:03:46.1933778Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:03:46.1934520Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:03:46.1934854Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:03:46.1935175Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:03:46.1935571Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:03:46.1935900Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:03:46.1936206Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:03:46.1936513Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:46.1936776Z U strtol@GLIBC_2.2.5 2025-05-07T20:03:46.1937079Z U torch::CppFunction::~CppFunction() 2025-05-07T20:03:46.1937848Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:03:46.1938988Z U torch::Library::_def(std::variant&&, torch::CppFunction&&, std::vector > const&) & 2025-05-07T20:03:46.1939958Z U torch::jit::parseSchemaOrName(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:03:46.1940572Z U typeinfo for std::invalid_argument@GLIBCXX_3.4 2025-05-07T20:03:46.1940943Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:46.1941345Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:03:46.1941737Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:46.1942319Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:46.1943060Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:46.1943650Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:03:46.1944160Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:03:46.1944581Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:46.1944892Z w _ITM_registerTMCloneTable 2025-05-07T20:03:46.1945178Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:46.1945464Z w __gmon_start__ 2025-05-07T20:03:46.1945729Z w __pthread_key_create 2025-05-07T20:03:46.1946193Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:46.1946611Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:03:46.1946885Z 2025-05-07T20:03:46.1966535Z linux-vdso.so.1 (0x00007fffbc3fc000) 2025-05-07T20:03:46.1967030Z libtorch.so => not found 2025-05-07T20:03:46.1967398Z libc10.so => not found 2025-05-07T20:03:46.1967654Z libtorch_cpu.so => not found 2025-05-07T20:03:46.1968027Z libtorch_cuda.so => not found 2025-05-07T20:03:46.1968362Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007fe4b6e0e000) 2025-05-07T20:03:46.1968794Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fe4b6dde000) 2025-05-07T20:03:46.1969178Z libc.so.6 => /lib64/libc.so.6 (0x00007fe4b6bd6000) 2025-05-07T20:03:46.1969587Z libm.so.6 => /lib64/libm.so.6 (0x00007fe4b6afb000) 2025-05-07T20:03:46.1969940Z /lib64/ld-linux-x86-64.so.2 (0x00007fe4b7081000) 2025-05-07T20:03:46.1970195Z 2025-05-07T20:03:46.1970307Z [CHECK] Displaying ELF information: 2025-05-07T20:03:46.1970727Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:03:46.1971051Z 2025-05-07T20:03:46.2001637Z 2025-05-07T20:03:46.2002566Z Dynamic section at offset 0x9af0 contains 34 entries: 2025-05-07T20:03:46.2002982Z Tag Type Name/Value 2025-05-07T20:03:46.2003445Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:46.2003955Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:03:46.2004474Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:46.2005251Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:46.2005815Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:46.2006350Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:46.2006851Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:46.2007393Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_config.so] 2025-05-07T20:03:46.2007849Z 0x000000000000000c (INIT) 0x4000 2025-05-07T20:03:46.2008207Z 0x000000000000000d (FINI) 0x799c 2025-05-07T20:03:46.2008544Z 0x0000000000000019 (INIT_ARRAY) 0x9a48 2025-05-07T20:03:46.2008970Z 0x000000000000001b (INIT_ARRAYSZ) 16 (bytes) 2025-05-07T20:03:46.2009397Z 0x000000000000001a (FINI_ARRAY) 0x9a58 2025-05-07T20:03:46.2009750Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:46.2010125Z 0x0000000000000004 (HASH) 0x200 2025-05-07T20:03:46.2010464Z 0x000000006ffffef5 (GNU_HASH) 0x708 2025-05-07T20:03:46.2010826Z 0x0000000000000005 (STRTAB) 0x1350 2025-05-07T20:03:46.2011157Z 0x0000000000000006 (SYMTAB) 0x918 2025-05-07T20:03:46.2011535Z 0x000000000000000a (STRSZ) 7049 (bytes) 2025-05-07T20:03:46.2011897Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:46.2012271Z 0x0000000000000003 (PLTGOT) 0x9d60 2025-05-07T20:03:46.2012657Z 0x0000000000000002 (PLTRELSZ) 1296 (bytes) 2025-05-07T20:03:46.2013009Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:46.2013357Z 0x0000000000000017 (JMPREL) 0x34e8 2025-05-07T20:03:46.2013689Z 0x0000000000000007 (RELA) 0x3068 2025-05-07T20:03:46.2014066Z 0x0000000000000008 (RELASZ) 1152 (bytes) 2025-05-07T20:03:46.2014433Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:46.2014791Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:46.2015136Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:46.2015520Z 0x000000006ffffffe (VERNEED) 0x2fb8 2025-05-07T20:03:46.2015881Z 0x000000006fffffff (VERNEEDNUM) 3 2025-05-07T20:03:46.2016216Z 0x000000006ffffff0 (VERSYM) 0x2eda 2025-05-07T20:03:46.2016579Z 0x000000006ffffff9 (RELACOUNT) 4 2025-05-07T20:03:46.2016894Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:46.2017145Z 2025-05-07T20:03:46.2017269Z ################################################################################ 2025-05-07T20:03:46.2017610Z 2025-05-07T20:03:46.2017614Z 2025-05-07T20:03:46.2017917Z ################################################################################ 2025-05-07T20:03:46.2018367Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so 2025-05-07T20:03:46.2018827Z [CHECK] Listing out library size: 2025-05-07T20:03:46.2019229Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so 2025-05-07T20:03:46.2019634Z 2025-05-07T20:03:46.2019793Z 6 ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so 2025-05-07T20:03:46.2023092Z 2025-05-07T20:03:46.2023453Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so 2025-05-07T20:03:46.2024318Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:46.2024872Z 2025-05-07T20:03:46.2285144Z GLIBC_2.2.5 2025-05-07T20:03:46.2285550Z GLIBC_2.3 2025-05-07T20:03:46.2285790Z GLIBC_2.14 2025-05-07T20:03:46.2285914Z 2025-05-07T20:03:46.2285922Z 2025-05-07T20:03:46.2286767Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so 2025-05-07T20:03:46.2287718Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:46.2288305Z 2025-05-07T20:03:46.2546901Z GLIBCXX_3.4 2025-05-07T20:03:46.2547415Z GLIBCXX_3.4.9 2025-05-07T20:03:46.2547691Z GLIBCXX_3.4.11 2025-05-07T20:03:46.2547927Z GLIBCXX_3.4.14 2025-05-07T20:03:46.2548149Z GLIBCXX_3.4.15 2025-05-07T20:03:46.2548358Z GLIBCXX_3.4.18 2025-05-07T20:03:46.2548551Z GLIBCXX_3.4.21 2025-05-07T20:03:46.2548671Z 2025-05-07T20:03:46.2548688Z 2025-05-07T20:03:46.2570334Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so > /tmp/tmp.e0AQhDPx76.symbols.txt 2025-05-07T20:03:46.2571565Z 2025-05-07T20:03:46.2791900Z 2025-05-07T20:03:46.2823249Z [CHECK] Total Number of symbols: 4823 2025-05-07T20:03:46.2841241Z [CHECK] Number of fbgemm symbols: 3365 2025-05-07T20:03:46.2858832Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so > /tmp/tmp.1leqBNXfES.usymbols.txt 2025-05-07T20:03:46.2859468Z 2025-05-07T20:03:46.2885759Z 2025-05-07T20:03:46.2911920Z [CHECK] Listing out undefined symbols (137 total): 2025-05-07T20:03:46.2933100Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:46.2933611Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:03:46.2934044Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:03:46.2934495Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:46.2934802Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:03:46.2935129Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:03:46.2935440Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:03:46.2935759Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:03:46.2936082Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:03:46.2936417Z U __cxa_init_primary_exception@CXXABI_1.3.11 2025-05-07T20:03:46.2936768Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:03:46.2937092Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:03:46.2937412Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:03:46.2937735Z U __cxa_throw_bad_array_new_length@CXXABI_1.3.8 2025-05-07T20:03:46.2938101Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:46.2938422Z U __once_proxy@GLIBCXX_3.4.11 2025-05-07T20:03:46.2938736Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:03:46.2939033Z U abort@GLIBC_2.2.5 2025-05-07T20:03:46.2939435Z U asmjit::_abi_1_13::BaseAssembler::bind(asmjit::_abi_1_13::Label const&) 2025-05-07T20:03:46.2939898Z U asmjit::_abi_1_13::BaseAssembler::newLabel() 2025-05-07T20:03:46.2940398Z U asmjit::_abi_1_13::BaseEmitter::_emitI(unsigned int, asmjit::_abi_1_13::Operand_ const&) 2025-05-07T20:03:46.2941154Z U asmjit::_abi_1_13::BaseEmitter::_emitI(unsigned int, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&) 2025-05-07T20:03:46.2942130Z U asmjit::_abi_1_13::BaseEmitter::_emitI(unsigned int, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&) 2025-05-07T20:03:46.2943303Z U asmjit::_abi_1_13::BaseEmitter::_emitI(unsigned int, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&) 2025-05-07T20:03:46.2944444Z U asmjit::_abi_1_13::BaseEmitter::emitArgsAssignment(asmjit::_abi_1_13::FuncFrame const&, asmjit::_abi_1_13::FuncArgsAssignment const&) 2025-05-07T20:03:46.2945230Z U asmjit::_abi_1_13::BaseEmitter::emitEpilog(asmjit::_abi_1_13::FuncFrame const&) 2025-05-07T20:03:46.2945808Z U asmjit::_abi_1_13::BaseEmitter::emitProlog(asmjit::_abi_1_13::FuncFrame const&) 2025-05-07T20:03:46.2946426Z U asmjit::_abi_1_13::CodeHolder::CodeHolder(asmjit::_abi_1_13::Support::Temporary const*) 2025-05-07T20:03:46.2947075Z U asmjit::_abi_1_13::CodeHolder::init(asmjit::_abi_1_13::Environment const&, unsigned long) 2025-05-07T20:03:46.2947576Z U asmjit::_abi_1_13::CodeHolder::~CodeHolder() 2025-05-07T20:03:46.2948280Z U asmjit::_abi_1_13::FuncArgsAssignment::updateFuncFrame(asmjit::_abi_1_13::FuncFrame&) const 2025-05-07T20:03:46.2949032Z U asmjit::_abi_1_13::FuncDetail::init(asmjit::_abi_1_13::FuncSignature const&, asmjit::_abi_1_13::Environment const&) 2025-05-07T20:03:46.2949724Z U asmjit::_abi_1_13::FuncFrame::finalize() 2025-05-07T20:03:46.2950189Z U asmjit::_abi_1_13::FuncFrame::init(asmjit::_abi_1_13::FuncDetail const&) 2025-05-07T20:03:46.2950791Z U asmjit::_abi_1_13::JitRuntime::JitRuntime(asmjit::_abi_1_13::JitAllocator::CreateParams const*) 2025-05-07T20:03:46.2951480Z U asmjit::_abi_1_13::JitRuntime::_add(void**, asmjit::_abi_1_13::CodeHolder*) 2025-05-07T20:03:46.2952942Z U asmjit::_abi_1_13::JitRuntime::~JitRuntime() 2025-05-07T20:03:46.2953387Z U asmjit::_abi_1_13::x86::Assembler::Assembler(asmjit::_abi_1_13::CodeHolder*) 2025-05-07T20:03:46.2953861Z U asmjit::_abi_1_13::x86::Assembler::~Assembler() 2025-05-07T20:03:46.2954213Z U cpuinfo_get_packages 2025-05-07T20:03:46.2954516Z U cpuinfo_get_packages_count 2025-05-07T20:03:46.2954830Z U cpuinfo_initialize 2025-05-07T20:03:46.2955104Z U cpuinfo_isa 2025-05-07T20:03:46.2955369Z U fma@GLIBC_2.2.5 2025-05-07T20:03:46.2955634Z U fmaf@GLIBC_2.2.5 2025-05-07T20:03:46.2955926Z U fminf@GLIBC_2.2.5 2025-05-07T20:03:46.2956194Z U free@GLIBC_2.2.5 2025-05-07T20:03:46.2956476Z U fwrite@GLIBC_2.2.5 2025-05-07T20:03:46.2956753Z U getenv@GLIBC_2.2.5 2025-05-07T20:03:46.2957038Z U log2@GLIBC_2.2.5 2025-05-07T20:03:46.2957306Z U log2f@GLIBC_2.2.5 2025-05-07T20:03:46.2957593Z U lrintf@GLIBC_2.2.5 2025-05-07T20:03:46.2957882Z U memcmp@GLIBC_2.2.5 2025-05-07T20:03:46.2958159Z U memcpy@GLIBC_2.14 2025-05-07T20:03:46.2958447Z U memset@GLIBC_2.2.5 2025-05-07T20:03:46.2958729Z U nearbyint@GLIBC_2.2.5 2025-05-07T20:03:46.2959037Z U nearbyintf@GLIBC_2.2.5 2025-05-07T20:03:46.2959382Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:03:46.2959775Z U operator delete[](void*)@GLIBCXX_3.4 2025-05-07T20:03:46.2960119Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:46.2960491Z U operator new[](unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:46.2960845Z U posix_memalign@GLIBC_2.2.5 2025-05-07T20:03:46.2961144Z U pow@GLIBC_2.2.5 2025-05-07T20:03:46.2961431Z U sqrtf@GLIBC_2.2.5 2025-05-07T20:03:46.2961823Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:03:46.2962324Z U std::_Rb_tree_decrement(std::_Rb_tree_node_base*)@GLIBCXX_3.4 2025-05-07T20:03:46.2962770Z U std::_Rb_tree_increment(std::_Rb_tree_node_base*)@GLIBCXX_3.4 2025-05-07T20:03:46.2963442Z U std::_Rb_tree_insert_and_rebalance(bool, std::_Rb_tree_node_base*, std::_Rb_tree_node_base*, std::_Rb_tree_node_base&)@GLIBCXX_3.4 2025-05-07T20:03:46.2964274Z U std::__atomic_futex_unsigned_base::_M_futex_notify_all(unsigned int*)@GLIBCXX_3.4.21 2025-05-07T20:03:46.2965227Z U std::__atomic_futex_unsigned_base::_M_futex_wait_until(unsigned int*, unsigned int, bool, std::chrono::duration >, std::chrono::duration >)@GLIBCXX_3.4.21 2025-05-07T20:03:46.2966396Z U std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:46.2967421Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:46.2968340Z U std::__cxx11::basic_string, std::allocator >::compare(char const*) const@GLIBCXX_3.4.21 2025-05-07T20:03:46.2969118Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:03:46.2969800Z U std::__detail::_Prime_rehash_policy::_M_next_bkt(unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:03:46.2970273Z U std::__exception_ptr::exception_ptr::_M_addref() 2025-05-07T20:03:46.2970632Z U std::__exception_ptr::exception_ptr::_M_release() 2025-05-07T20:03:46.2971121Z U std::__exception_ptr::exception_ptr::exception_ptr(void*)@CXXABI_1.3.11 2025-05-07T20:03:46.2971583Z U std::__future_base::_Result_base::_Result_base()@GLIBCXX_3.4.15 2025-05-07T20:03:46.2972026Z U std::__future_base::_Result_base::~_Result_base()@GLIBCXX_3.4.15 2025-05-07T20:03:46.2972407Z U std::__once_call@GLIBCXX_3.4.11 2025-05-07T20:03:46.2972710Z U std::__once_callable@GLIBCXX_3.4.11 2025-05-07T20:03:46.2973030Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:03:46.2973336Z U std::__throw_bad_array_new_length() 2025-05-07T20:03:46.2973651Z U std::__throw_bad_cast()@GLIBCXX_3.4 2025-05-07T20:03:46.2973978Z U std::__throw_bad_function_call()@GLIBCXX_3.4.14 2025-05-07T20:03:46.2974337Z U std::__throw_future_error(int)@GLIBCXX_3.4.14 2025-05-07T20:03:46.2974702Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:46.2975056Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:46.2975413Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:03:46.2975744Z U std::bad_alloc::~bad_alloc()@GLIBCXX_3.4 2025-05-07T20:03:46.2976503Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:03:46.2977232Z U std::cerr@GLIBCXX_3.4 2025-05-07T20:03:46.2977502Z U std::cout@GLIBCXX_3.4 2025-05-07T20:03:46.2977889Z U std::ctype::_M_widen_init() const@GLIBCXX_3.4.11 2025-05-07T20:03:46.2978407Z U std::future_category()@GLIBCXX_3.4.15 2025-05-07T20:03:46.2978771Z U std::future_error::~future_error()@GLIBCXX_3.4.14 2025-05-07T20:03:46.2979142Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:03:46.2979478Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:03:46.2980113Z U std::logic_error::logic_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:03:46.2981048Z U std::logic_error::logic_error(std::logic_error const&)@GLIBCXX_3.4.21 2025-05-07T20:03:46.2981562Z U std::ostream& std::ostream::_M_insert(double)@GLIBCXX_3.4.9 2025-05-07T20:03:46.2982054Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:03:46.2982574Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:03:46.2983043Z U std::ostream::flush()@GLIBCXX_3.4 2025-05-07T20:03:46.2983378Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:03:46.2983730Z U std::ostream::put(char)@GLIBCXX_3.4 2025-05-07T20:03:46.2984177Z U std::rethrow_exception(std::__exception_ptr::exception_ptr)@CXXABI_1.3.3 2025-05-07T20:03:46.2984685Z U std::runtime_error::runtime_error(char const*)@GLIBCXX_3.4.21 2025-05-07T20:03:46.2985198Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:03:46.2985550Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:46.2985872Z U stderr@GLIBC_2.2.5 2025-05-07T20:03:46.2986151Z U strcmp@GLIBC_2.2.5 2025-05-07T20:03:46.2986443Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:46.2986732Z U strstr@GLIBC_2.2.5 2025-05-07T20:03:46.2987007Z U tolower@GLIBC_2.2.5 2025-05-07T20:03:46.2987299Z U toupper@GLIBC_2.2.5 2025-05-07T20:03:46.2987662Z U typeinfo for std::__future_base::_Result_base@GLIBCXX_3.4.15 2025-05-07T20:03:46.2988082Z U typeinfo for std::bad_alloc@GLIBCXX_3.4 2025-05-07T20:03:46.2988497Z U typeinfo for std::future_error@GLIBCXX_3.4.14 2025-05-07T20:03:46.2988880Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:03:46.2989271Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:46.2989684Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:46.2990076Z U vtable for std::bad_alloc@GLIBCXX_3.4 2025-05-07T20:03:46.2990424Z U vtable for std::future_error@GLIBCXX_3.4.14 2025-05-07T20:03:46.2990959Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:46.2991339Z w _ITM_registerTMCloneTable 2025-05-07T20:03:46.2991750Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:46.2992056Z w __gmon_start__ 2025-05-07T20:03:46.2992403Z w __pthread_key_create 2025-05-07T20:03:46.2992714Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:03:46.2993037Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:03:46.2993360Z w pthread_once 2025-05-07T20:03:46.2993625Z w pthread_rwlock_rdlock 2025-05-07T20:03:46.2993930Z w pthread_rwlock_unlock 2025-05-07T20:03:46.2994219Z w pthread_rwlock_wrlock 2025-05-07T20:03:46.2994520Z w pthread_self@GLIBC_2.2.5 2025-05-07T20:03:46.2994875Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:46.2995263Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so 2025-05-07T20:03:46.2995510Z 2025-05-07T20:03:46.2995647Z linux-vdso.so.1 (0x00007ffeddd97000) 2025-05-07T20:03:46.2995930Z libc10.so => not found 2025-05-07T20:03:46.2996446Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so (0x00007f087375c000) 2025-05-07T20:03:46.2997001Z libtorch.so => not found 2025-05-07T20:03:46.2997271Z libtorch_cpu.so => not found 2025-05-07T20:03:46.2997553Z libtorch_cuda.so => not found 2025-05-07T20:03:46.2997881Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f0872f9c000) 2025-05-07T20:03:46.2998286Z libm.so.6 => /lib64/libm.so.6 (0x00007f0872ec1000) 2025-05-07T20:03:46.2998658Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f0872e93000) 2025-05-07T20:03:46.2999042Z libc.so.6 => /lib64/libc.so.6 (0x00007f0872c8b000) 2025-05-07T20:03:46.2999391Z /lib64/ld-linux-x86-64.so.2 (0x00007f08737db000) 2025-05-07T20:03:46.2999729Z libtorch_cpu.so => not found 2025-05-07T20:03:46.2999993Z libtorch_cuda.so => not found 2025-05-07T20:03:46.3000268Z libtorch.so => not found 2025-05-07T20:03:46.3000425Z 2025-05-07T20:03:46.3000546Z [CHECK] Displaying ELF information: 2025-05-07T20:03:46.3000910Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so 2025-05-07T20:03:46.3001185Z 2025-05-07T20:03:46.3027262Z 2025-05-07T20:03:46.3027993Z Dynamic section at offset 0x52bcb8 contains 38 entries: 2025-05-07T20:03:46.3029217Z Tag Type Name/Value 2025-05-07T20:03:46.3029999Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:03:46.3030499Z 0x0000000000000001 (NEEDED) Shared library: [asmjit.so] 2025-05-07T20:03:46.3030986Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:46.3031757Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:46.3032276Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:46.3032799Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:46.3043292Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:03:46.3043870Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:46.3044387Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:46.3044897Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:03:46.3045579Z 0x000000000000000e (SONAME) Library soname: [fbgemm.so] 2025-05-07T20:03:46.3046070Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:03:46.3046470Z 0x000000000000000c (INIT) 0xf4000 2025-05-07T20:03:46.3046840Z 0x000000000000000d (FINI) 0x4d67e0 2025-05-07T20:03:46.3047172Z 0x0000000000000019 (INIT_ARRAY) 0x529c40 2025-05-07T20:03:46.3047552Z 0x000000000000001b (INIT_ARRAYSZ) 56 (bytes) 2025-05-07T20:03:46.3047897Z 0x000000000000001a (FINI_ARRAY) 0x529c78 2025-05-07T20:03:46.3048241Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:46.3048570Z 0x0000000000000004 (HASH) 0x238 2025-05-07T20:03:46.3048988Z 0x000000006ffffef5 (GNU_HASH) 0x6d60 2025-05-07T20:03:46.3049332Z 0x0000000000000005 (STRTAB) 0x2aa98 2025-05-07T20:03:46.3049649Z 0x0000000000000006 (SYMTAB) 0xe658 2025-05-07T20:03:46.3050007Z 0x000000000000000a (STRSZ) 703409 (bytes) 2025-05-07T20:03:46.3050360Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:46.3050713Z 0x0000000000000003 (PLTGOT) 0x52cf58 2025-05-07T20:03:46.3051059Z 0x0000000000000002 (PLTRELSZ) 23160 (bytes) 2025-05-07T20:03:46.3051417Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:46.3051745Z 0x0000000000000017 (JMPREL) 0xee050 2025-05-07T20:03:46.3052091Z 0x0000000000000007 (RELA) 0xd8d80 2025-05-07T20:03:46.3052439Z 0x0000000000000008 (RELASZ) 86736 (bytes) 2025-05-07T20:03:46.3052781Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:46.3053108Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:46.3053420Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:46.3053771Z 0x000000006ffffffe (VERNEED) 0xd8c00 2025-05-07T20:03:46.3054092Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:03:46.3054418Z 0x000000006ffffff0 (VERSYM) 0xd664a 2025-05-07T20:03:46.3054748Z 0x000000006ffffff9 (RELACOUNT) 9 2025-05-07T20:03:46.3055043Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:46.3055258Z 2025-05-07T20:03:46.3055385Z ################################################################################ 2025-05-07T20:03:46.3055610Z 2025-05-07T20:03:46.3055615Z 2025-05-07T20:03:46.3055726Z ################################################################################ 2025-05-07T20:03:46.3056221Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:03:46.3056707Z [CHECK] Listing out library size: 2025-05-07T20:03:46.3057146Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:03:46.3057509Z 2025-05-07T20:03:46.3057714Z 2 ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:03:46.3058007Z 2025-05-07T20:03:46.3058386Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:03:46.3059351Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:46.3059928Z 2025-05-07T20:03:46.3106980Z GLIBC_2.2.5 2025-05-07T20:03:46.3107485Z GLIBC_2.3 2025-05-07T20:03:46.3107716Z GLIBC_2.14 2025-05-07T20:03:46.3109458Z 2025-05-07T20:03:46.3109463Z 2025-05-07T20:03:46.3109890Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:03:46.3110923Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:46.3111637Z 2025-05-07T20:03:46.3166345Z GLIBCXX_3.4 2025-05-07T20:03:46.3166969Z GLIBCXX_3.4.9 2025-05-07T20:03:46.3167583Z GLIBCXX_3.4.14 2025-05-07T20:03:46.3170515Z GLIBCXX_3.4.20 2025-05-07T20:03:46.3170755Z GLIBCXX_3.4.21 2025-05-07T20:03:46.3170945Z 2025-05-07T20:03:46.3170949Z 2025-05-07T20:03:46.3192237Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so > /tmp/tmp.ABEH6m8x3R.symbols.txt 2025-05-07T20:03:46.3192721Z 2025-05-07T20:03:46.3218505Z 2025-05-07T20:03:46.3241097Z [CHECK] Total Number of symbols: 477 2025-05-07T20:03:46.3259673Z [CHECK] Number of fbgemm symbols: 48 2025-05-07T20:03:46.3284722Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so > /tmp/tmp.KhItmPbbHG.usymbols.txt 2025-05-07T20:03:46.3285249Z 2025-05-07T20:03:46.3305006Z 2025-05-07T20:03:46.3331056Z [CHECK] Listing out undefined symbols (200 total): 2025-05-07T20:03:46.3347028Z U GOMP_barrier 2025-05-07T20:03:46.3348020Z U GOMP_parallel 2025-05-07T20:03:46.3349679Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:46.3350304Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:46.3350683Z U __cudaPopCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:46.3351119Z U __cudaPushCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:46.3351654Z U __cudaRegisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:46.3352063Z U __cudaRegisterFatBinaryEnd@libcudart.so.11.0 2025-05-07T20:03:46.3352499Z U __cudaRegisterFunction@libcudart.so.11.0 2025-05-07T20:03:46.3352874Z U __cudaRegisterVar@libcudart.so.11.0 2025-05-07T20:03:46.3353273Z U __cudaUnregisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:46.3353654Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:03:46.3354016Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:03:46.3354361Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:46.3354687Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:03:46.3355045Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:03:46.3355379Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:03:46.3355730Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:03:46.3356056Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:03:46.3356411Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:03:46.3356723Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:03:46.3357065Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:46.3357410Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:03:46.3357908Z U at::Tensor::index_put_(std::initializer_list, at::Tensor const&) 2025-05-07T20:03:46.3358520Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:03:46.3358989Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:03:46.3359947Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:46.3360906Z U at::_ops::is_nonzero::call(at::Tensor const&) 2025-05-07T20:03:46.3362502Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:03:46.3363048Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:03:46.3363851Z U at::_ops::slice_Tensor::call(at::Tensor const&, long, std::optional, std::optional, c10::SymInt) 2025-05-07T20:03:46.3364888Z U at::_ops::to_dtype_layout::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, bool, bool, std::optional) 2025-05-07T20:03:46.3365720Z U at::_ops::zero_::call(at::Tensor&) 2025-05-07T20:03:46.3366552Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:46.3367389Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:03:46.3367747Z U at::get_num_threads() 2025-05-07T20:03:46.3368040Z U at::get_thread_num() 2025-05-07T20:03:46.3368355Z U at::in_parallel_region() 2025-05-07T20:03:46.3368672Z U at::init_num_threads() 2025-05-07T20:03:46.3368986Z U at::internal::set_thread_num(int) 2025-05-07T20:03:46.3369361Z U at::toAccumulateType(c10::ScalarType, bool) 2025-05-07T20:03:46.3369688Z U c10::BoolType::get() 2025-05-07T20:03:46.3370057Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:03:46.3370688Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:03:46.3371243Z U c10::Error::what() const 2025-05-07T20:03:46.3371606Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:03:46.3372029Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:46.3372463Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:03:46.3372805Z U c10::IntType::get() 2025-05-07T20:03:46.3373184Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:03:46.3373595Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:03:46.3374024Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:46.3374498Z U c10::ParallelGuard::ParallelGuard(bool) 2025-05-07T20:03:46.3374849Z U c10::ParallelGuard::~ParallelGuard() 2025-05-07T20:03:46.3375227Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:03:46.3375610Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:03:46.3376262Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:03:46.3376895Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:03:46.3377260Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:03:46.3377631Z U c10::SymInt::promote_to_negative() 2025-05-07T20:03:46.3377971Z U c10::SymInt::sym_ne(c10::SymInt const&) const 2025-05-07T20:03:46.3378329Z U c10::SymInt::toSymNode() const 2025-05-07T20:03:46.3378656Z U c10::SymIntType::get() 2025-05-07T20:03:46.3378994Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:03:46.3379358Z U c10::TensorType::get() 2025-05-07T20:03:46.3379668Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:03:46.3380633Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:03:46.3381562Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:03:46.3381953Z U c10::cuda::CUDAKernelLaunchRegistry::get_singleton_ref() 2025-05-07T20:03:46.3382495Z U c10::cuda::CUDAKernelLaunchRegistry::get_uvm_assertions_ptr_for_current_device() 2025-05-07T20:03:46.3383170Z U c10::cuda::CUDAKernelLaunchRegistry::insert(char const*, char const*, unsigned int, char const*, int) 2025-05-07T20:03:46.3383728Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:03:46.3384086Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:03:46.3384438Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:03:46.3384818Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:03:46.3385142Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:03:46.3385612Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:03:46.3386083Z U c10::cuda::device_count() 2025-05-07T20:03:46.3386412Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:03:46.3386796Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:03:46.3387336Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:03:46.3387768Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:03:46.3388168Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:03:46.3388570Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:03:46.3389315Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:46.3390192Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:03:46.3391475Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:46.3392450Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:03:46.3393487Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:46.3394324Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:03:46.3394693Z U c10::impl::GPUTrace::haveState 2025-05-07T20:03:46.3395039Z U c10::impl::device_guard_impl_registry 2025-05-07T20:03:46.3395416Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:03:46.3395775Z U c10::operator<(c10::SymInt const&, int) 2025-05-07T20:03:46.3396181Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:03:46.3396589Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:03:46.3397014Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:03:46.3397418Z U c10::operator>(c10::SymInt const&, int) 2025-05-07T20:03:46.3397770Z U c10::operator>=(c10::SymInt const&, int) 2025-05-07T20:03:46.3398136Z U c10::warn(c10::Warning const&) 2025-05-07T20:03:46.3398550Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:03:46.3399018Z U cudaDeviceSynchronize@libcudart.so.11.0 2025-05-07T20:03:46.3399432Z U cudaEventCreateWithFlags@libcudart.so.11.0 2025-05-07T20:03:46.3399805Z U cudaEventDestroy@libcudart.so.11.0 2025-05-07T20:03:46.3400198Z U cudaEventElapsedTime@libcudart.so.11.0 2025-05-07T20:03:46.3400661Z U cudaEventQuery@libcudart.so.11.0 2025-05-07T20:03:46.3401044Z U cudaEventRecord@libcudart.so.11.0 2025-05-07T20:03:46.3401410Z U cudaEventSynchronize@libcudart.so.11.0 2025-05-07T20:03:46.3401801Z U cudaGetErrorString@libcudart.so.11.0 2025-05-07T20:03:46.3402158Z U cudaGetLastError@libcudart.so.11.0 2025-05-07T20:03:46.3402537Z U cudaLaunchKernel@libcudart.so.11.0 2025-05-07T20:03:46.3402923Z U cudaStreamQuery@libcudart.so.11.0 2025-05-07T20:03:46.3403292Z U cudaStreamSynchronize@libcudart.so.11.0 2025-05-07T20:03:46.3403682Z U cudaStreamWaitEvent@libcudart.so.11.0 2025-05-07T20:03:46.3404812Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:46.3406356Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:46.3407904Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:46.3409450Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:46.3411040Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:46.3412696Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:46.3414354Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:46.3416020Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:46.3417681Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:46.3419350Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:46.3421025Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:46.3422931Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:46.3424068Z U fbgemm::fbgemmAlignedAlloc(unsigned long, unsigned long, bool) 2025-05-07T20:03:46.3424573Z U fbgemm::fbgemmAlignedFree(void*) 2025-05-07T20:03:46.3425075Z U fbgemm_gpu::config::is_feature_enabled(fbgemm_gpu::config::FeatureGateName const&) 2025-05-07T20:03:46.3425582Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:03:46.3426010Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:46.3426402Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:03:46.3426810Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:46.3427262Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:03:46.3427752Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:03:46.3428159Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:46.3428510Z U memcpy@GLIBC_2.14 2025-05-07T20:03:46.3428832Z U memset@GLIBC_2.2.5 2025-05-07T20:03:46.3429118Z U omp_get_max_threads 2025-05-07T20:03:46.3429430Z U omp_get_num_threads 2025-05-07T20:03:46.3429712Z U omp_get_thread_num 2025-05-07T20:03:46.3430072Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:03:46.3430484Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:46.3431124Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:03:46.3432223Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:46.3433161Z U std::__cxx11::basic_string, std::allocator >::_M_append(char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:46.3434242Z U std::__cxx11::basic_string, std::allocator >::_M_assign(std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:46.3435286Z U std::__cxx11::basic_string, std::allocator >::_M_construct(unsigned long, char)@GLIBCXX_3.4.21 2025-05-07T20:03:46.3436219Z U std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:46.3437220Z U std::__cxx11::basic_string, std::allocator >::_M_mutate(unsigned long, unsigned long, char const*, unsigned long) 2025-05-07T20:03:46.3438329Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:46.3439295Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const@GLIBCXX_3.4.21 2025-05-07T20:03:46.3439914Z U std::__exception_ptr::exception_ptr::_M_addref() 2025-05-07T20:03:46.3440325Z U std::__exception_ptr::exception_ptr::_M_release() 2025-05-07T20:03:46.3440758Z U std::__throw_bad_function_call()@GLIBCXX_3.4.14 2025-05-07T20:03:46.3441176Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:46.3441575Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:46.3442032Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:03:46.3442562Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:03:46.3443295Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:03:46.3444511Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:03:46.3445681Z U std::basic_ostream >& std::operator<< >(std::basic_ostream >&, char const*)@GLIBCXX_3.4 2025-05-07T20:03:46.3446432Z U std::current_exception()@CXXABI_1.3.3 2025-05-07T20:03:46.3446807Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:03:46.3447160Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:03:46.3447533Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:03:46.3447928Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:03:46.3448290Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:03:46.3448648Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:03:46.3448981Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:03:46.3449397Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:03:46.3449924Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:03:46.3450424Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:03:46.3450959Z U std::pair fbgemm::radix_sort_parallel(int*, int*, int*, int*, long, long, bool) 2025-05-07T20:03:46.3451893Z U std::pair*> fbgemm::radix_sort_parallel >(int*, std::pair*, int*, std::pair*, long, long, bool) 2025-05-07T20:03:46.3453022Z U std::pair*> fbgemm::radix_sort_parallel >(int*, std::pair*, int*, std::pair*, long, long, bool) 2025-05-07T20:03:46.3453867Z U std::rethrow_exception(std::__exception_ptr::exception_ptr)@CXXABI_1.3.3 2025-05-07T20:03:46.3454313Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:46.3454656Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:46.3454968Z U torch::CppFunction::~CppFunction() 2025-05-07T20:03:46.3455793Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:03:46.3456948Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:03:46.3457758Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:03:46.3458502Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:03:46.3459077Z U typeinfo for c10::Error 2025-05-07T20:03:46.3459433Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:03:46.3459873Z U unsigned char* at::TensorBase::data_ptr() const 2025-05-07T20:03:46.3460317Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:46.3460753Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:03:46.3461175Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:46.3461561Z U vtable for c10::Error 2025-05-07T20:03:46.3462140Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:46.3462918Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:46.3464351Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:03:46.3464922Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:03:46.3465366Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:46.3465714Z w _ITM_registerTMCloneTable 2025-05-07T20:03:46.3466026Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:46.3466353Z w __gmon_start__ 2025-05-07T20:03:46.3466631Z w __pthread_key_create 2025-05-07T20:03:46.3466995Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:46.3467443Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:03:46.3467823Z 2025-05-07T20:03:46.3467938Z linux-vdso.so.1 (0x00007fff5dddb000) 2025-05-07T20:03:46.3468355Z libc10.so => not found 2025-05-07T20:03:46.3468592Z libc10_cuda.so => not found 2025-05-07T20:03:46.3469122Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so (0x00007f763b200000) 2025-05-07T20:03:46.3469988Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so (0x00007f763bb05000) 2025-05-07T20:03:46.3470618Z libtorch.so => not found 2025-05-07T20:03:46.3470864Z libtorch_cpu.so => not found 2025-05-07T20:03:46.3471144Z libtorch_cuda.so => not found 2025-05-07T20:03:46.3471504Z libcudart.so.11.0 => not found 2025-05-07T20:03:46.3472005Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f763af9c000) 2025-05-07T20:03:46.3472459Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f763bad5000) 2025-05-07T20:03:46.3472935Z libc.so.6 => /lib64/libc.so.6 (0x00007f763ad94000) 2025-05-07T20:03:46.3473333Z /lib64/ld-linux-x86-64.so.2 (0x00007f763bb14000) 2025-05-07T20:03:46.3473663Z libc10.so => not found 2025-05-07T20:03:46.3474202Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so (0x00007f763ba58000) 2025-05-07T20:03:46.3474779Z libtorch.so => not found 2025-05-07T20:03:46.3475064Z libtorch_cpu.so => not found 2025-05-07T20:03:46.3475357Z libtorch_cuda.so => not found 2025-05-07T20:03:46.3475661Z libm.so.6 => /lib64/libm.so.6 (0x00007f763acb9000) 2025-05-07T20:03:46.3476024Z libtorch.so => not found 2025-05-07T20:03:46.3476281Z libc10.so => not found 2025-05-07T20:03:46.3476556Z libtorch_cpu.so => not found 2025-05-07T20:03:46.3476835Z libtorch_cuda.so => not found 2025-05-07T20:03:46.3477135Z libtorch_cpu.so => not found 2025-05-07T20:03:46.3477407Z libtorch_cuda.so => not found 2025-05-07T20:03:46.3477694Z libtorch.so => not found 2025-05-07T20:03:46.3477856Z 2025-05-07T20:03:46.3477990Z [CHECK] Displaying ELF information: 2025-05-07T20:03:46.3478424Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:03:46.3478767Z 2025-05-07T20:03:46.3478771Z 2025-05-07T20:03:46.3478950Z Dynamic section at offset 0x19c218 contains 40 entries: 2025-05-07T20:03:46.3479356Z Tag Type Name/Value 2025-05-07T20:03:46.3479775Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:03:46.3480302Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:03:46.3480808Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm.so] 2025-05-07T20:03:46.3481338Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_config.so] 2025-05-07T20:03:46.3481867Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:46.3482385Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:46.3482927Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:46.3483456Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.11.0] 2025-05-07T20:03:46.3484240Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:46.3484784Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:46.3485275Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:46.3485757Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:03:46.3486293Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_common.so] 2025-05-07T20:03:46.3486798Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:03:46.3487182Z 0x000000000000000c (INIT) 0x12000 2025-05-07T20:03:46.3487516Z 0x000000000000000d (FINI) 0x76a9c 2025-05-07T20:03:46.3487840Z 0x0000000000000019 (INIT_ARRAY) 0x19cdd8 2025-05-07T20:03:46.3488213Z 0x000000000000001b (INIT_ARRAYSZ) 72 (bytes) 2025-05-07T20:03:46.3488570Z 0x000000000000001a (FINI_ARRAY) 0x19ce20 2025-05-07T20:03:46.3488915Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:46.3489254Z 0x0000000000000004 (HASH) 0x238 2025-05-07T20:03:46.3489567Z 0x000000006ffffef5 (GNU_HASH) 0x1840 2025-05-07T20:03:46.3489904Z 0x0000000000000005 (STRTAB) 0x52a8 2025-05-07T20:03:46.3490217Z 0x0000000000000006 (SYMTAB) 0x25d8 2025-05-07T20:03:46.3490712Z 0x000000000000000a (STRSZ) 37674 (bytes) 2025-05-07T20:03:46.3491245Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:46.3491631Z 0x0000000000000003 (PLTGOT) 0x19d4d8 2025-05-07T20:03:46.3492019Z 0x0000000000000002 (PLTRELSZ) 6336 (bytes) 2025-05-07T20:03:46.3492470Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:46.3492858Z 0x0000000000000017 (JMPREL) 0xff68 2025-05-07T20:03:46.3493366Z 0x0000000000000007 (RELA) 0xeab0 2025-05-07T20:03:46.3493747Z 0x0000000000000008 (RELASZ) 5304 (bytes) 2025-05-07T20:03:46.3494110Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:46.3494467Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:46.3494810Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:46.3495186Z 0x000000006ffffffe (VERNEED) 0xe990 2025-05-07T20:03:46.3495520Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:03:46.3495866Z 0x000000006ffffff0 (VERSYM) 0xe5d2 2025-05-07T20:03:46.3496223Z 0x000000006ffffff9 (RELACOUNT) 17 2025-05-07T20:03:46.3496539Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:46.3496744Z 2025-05-07T20:03:46.3496884Z ################################################################################ 2025-05-07T20:03:46.3497115Z 2025-05-07T20:03:46.3497119Z 2025-05-07T20:03:46.3497238Z ################################################################################ 2025-05-07T20:03:46.3497749Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:03:46.3498253Z [CHECK] Listing out library size: 2025-05-07T20:03:46.3498696Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:03:46.3499068Z 2025-05-07T20:03:46.3499296Z 11 ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:03:46.3499595Z 2025-05-07T20:03:46.3499974Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:03:46.3500949Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_cache.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:46.3501522Z 2025-05-07T20:03:46.3514015Z GLIBC_2.2.5 2025-05-07T20:03:46.3514579Z GLIBC_2.14 2025-05-07T20:03:46.3515062Z 2025-05-07T20:03:46.3515500Z 2025-05-07T20:03:46.3516241Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:03:46.3517294Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_cache.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:46.3517892Z 2025-05-07T20:03:46.3592994Z GLIBCXX_3.4 2025-05-07T20:03:46.3593238Z GLIBCXX_3.4.9 2025-05-07T20:03:46.3593443Z GLIBCXX_3.4.11 2025-05-07T20:03:46.3593661Z GLIBCXX_3.4.20 2025-05-07T20:03:46.3593867Z GLIBCXX_3.4.21 2025-05-07T20:03:46.3594001Z 2025-05-07T20:03:46.3594006Z 2025-05-07T20:03:46.3615654Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_cache.so > /tmp/tmp.28HGBT4jOQ.symbols.txt 2025-05-07T20:03:46.3617071Z 2025-05-07T20:03:46.3664561Z 2025-05-07T20:03:46.3692533Z [CHECK] Total Number of symbols: 839 2025-05-07T20:03:46.3707099Z [CHECK] Number of fbgemm symbols: 80 2025-05-07T20:03:46.3719867Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_cache.so > /tmp/tmp.z8t6mfpY4k.usymbols.txt 2025-05-07T20:03:46.3720973Z 2025-05-07T20:03:46.3741613Z 2025-05-07T20:03:46.3764694Z [CHECK] Listing out undefined symbols (158 total): 2025-05-07T20:03:46.3785859Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:46.3787638Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:46.3788673Z U __cudaPopCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:46.3789236Z U __cudaPushCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:46.3789624Z U __cudaRegisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:46.3790026Z U __cudaRegisterFatBinaryEnd@libcudart.so.11.0 2025-05-07T20:03:46.3790406Z U __cudaRegisterFunction@libcudart.so.11.0 2025-05-07T20:03:46.3791476Z U __cudaRegisterVar@libcudart.so.11.0 2025-05-07T20:03:46.3791865Z U __cudaUnregisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:46.3792255Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:03:46.3792653Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:46.3792979Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:03:46.3793322Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:03:46.3793653Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:03:46.3793999Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:03:46.3794322Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:03:46.3794666Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:46.3795021Z U at::CUDAGeneratorImpl::device_type() 2025-05-07T20:03:46.3795453Z U at::CUDAGeneratorImpl::philox_cuda_state(unsigned long) 2025-05-07T20:03:46.3796237Z U at::_ops::arange::call(c10::Scalar const&, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:46.3797412Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:46.3798812Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:46.3799682Z U at::_ops::fill__Scalar::call(at::Tensor&, c10::Scalar const&) 2025-05-07T20:03:46.3800558Z U at::_ops::full_like::call(at::Tensor const&, c10::Scalar const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:46.3801460Z U at::_ops::index_select::call(at::Tensor const&, long, at::Tensor const&) 2025-05-07T20:03:46.3802107Z U at::_ops::slice_Tensor::call(at::Tensor const&, long, std::optional, std::optional, c10::SymInt) 2025-05-07T20:03:46.3803222Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:46.3804335Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:46.3805139Z U at::cuda::detail::getDefaultCUDAGenerator(signed char) 2025-05-07T20:03:46.3805552Z U at::cuda::getDeviceProperties(signed char) 2025-05-07T20:03:46.3805905Z U c10::BoolType::get() 2025-05-07T20:03:46.3806245Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:03:46.3806637Z U c10::GeneratorImpl::device() const 2025-05-07T20:03:46.3807049Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:46.3807520Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:03:46.3807881Z U c10::IntType::get() 2025-05-07T20:03:46.3808276Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:46.3808766Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:46.3809150Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:03:46.3809792Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:03:46.3810420Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:03:46.3810761Z U c10::SymInt::promote_to_negative() 2025-05-07T20:03:46.3811132Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:46.3811477Z U c10::TensorType::get() 2025-05-07T20:03:46.3811794Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:03:46.3812689Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:03:46.3813570Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:03:46.3813923Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:03:46.3814248Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:03:46.3814579Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:03:46.3814919Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:03:46.3815238Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:03:46.3815693Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:03:46.3816135Z U c10::cuda::current_device() 2025-05-07T20:03:46.3816441Z U c10::cuda::device_count() 2025-05-07T20:03:46.3816759Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:03:46.3817140Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:03:46.3817525Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:03:46.3817891Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:03:46.3818288Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:03:46.3818648Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:03:46.3819345Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:46.3820180Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:03:46.3820981Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:46.3821932Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:03:46.3822487Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:03:46.3822982Z U c10::impl::GPUTrace::haveState 2025-05-07T20:03:46.3823363Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:03:46.3823788Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:03:46.3824212Z U c10::impl::device_guard_impl_registry 2025-05-07T20:03:46.3824752Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:03:46.3825246Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:03:46.3825658Z U c10::throwNullDataPtrError() 2025-05-07T20:03:46.3825990Z U c10::warn(c10::Warning const&) 2025-05-07T20:03:46.3826347Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:03:46.3826774Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:03:46.3827227Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:03:46.3827608Z U cudaDeviceGetAttribute@libcudart.so.11.0 2025-05-07T20:03:46.3827980Z U cudaDeviceSynchronize@libcudart.so.11.0 2025-05-07T20:03:46.3828376Z U cudaEventCreateWithFlags@libcudart.so.11.0 2025-05-07T20:03:46.3828746Z U cudaEventDestroy@libcudart.so.11.0 2025-05-07T20:03:46.3829116Z U cudaEventElapsedTime@libcudart.so.11.0 2025-05-07T20:03:46.3829470Z U cudaEventQuery@libcudart.so.11.0 2025-05-07T20:03:46.3829829Z U cudaEventRecord@libcudart.so.11.0 2025-05-07T20:03:46.3830187Z U cudaEventSynchronize@libcudart.so.11.0 2025-05-07T20:03:46.3830575Z U cudaFuncGetAttributes@libcudart.so.11.0 2025-05-07T20:03:46.3830955Z U cudaGetDevice@libcudart.so.11.0 2025-05-07T20:03:46.3831392Z U cudaGetDeviceCount@libcudart.so.11.0 2025-05-07T20:03:46.3831776Z U cudaGetErrorString@libcudart.so.11.0 2025-05-07T20:03:46.3832133Z U cudaGetLastError@libcudart.so.11.0 2025-05-07T20:03:46.3832502Z U cudaLaunchKernel@libcudart.so.11.0 2025-05-07T20:03:46.3832855Z U cudaMemsetAsync@libcudart.so.11.0 2025-05-07T20:03:46.3833385Z U cudaOccupancyMaxActiveBlocksPerMultiprocessorWithFlags@libcudart.so.11.0 2025-05-07T20:03:46.3833932Z U cudaPeekAtLastError@libcudart.so.11.0 2025-05-07T20:03:46.3834284Z U cudaSetDevice@libcudart.so.11.0 2025-05-07T20:03:46.3834648Z U cudaStreamQuery@libcudart.so.11.0 2025-05-07T20:03:46.3835006Z U cudaStreamSynchronize@libcudart.so.11.0 2025-05-07T20:03:46.3835386Z U cudaStreamWaitEvent@libcudart.so.11.0 2025-05-07T20:03:46.3835789Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:46.3836229Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:03:46.3836628Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:46.3836980Z U log2f@GLIBC_2.2.5 2025-05-07T20:03:46.3837367Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:03:46.3837801Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:03:46.3838218Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:46.3838581Z U memcpy@GLIBC_2.14 2025-05-07T20:03:46.3838902Z U memset@GLIBC_2.2.5 2025-05-07T20:03:46.3839265Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:03:46.3839662Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:46.3840029Z U printf@GLIBC_2.2.5 2025-05-07T20:03:46.3840388Z U puts@GLIBC_2.2.5 2025-05-07T20:03:46.3840949Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:03:46.3841793Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:46.3842722Z U std::__cxx11::basic_string, std::allocator >::_M_append(char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:46.3843918Z U std::__cxx11::basic_string, std::allocator >::_M_assign(std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:46.3845005Z U std::__cxx11::basic_string, std::allocator >::_M_construct(unsigned long, char)@GLIBCXX_3.4.21 2025-05-07T20:03:46.3845885Z U std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:46.3846858Z U std::__cxx11::basic_string, std::allocator >::_M_mutate(unsigned long, unsigned long, char const*, unsigned long) 2025-05-07T20:03:46.3847941Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:46.3849047Z U std::__cxx11::basic_string, std::allocator >::replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:46.3850077Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const@GLIBCXX_3.4.21 2025-05-07T20:03:46.3850648Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:46.3851035Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:46.3851456Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:03:46.3851858Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:03:46.3852337Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:03:46.3853020Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:03:46.3853987Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:03:46.3854761Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:03:46.3855094Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:03:46.3855450Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:03:46.3855798Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:03:46.3856116Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:03:46.3856450Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:03:46.3856832Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:03:46.3857357Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:03:46.3857826Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:46.3858292Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:46.3858646Z U torch::CppFunction::~CppFunction() 2025-05-07T20:03:46.3859643Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:03:46.3860899Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:03:46.3861762Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:03:46.3862500Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:03:46.3863225Z U unsigned char* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:46.3863744Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:46.3864200Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:03:46.3864686Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:46.3865320Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:46.3866139Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:46.3866811Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:03:46.3867364Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:03:46.3867849Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:46.3868186Z w _ITM_registerTMCloneTable 2025-05-07T20:03:46.3868531Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:46.3868863Z w __gmon_start__ 2025-05-07T20:03:46.3869152Z w __pthread_key_create 2025-05-07T20:03:46.3869490Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:03:46.3869828Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:03:46.3870224Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:46.3870686Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:03:46.3871027Z 2025-05-07T20:03:46.3871170Z linux-vdso.so.1 (0x00007ffd23fa0000) 2025-05-07T20:03:46.3871575Z libtorch.so => not found 2025-05-07T20:03:46.3871832Z libc10.so => not found 2025-05-07T20:03:46.3872110Z libc10_cuda.so => not found 2025-05-07T20:03:46.3872389Z libtorch_cpu.so => not found 2025-05-07T20:03:46.3872698Z libtorch_cuda.so => not found 2025-05-07T20:03:46.3872981Z libcudart.so.11.0 => not found 2025-05-07T20:03:46.3873358Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f260eb9c000) 2025-05-07T20:03:46.3873784Z libm.so.6 => /lib64/libm.so.6 (0x00007f260eac1000) 2025-05-07T20:03:46.3874174Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f260faa4000) 2025-05-07T20:03:46.3874583Z libc.so.6 => /lib64/libc.so.6 (0x00007f260e8b9000) 2025-05-07T20:03:46.3874948Z /lib64/ld-linux-x86-64.so.2 (0x00007f260fad8000) 2025-05-07T20:03:46.3875204Z 2025-05-07T20:03:46.3875327Z [CHECK] Displaying ELF information: 2025-05-07T20:03:46.3875753Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:03:46.3876115Z 2025-05-07T20:03:46.3876119Z 2025-05-07T20:03:46.3876281Z Dynamic section at offset 0xa7ca60 contains 37 entries: 2025-05-07T20:03:46.3876677Z Tag Type Name/Value 2025-05-07T20:03:46.3877099Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:46.3877621Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:03:46.3878124Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:03:46.3878662Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:46.3879207Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:46.3879733Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.11.0] 2025-05-07T20:03:46.3880350Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:46.3880860Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:03:46.3881386Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:46.3881887Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:46.3882436Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_cache.so] 2025-05-07T20:03:46.3882917Z 0x000000000000000c (INIT) 0x2c000 2025-05-07T20:03:46.3883254Z 0x000000000000000d (FINI) 0xcd12c 2025-05-07T20:03:46.3883615Z 0x0000000000000019 (INIT_ARRAY) 0xa7c0c8 2025-05-07T20:03:46.3884173Z 0x000000000000001b (INIT_ARRAYSZ) 208 (bytes) 2025-05-07T20:03:46.3884565Z 0x000000000000001a (FINI_ARRAY) 0xa7c198 2025-05-07T20:03:46.3884902Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:46.3885259Z 0x0000000000000004 (HASH) 0x200 2025-05-07T20:03:46.3885585Z 0x000000006ffffef5 (GNU_HASH) 0x1ed8 2025-05-07T20:03:46.3885938Z 0x0000000000000005 (STRTAB) 0x8a48 2025-05-07T20:03:46.3886278Z 0x0000000000000006 (SYMTAB) 0x3b88 2025-05-07T20:03:46.3886625Z 0x000000000000000a (STRSZ) 117532 (bytes) 2025-05-07T20:03:46.3887008Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:46.3887350Z 0x0000000000000003 (PLTGOT) 0xa7cd00 2025-05-07T20:03:46.3887721Z 0x0000000000000002 (PLTRELSZ) 8592 (bytes) 2025-05-07T20:03:46.3888063Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:46.3888412Z 0x0000000000000017 (JMPREL) 0x292f0 2025-05-07T20:03:46.3888742Z 0x0000000000000007 (RELA) 0x25d08 2025-05-07T20:03:46.3889116Z 0x0000000000000008 (RELASZ) 13800 (bytes) 2025-05-07T20:03:46.3889505Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:46.3889833Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:46.3890182Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:46.3890678Z 0x000000006ffffffe (VERNEED) 0x25bf8 2025-05-07T20:03:46.3891200Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:03:46.3891584Z 0x000000006ffffff0 (VERSYM) 0x25564 2025-05-07T20:03:46.3891945Z 0x000000006ffffff9 (RELACOUNT) 39 2025-05-07T20:03:46.3892264Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:46.3892495Z 2025-05-07T20:03:46.3892619Z ################################################################################ 2025-05-07T20:03:46.3892853Z 2025-05-07T20:03:46.3892857Z 2025-05-07T20:03:46.3893011Z ################################################################################ 2025-05-07T20:03:46.3893533Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:03:46.3894063Z [CHECK] Listing out library size: 2025-05-07T20:03:46.3894532Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:03:46.3894935Z 2025-05-07T20:03:46.3895157Z 5 ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:03:46.3895468Z 2025-05-07T20:03:46.3895893Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:03:46.3896899Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_optimizers.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:46.3897527Z 2025-05-07T20:03:46.3942980Z GLIBC_2.2.5 2025-05-07T20:03:46.3943646Z GLIBC_2.3 2025-05-07T20:03:46.3944204Z GLIBC_2.14 2025-05-07T20:03:46.3944532Z 2025-05-07T20:03:46.3944586Z 2025-05-07T20:03:46.3945819Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:03:46.3949331Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_optimizers.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:46.3950672Z 2025-05-07T20:03:46.3997324Z GLIBCXX_3.4 2025-05-07T20:03:46.3997971Z GLIBCXX_3.4.9 2025-05-07T20:03:46.3998365Z GLIBCXX_3.4.11 2025-05-07T20:03:46.3998613Z GLIBCXX_3.4.18 2025-05-07T20:03:46.3998823Z GLIBCXX_3.4.21 2025-05-07T20:03:46.3998974Z 2025-05-07T20:03:46.3998979Z 2025-05-07T20:03:46.4018925Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_optimizers.so > /tmp/tmp.ONlQdWXwYt.symbols.txt 2025-05-07T20:03:46.4020389Z 2025-05-07T20:03:46.4042059Z 2025-05-07T20:03:46.4068329Z [CHECK] Total Number of symbols: 329 2025-05-07T20:03:46.4078471Z [CHECK] Number of fbgemm symbols: 12 2025-05-07T20:03:46.4095416Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_optimizers.so > /tmp/tmp.pWbXmXhnJI.usymbols.txt 2025-05-07T20:03:46.4096880Z 2025-05-07T20:03:46.4113036Z 2025-05-07T20:03:46.4138502Z [CHECK] Listing out undefined symbols (133 total): 2025-05-07T20:03:46.4153021Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:46.4153880Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:46.4154447Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:46.4154813Z U __cudaPopCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:46.4155255Z U __cudaPushCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:46.4155779Z U __cudaRegisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:46.4156164Z U __cudaRegisterFatBinaryEnd@libcudart.so.11.0 2025-05-07T20:03:46.4156562Z U __cudaRegisterFunction@libcudart.so.11.0 2025-05-07T20:03:46.4156914Z U __cudaRegisterVar@libcudart.so.11.0 2025-05-07T20:03:46.4157293Z U __cudaUnregisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:46.4157642Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:03:46.4157976Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:46.4158314Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:03:46.4158618Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:03:46.4158953Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:03:46.4159281Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:46.4159627Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:03:46.4159956Z U at::CUDAGeneratorImpl::device_type() 2025-05-07T20:03:46.4160374Z U at::CUDAGeneratorImpl::philox_cuda_state(unsigned long) 2025-05-07T20:03:46.4160871Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:03:46.4161328Z U at::cuda::detail::getDefaultCUDAGenerator(signed char) 2025-05-07T20:03:46.4161718Z U c10::BoolType::get() 2025-05-07T20:03:46.4162077Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:03:46.4162463Z U c10::FloatType::get() 2025-05-07T20:03:46.4162783Z U c10::GeneratorImpl::device() const 2025-05-07T20:03:46.4163206Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:46.4163651Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:03:46.4163999Z U c10::IntType::get() 2025-05-07T20:03:46.4164389Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:03:46.4164791Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:03:46.4165189Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:03:46.4165603Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:03:46.4166500Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:03:46.4167153Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:03:46.4167487Z U c10::TensorType::get() 2025-05-07T20:03:46.4167826Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:03:46.4168912Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:03:46.4169859Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:03:46.4170282Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:03:46.4170676Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:03:46.4171041Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:03:46.4171400Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:03:46.4171742Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:03:46.4172239Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:03:46.4172703Z U c10::cuda::device_count() 2025-05-07T20:03:46.4173064Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:03:46.4173445Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:03:46.4173853Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:03:46.4174256Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:03:46.4174657Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:03:46.4175060Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:03:46.4175783Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:46.4176838Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:03:46.4177720Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:46.4178663Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:03:46.4179259Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:03:46.4179615Z U c10::impl::GPUTrace::haveState 2025-05-07T20:03:46.4179962Z U c10::impl::device_guard_impl_registry 2025-05-07T20:03:46.4180362Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:03:46.4180766Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:03:46.4181141Z U c10::warn(c10::Warning const&) 2025-05-07T20:03:46.4181575Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:03:46.4182017Z U cudaDeviceSynchronize@libcudart.so.11.0 2025-05-07T20:03:46.4182425Z U cudaEventCreateWithFlags@libcudart.so.11.0 2025-05-07T20:03:46.4182802Z U cudaEventDestroy@libcudart.so.11.0 2025-05-07T20:03:46.4183184Z U cudaEventElapsedTime@libcudart.so.11.0 2025-05-07T20:03:46.4183544Z U cudaEventQuery@libcudart.so.11.0 2025-05-07T20:03:46.4183926Z U cudaEventRecord@libcudart.so.11.0 2025-05-07T20:03:46.4184284Z U cudaEventSynchronize@libcudart.so.11.0 2025-05-07T20:03:46.4184661Z U cudaGetDeviceProperties@libcudart.so.11.0 2025-05-07T20:03:46.4185035Z U cudaGetErrorString@libcudart.so.11.0 2025-05-07T20:03:46.4185371Z U cudaGetLastError@libcudart.so.11.0 2025-05-07T20:03:46.4185714Z U cudaLaunchKernel@libcudart.so.11.0 2025-05-07T20:03:46.4186110Z U cudaStreamQuery@libcudart.so.11.0 2025-05-07T20:03:46.4186478Z U cudaStreamSynchronize@libcudart.so.11.0 2025-05-07T20:03:46.4186834Z U cudaStreamWaitEvent@libcudart.so.11.0 2025-05-07T20:03:46.4187193Z U float at::Tensor::item() const 2025-05-07T20:03:46.4187581Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:46.4187990Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:46.4188403Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:46.4188781Z U memcpy@GLIBC_2.14 2025-05-07T20:03:46.4189081Z U memset@GLIBC_2.2.5 2025-05-07T20:03:46.4189439Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:03:46.4189831Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:46.4190416Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:03:46.4191468Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:46.4192380Z U std::__cxx11::basic_string, std::allocator >::_M_append(char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:46.4193435Z U std::__cxx11::basic_string, std::allocator >::_M_assign(std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:46.4194464Z U std::__cxx11::basic_string, std::allocator >::_M_construct(unsigned long, char)@GLIBCXX_3.4.21 2025-05-07T20:03:46.4195387Z U std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:46.4196426Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:46.4197376Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const@GLIBCXX_3.4.21 2025-05-07T20:03:46.4198205Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:03:46.4198814Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:03:46.4199147Z U std::__throw_bad_array_new_length() 2025-05-07T20:03:46.4199521Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:46.4199907Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:46.4200300Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:03:46.4200794Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:03:46.4201484Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:03:46.4202519Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:03:46.4203782Z U std::basic_ostream >& std::operator<< >(std::basic_ostream >&, char const*)@GLIBCXX_3.4 2025-05-07T20:03:46.4204515Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:03:46.4204870Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:03:46.4205207Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:03:46.4205640Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:03:46.4205986Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:03:46.4206309Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:03:46.4206718Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:03:46.4207244Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:03:46.4207723Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:03:46.4208164Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:46.4208509Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:46.4208856Z U torch::CppFunction::~CppFunction() 2025-05-07T20:03:46.4209646Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:03:46.4210783Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:03:46.4211594Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:03:46.4212314Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:03:46.4212965Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:46.4213387Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:03:46.4213982Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:46.4214586Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:46.4215325Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:46.4216084Z U vtable for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:46.4216724Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:03:46.4217235Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:03:46.4217688Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:46.4218002Z w _ITM_registerTMCloneTable 2025-05-07T20:03:46.4218319Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:46.4218628Z w __gmon_start__ 2025-05-07T20:03:46.4218883Z w __pthread_key_create 2025-05-07T20:03:46.4219194Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:03:46.4219499Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:03:46.4219875Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:46.4220312Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:03:46.4220639Z 2025-05-07T20:03:46.4220743Z linux-vdso.so.1 (0x00007ffc09130000) 2025-05-07T20:03:46.4221021Z libtorch.so => not found 2025-05-07T20:03:46.4221129Z libc10.so => not found 2025-05-07T20:03:46.4221224Z libc10_cuda.so => not found 2025-05-07T20:03:46.4221317Z libtorch_cpu.so => not found 2025-05-07T20:03:46.4221432Z libtorch_cuda.so => not found 2025-05-07T20:03:46.4221526Z libcudart.so.11.0 => not found 2025-05-07T20:03:46.4221680Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007fd6c859c000) 2025-05-07T20:03:46.4221856Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fd6c8d04000) 2025-05-07T20:03:46.4221977Z libc.so.6 => /lib64/libc.so.6 (0x00007fd6c8394000) 2025-05-07T20:03:46.4222097Z /lib64/ld-linux-x86-64.so.2 (0x00007fd6c8d38000) 2025-05-07T20:03:46.4222294Z libm.so.6 => /lib64/libm.so.6 (0x00007fd6c82b9000) 2025-05-07T20:03:46.4222300Z 2025-05-07T20:03:46.4222427Z [CHECK] Displaying ELF information: 2025-05-07T20:03:46.4222660Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:03:46.4222665Z 2025-05-07T20:03:46.4233349Z 2025-05-07T20:03:46.4234076Z Dynamic section at offset 0x4695c8 contains 37 entries: 2025-05-07T20:03:46.4234214Z Tag Type Name/Value 2025-05-07T20:03:46.4234433Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:46.4234625Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:03:46.4235977Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:03:46.4236252Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:46.4236468Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:46.4236704Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.11.0] 2025-05-07T20:03:46.4236899Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:46.4237101Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:46.4237284Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:46.4237495Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:03:46.4237731Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_optimizers.so] 2025-05-07T20:03:46.4237842Z 0x000000000000000c (INIT) 0xf000 2025-05-07T20:03:46.4237949Z 0x000000000000000d (FINI) 0x3451c 2025-05-07T20:03:46.4238073Z 0x0000000000000019 (INIT_ARRAY) 0x469348 2025-05-07T20:03:46.4238193Z 0x000000000000001b (INIT_ARRAYSZ) 48 (bytes) 2025-05-07T20:03:46.4238301Z 0x000000000000001a (FINI_ARRAY) 0x469378 2025-05-07T20:03:46.4238421Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:46.4238538Z 0x0000000000000004 (HASH) 0x238 2025-05-07T20:03:46.4238648Z 0x000000006ffffef5 (GNU_HASH) 0x1130 2025-05-07T20:03:46.4238751Z 0x0000000000000005 (STRTAB) 0x3a58 2025-05-07T20:03:46.4238863Z 0x0000000000000006 (SYMTAB) 0x1b68 2025-05-07T20:03:46.4238991Z 0x000000000000000a (STRSZ) 35846 (bytes) 2025-05-07T20:03:46.4239106Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:46.4239220Z 0x0000000000000003 (PLTGOT) 0x469868 2025-05-07T20:03:46.4239359Z 0x0000000000000002 (PLTRELSZ) 3504 (bytes) 2025-05-07T20:03:46.4239466Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:46.4239572Z 0x0000000000000017 (JMPREL) 0xdde0 2025-05-07T20:03:46.4239690Z 0x0000000000000007 (RELA) 0xca18 2025-05-07T20:03:46.4239810Z 0x0000000000000008 (RELASZ) 5064 (bytes) 2025-05-07T20:03:46.4239932Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:46.4240039Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:46.4240158Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:46.4240285Z 0x000000006ffffffe (VERNEED) 0xc8f8 2025-05-07T20:03:46.4240392Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:03:46.4240509Z 0x000000006ffffff0 (VERSYM) 0xc65e 2025-05-07T20:03:46.4240613Z 0x000000006ffffff9 (RELACOUNT) 10 2025-05-07T20:03:46.4240709Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:46.4240716Z 2025-05-07T20:03:46.4240841Z ################################################################################ 2025-05-07T20:03:46.4240848Z 2025-05-07T20:03:46.4240854Z 2025-05-07T20:03:46.4240962Z ################################################################################ 2025-05-07T20:03:46.4241221Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:03:46.4241527Z [CHECK] Listing out library size: 2025-05-07T20:03:46.4241774Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:03:46.4241779Z 2025-05-07T20:03:46.4247445Z 8 ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:03:46.4247523Z 2025-05-07T20:03:46.4248125Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:03:46.4250260Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:46.4250296Z 2025-05-07T20:03:46.4308716Z GLIBC_2.2.5 2025-05-07T20:03:46.4308988Z GLIBC_2.14 2025-05-07T20:03:46.4309320Z 2025-05-07T20:03:46.4309334Z 2025-05-07T20:03:46.4310283Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:03:46.4310802Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:46.4310807Z 2025-05-07T20:03:46.4368660Z GLIBCXX_3.4 2025-05-07T20:03:46.4368828Z GLIBCXX_3.4.9 2025-05-07T20:03:46.4368934Z GLIBCXX_3.4.20 2025-05-07T20:03:46.4369025Z GLIBCXX_3.4.21 2025-05-07T20:03:46.4372733Z 2025-05-07T20:03:46.4372747Z 2025-05-07T20:03:46.4393144Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so > /tmp/tmp.1YEsZJmqPB.symbols.txt 2025-05-07T20:03:46.4393165Z 2025-05-07T20:03:46.4421701Z 2025-05-07T20:03:46.4447042Z [CHECK] Total Number of symbols: 515 2025-05-07T20:03:46.4461399Z [CHECK] Number of fbgemm symbols: 12 2025-05-07T20:03:46.4474703Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so > /tmp/tmp.zYgnceXUMg.usymbols.txt 2025-05-07T20:03:46.4474759Z 2025-05-07T20:03:46.4491945Z 2025-05-07T20:03:46.4515984Z [CHECK] Listing out undefined symbols (161 total): 2025-05-07T20:03:46.4533827Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:46.4534320Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:46.4534791Z U __cudaPopCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:46.4535221Z U __cudaPushCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:46.4535606Z U __cudaRegisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:46.4536007Z U __cudaRegisterFatBinaryEnd@libcudart.so.11.0 2025-05-07T20:03:46.4536417Z U __cudaRegisterFunction@libcudart.so.11.0 2025-05-07T20:03:46.4536767Z U __cudaRegisterVar@libcudart.so.11.0 2025-05-07T20:03:46.4537178Z U __cudaUnregisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:46.4537543Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:03:46.4537836Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:03:46.4538147Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:46.4538478Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:03:46.4538774Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:03:46.4539079Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:03:46.4539453Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:03:46.4539579Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:03:46.4539681Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:03:46.4539794Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:46.4540062Z U at::_ops::add_Tensor::call(at::Tensor const&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:03:46.4540227Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:03:46.4542236Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:46.4542896Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:46.4543063Z U at::_ops::fill__Scalar::call(at::Tensor&, c10::Scalar const&) 2025-05-07T20:03:46.4543256Z U at::_ops::mul_Scalar::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:03:46.4543425Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:03:46.4543637Z U at::_ops::sub__Tensor::call(at::Tensor&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:03:46.4543852Z U at::_ops::zero_::call(at::Tensor&) 2025-05-07T20:03:46.4544303Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:46.4544851Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:46.4544973Z U c10::BoolType::get() 2025-05-07T20:03:46.4545135Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:03:46.4545275Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:03:46.4545405Z U c10::IntType::get() 2025-05-07T20:03:46.4545568Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:03:46.4545696Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:03:46.4545937Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:46.4546090Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:46.4546235Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:03:46.4546639Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:03:46.4546772Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:03:46.4546888Z U c10::SymInt::promote_to_negative() 2025-05-07T20:03:46.4547022Z U c10::SymInt::toSymNode() const 2025-05-07T20:03:46.4547129Z U c10::SymIntType::get() 2025-05-07T20:03:46.4547276Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:03:46.4547425Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:46.4547546Z U c10::TensorType::get() 2025-05-07T20:03:46.4547667Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:03:46.4548337Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:03:46.4548488Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:03:46.4548605Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:03:46.4548726Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:03:46.4548862Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:03:46.4548978Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:03:46.4549087Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:03:46.4549347Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:03:46.4549458Z U c10::cuda::current_device() 2025-05-07T20:03:46.4549566Z U c10::cuda::device_count() 2025-05-07T20:03:46.4549767Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:03:46.4549900Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:03:46.4550039Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:03:46.4550193Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:03:46.4550346Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:03:46.4550461Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:03:46.4550967Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:46.4551305Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:03:46.4551990Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:46.4552365Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:03:46.4552953Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:46.4553077Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:03:46.4553217Z U c10::impl::GPUTrace::haveState 2025-05-07T20:03:46.4553376Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:03:46.4553553Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:03:46.4553711Z U c10::impl::device_guard_impl_registry 2025-05-07T20:03:46.4553864Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:03:46.4554012Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:03:46.4554159Z U c10::throwNullDataPtrError() 2025-05-07T20:03:46.4554271Z U c10::warn(c10::Warning const&) 2025-05-07T20:03:46.4554391Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:03:46.4554618Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:03:46.4554743Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:03:46.4554884Z U cudaDeviceGetAttribute@libcudart.so.11.0 2025-05-07T20:03:46.4555046Z U cudaDeviceSynchronize@libcudart.so.11.0 2025-05-07T20:03:46.4555189Z U cudaEventCreateWithFlags@libcudart.so.11.0 2025-05-07T20:03:46.4555319Z U cudaEventDestroy@libcudart.so.11.0 2025-05-07T20:03:46.4555455Z U cudaEventElapsedTime@libcudart.so.11.0 2025-05-07T20:03:46.4555595Z U cudaEventQuery@libcudart.so.11.0 2025-05-07T20:03:46.4555723Z U cudaEventRecord@libcudart.so.11.0 2025-05-07T20:03:46.4555856Z U cudaEventSynchronize@libcudart.so.11.0 2025-05-07T20:03:46.4556016Z U cudaFuncGetAttributes@libcudart.so.11.0 2025-05-07T20:03:46.4556135Z U cudaGetDevice@libcudart.so.11.0 2025-05-07T20:03:46.4556266Z U cudaGetDeviceCount@libcudart.so.11.0 2025-05-07T20:03:46.4556430Z U cudaGetDeviceProperties@libcudart.so.11.0 2025-05-07T20:03:46.4556558Z U cudaGetErrorString@libcudart.so.11.0 2025-05-07T20:03:46.4556685Z U cudaGetLastError@libcudart.so.11.0 2025-05-07T20:03:46.4556808Z U cudaLaunchKernel@libcudart.so.11.0 2025-05-07T20:03:46.4556958Z U cudaMemsetAsync@libcudart.so.11.0 2025-05-07T20:03:46.4557254Z U cudaOccupancyMaxActiveBlocksPerMultiprocessorWithFlags@libcudart.so.11.0 2025-05-07T20:03:46.4557386Z U cudaPeekAtLastError@libcudart.so.11.0 2025-05-07T20:03:46.4557572Z U cudaSetDevice@libcudart.so.11.0 2025-05-07T20:03:46.4557694Z U cudaStreamQuery@libcudart.so.11.0 2025-05-07T20:03:46.4557827Z U cudaStreamSynchronize@libcudart.so.11.0 2025-05-07T20:03:46.4557977Z U cudaStreamWaitEvent@libcudart.so.11.0 2025-05-07T20:03:46.4558113Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:03:46.4558261Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:46.4558364Z U log2@GLIBC_2.2.5 2025-05-07T20:03:46.4558571Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:03:46.4558730Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:03:46.4558905Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:46.4559031Z U memcpy@GLIBC_2.14 2025-05-07T20:03:46.4559139Z U memset@GLIBC_2.2.5 2025-05-07T20:03:46.4559299Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:03:46.4559454Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:46.4559556Z U printf@GLIBC_2.2.5 2025-05-07T20:03:46.4559654Z U puts@GLIBC_2.2.5 2025-05-07T20:03:46.4560027Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:03:46.4560426Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:46.4560829Z U std::__cxx11::basic_string, std::allocator >::_M_append(char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:46.4561397Z U std::__cxx11::basic_string, std::allocator >::_M_assign(std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:46.4561792Z U std::__cxx11::basic_string, std::allocator >::_M_construct(unsigned long, char)@GLIBCXX_3.4.21 2025-05-07T20:03:46.4562204Z U std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:46.4562684Z U std::__cxx11::basic_string, std::allocator >::_M_mutate(unsigned long, unsigned long, char const*, unsigned long) 2025-05-07T20:03:46.4563205Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:46.4563745Z U std::__cxx11::basic_string, std::allocator >::replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:46.4564182Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const@GLIBCXX_3.4.21 2025-05-07T20:03:46.4564324Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:46.4564477Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:46.4564643Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:03:46.4564875Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:03:46.4565227Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:03:46.4565772Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:03:46.4565937Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:03:46.4566078Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:03:46.4566196Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:03:46.4566310Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:03:46.4566443Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:03:46.4566557Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:03:46.4566667Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:03:46.4566866Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:03:46.4567128Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:03:46.4567270Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:03:46.4567401Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:46.4567499Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:46.4567620Z U torch::CppFunction::~CppFunction() 2025-05-07T20:03:46.4568195Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:03:46.4568626Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:03:46.4568868Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:03:46.4569233Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:03:46.4569360Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:03:46.4569505Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:46.4569853Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:03:46.4570012Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:46.4570358Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:46.4570702Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:46.4570905Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:03:46.4571130Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:03:46.4571270Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:46.4571382Z w _ITM_registerTMCloneTable 2025-05-07T20:03:46.4571493Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:46.4571610Z w __gmon_start__ 2025-05-07T20:03:46.4571717Z w __pthread_key_create 2025-05-07T20:03:46.4571871Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:46.4572092Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:03:46.4572098Z 2025-05-07T20:03:46.4572325Z linux-vdso.so.1 (0x00007ffc86154000) 2025-05-07T20:03:46.4572449Z libtorch.so => not found 2025-05-07T20:03:46.4572568Z libc10.so => not found 2025-05-07T20:03:46.4572803Z libc10_cuda.so => not found 2025-05-07T20:03:46.4573297Z libtorch_cpu.so => not found 2025-05-07T20:03:46.4573409Z libtorch_cuda.so => not found 2025-05-07T20:03:46.4573526Z libcudart.so.11.0 => not found 2025-05-07T20:03:46.4573762Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f1395d9c000) 2025-05-07T20:03:46.4573925Z libm.so.6 => /lib64/libm.so.6 (0x00007f139691d000) 2025-05-07T20:03:46.4574088Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f13968ef000) 2025-05-07T20:03:46.4574209Z libc.so.6 => /lib64/libc.so.6 (0x00007f1395b94000) 2025-05-07T20:03:46.4574517Z /lib64/ld-linux-x86-64.so.2 (0x00007f13969fe000) 2025-05-07T20:03:46.4574535Z 2025-05-07T20:03:46.4574657Z [CHECK] Displaying ELF information: 2025-05-07T20:03:46.4574888Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:03:46.4574894Z 2025-05-07T20:03:46.4608150Z 2025-05-07T20:03:46.4608481Z Dynamic section at offset 0x7f4140 contains 37 entries: 2025-05-07T20:03:46.4608636Z Tag Type Name/Value 2025-05-07T20:03:46.4608865Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:46.4609088Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:03:46.4609556Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:03:46.4609757Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:46.4609960Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:46.4610187Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.11.0] 2025-05-07T20:03:46.4610385Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:46.4610575Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:03:46.4610781Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:46.4610965Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:46.4611184Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_utils.so] 2025-05-07T20:03:46.4611299Z 0x000000000000000c (INIT) 0x14000 2025-05-07T20:03:46.4611425Z 0x000000000000000d (FINI) 0x75aac 2025-05-07T20:03:46.4611544Z 0x0000000000000019 (INIT_ARRAY) 0x7f4ca0 2025-05-07T20:03:46.4611666Z 0x000000000000001b (INIT_ARRAYSZ) 96 (bytes) 2025-05-07T20:03:46.4611795Z 0x000000000000001a (FINI_ARRAY) 0x7f4d00 2025-05-07T20:03:46.4611919Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:46.4612024Z 0x0000000000000004 (HASH) 0x200 2025-05-07T20:03:46.4612152Z 0x000000006ffffef5 (GNU_HASH) 0x19c0 2025-05-07T20:03:46.4612259Z 0x0000000000000005 (STRTAB) 0x5aa0 2025-05-07T20:03:46.4612365Z 0x0000000000000006 (SYMTAB) 0x2a40 2025-05-07T20:03:46.4612494Z 0x000000000000000a (STRSZ) 44012 (bytes) 2025-05-07T20:03:46.4612628Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:46.4612743Z 0x0000000000000003 (PLTGOT) 0x7f53e0 2025-05-07T20:03:46.4612873Z 0x0000000000000002 (PLTRELSZ) 5352 (bytes) 2025-05-07T20:03:46.4612996Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:46.4613109Z 0x0000000000000017 (JMPREL) 0x12848 2025-05-07T20:03:46.4613216Z 0x0000000000000007 (RELA) 0x10b98 2025-05-07T20:03:46.4626851Z 0x0000000000000008 (RELASZ) 7344 (bytes) 2025-05-07T20:03:46.4627025Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:46.4627121Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:46.4627241Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:46.4627363Z 0x000000006ffffffe (VERNEED) 0x10a98 2025-05-07T20:03:46.4627462Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:03:46.4627568Z 0x000000006ffffff0 (VERSYM) 0x1068c 2025-05-07T20:03:46.4627680Z 0x000000006ffffff9 (RELACOUNT) 26 2025-05-07T20:03:46.4627773Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:46.4627795Z 2025-05-07T20:03:46.4627909Z ################################################################################ 2025-05-07T20:03:46.4627916Z 2025-05-07T20:03:46.4627920Z 2025-05-07T20:03:46.4628039Z ################################################################################ 2025-05-07T20:03:46.4628461Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:03:46.4628558Z [CHECK] Listing out library size: 2025-05-07T20:03:46.4628850Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:03:46.4628855Z 2025-05-07T20:03:46.4632936Z 1 ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:03:46.4632942Z 2025-05-07T20:03:46.4633359Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:03:46.4633887Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:46.4633983Z 2025-05-07T20:03:46.4680692Z GLIBC_2.2.5 2025-05-07T20:03:46.4680809Z GLIBC_2.3 2025-05-07T20:03:46.4680890Z GLIBC_2.14 2025-05-07T20:03:46.4682271Z 2025-05-07T20:03:46.4682380Z 2025-05-07T20:03:46.4683002Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:03:46.4683558Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:46.4683564Z 2025-05-07T20:03:46.4741648Z GLIBCXX_3.4 2025-05-07T20:03:46.4741743Z GLIBCXX_3.4.9 2025-05-07T20:03:46.4741849Z GLIBCXX_3.4.21 2025-05-07T20:03:46.4741856Z 2025-05-07T20:03:46.4741861Z 2025-05-07T20:03:46.4762958Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so > /tmp/tmp.8KAROwxleC.symbols.txt 2025-05-07T20:03:46.4762996Z 2025-05-07T20:03:46.4792762Z 2025-05-07T20:03:46.4821808Z [CHECK] Total Number of symbols: 326 2025-05-07T20:03:46.4834181Z [CHECK] Number of fbgemm symbols: 56 2025-05-07T20:03:46.4853071Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so > /tmp/tmp.jvJ3UQ6pRt.usymbols.txt 2025-05-07T20:03:46.4853112Z 2025-05-07T20:03:46.4868890Z 2025-05-07T20:03:46.4893402Z [CHECK] Listing out undefined symbols (147 total): 2025-05-07T20:03:46.4909722Z U GOMP_parallel 2025-05-07T20:03:46.4910824Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:46.4911469Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:46.4911925Z U __cudaPopCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:46.4912378Z U __cudaPushCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:46.4912762Z U __cudaRegisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:46.4913195Z U __cudaRegisterFatBinaryEnd@libcudart.so.11.0 2025-05-07T20:03:46.4913581Z U __cudaRegisterFunction@libcudart.so.11.0 2025-05-07T20:03:46.4913948Z U __cudaRegisterVar@libcudart.so.11.0 2025-05-07T20:03:46.4914349Z U __cudaUnregisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:46.4914653Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:03:46.4914985Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:46.4915279Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:03:46.4915568Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:03:46.4915902Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:03:46.4916202Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:03:46.4916513Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:46.4916768Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:03:46.4916991Z U at::_ops::contiguous::call(at::Tensor const&, c10::MemoryFormat) 2025-05-07T20:03:46.4917594Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:46.4918638Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:46.4918818Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:03:46.4918932Z U at::_ops::zero_::call(at::Tensor&) 2025-05-07T20:03:46.4919408Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:46.4919515Z U at::get_num_threads() 2025-05-07T20:03:46.4919658Z U at::get_thread_num() 2025-05-07T20:03:46.4919837Z U at::in_parallel_region() 2025-05-07T20:03:46.4919938Z U at::init_num_threads() 2025-05-07T20:03:46.4920053Z U at::internal::set_thread_num(int) 2025-05-07T20:03:46.4920619Z U at::native::empty_like(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:46.4920879Z U at::native::resize_(at::Tensor const&, c10::ArrayRef, std::optional) 2025-05-07T20:03:46.4921050Z U c10::BFloat16* at::TensorBase::data_ptr() const 2025-05-07T20:03:46.4921229Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:03:46.4921380Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:03:46.4921522Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:03:46.4921680Z U c10::ParallelGuard::ParallelGuard(bool) 2025-05-07T20:03:46.4921813Z U c10::ParallelGuard::~ParallelGuard() 2025-05-07T20:03:46.4921968Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:46.4922112Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:03:46.4922289Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:03:46.4922440Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:46.4922546Z U c10::TensorType::get() 2025-05-07T20:03:46.4922690Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:03:46.4923354Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:03:46.4923487Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:03:46.4923627Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:03:46.4923739Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:03:46.4923844Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:03:46.4923974Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:03:46.4924079Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:03:46.4924319Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:03:46.4924432Z U c10::cuda::device_count() 2025-05-07T20:03:46.4924558Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:03:46.4924681Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:03:46.4924826Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:03:46.4924955Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:03:46.4925101Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:03:46.4925217Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:03:46.4925737Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:46.4925973Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:03:46.4926449Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:46.4926761Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:03:46.4926878Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:03:46.4927033Z U c10::impl::GPUTrace::haveState 2025-05-07T20:03:46.4927206Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:03:46.4927368Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:03:46.4927514Z U c10::impl::device_guard_impl_registry 2025-05-07T20:03:46.4927650Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:03:46.4927784Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:03:46.4927917Z U c10::throwNullDataPtrError() 2025-05-07T20:03:46.4928023Z U c10::warn(c10::Warning const&) 2025-05-07T20:03:46.4928132Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:03:46.4928325Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:03:46.4928431Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:03:46.4928561Z U cudaDeviceGetAttribute@libcudart.so.11.0 2025-05-07T20:03:46.4928685Z U cudaDeviceSynchronize@libcudart.so.11.0 2025-05-07T20:03:46.4928835Z U cudaEventCreateWithFlags@libcudart.so.11.0 2025-05-07T20:03:46.4928951Z U cudaEventDestroy@libcudart.so.11.0 2025-05-07T20:03:46.4929072Z U cudaEventElapsedTime@libcudart.so.11.0 2025-05-07T20:03:46.4929203Z U cudaEventQuery@libcudart.so.11.0 2025-05-07T20:03:46.4929321Z U cudaEventRecord@libcudart.so.11.0 2025-05-07T20:03:46.4929437Z U cudaEventSynchronize@libcudart.so.11.0 2025-05-07T20:03:46.4929582Z U cudaFuncGetAttributes@libcudart.so.11.0 2025-05-07T20:03:46.4929692Z U cudaGetDevice@libcudart.so.11.0 2025-05-07T20:03:46.4929801Z U cudaGetDeviceCount@libcudart.so.11.0 2025-05-07T20:03:46.4929912Z U cudaGetErrorString@libcudart.so.11.0 2025-05-07T20:03:46.4930045Z U cudaGetLastError@libcudart.so.11.0 2025-05-07T20:03:46.4930157Z U cudaLaunchKernel@libcudart.so.11.0 2025-05-07T20:03:46.4930429Z U cudaOccupancyMaxActiveBlocksPerMultiprocessorWithFlags@libcudart.so.11.0 2025-05-07T20:03:46.4930567Z U cudaPeekAtLastError@libcudart.so.11.0 2025-05-07T20:03:46.4930671Z U cudaSetDevice@libcudart.so.11.0 2025-05-07T20:03:46.4930782Z U cudaStreamQuery@libcudart.so.11.0 2025-05-07T20:03:46.4930912Z U cudaStreamSynchronize@libcudart.so.11.0 2025-05-07T20:03:46.4931026Z U cudaStreamWaitEvent@libcudart.so.11.0 2025-05-07T20:03:46.4931161Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:03:46.4931277Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:03:46.4931450Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:03:46.4931583Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:03:46.4931678Z U memcpy@GLIBC_2.14 2025-05-07T20:03:46.4931782Z U memset@GLIBC_2.2.5 2025-05-07T20:03:46.4931875Z U omp_get_num_threads 2025-05-07T20:03:46.4931962Z U omp_get_thread_num 2025-05-07T20:03:46.4932198Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:03:46.4932317Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:46.4932408Z U printf@GLIBC_2.2.5 2025-05-07T20:03:46.4932499Z U puts@GLIBC_2.2.5 2025-05-07T20:03:46.4932834Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:03:46.4933199Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:46.4933717Z U std::__cxx11::basic_string, std::allocator >::_M_assign(std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:46.4934145Z U std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:46.4934622Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:46.4934949Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const@GLIBCXX_3.4.21 2025-05-07T20:03:46.4935077Z U std::__exception_ptr::exception_ptr::_M_addref() 2025-05-07T20:03:46.4935210Z U std::__exception_ptr::exception_ptr::_M_release() 2025-05-07T20:03:46.4935360Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:46.4935489Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:46.4935707Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:03:46.4936049Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:03:46.4936590Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:03:46.4936715Z U std::current_exception()@CXXABI_1.3.3 2025-05-07T20:03:46.4936829Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:03:46.4936949Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:03:46.4937057Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:03:46.4937176Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:03:46.4937286Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:03:46.4937396Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:03:46.4937576Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:03:46.4937785Z U std::rethrow_exception(std::__exception_ptr::exception_ptr)@CXXABI_1.3.3 2025-05-07T20:03:46.4937888Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:46.4937988Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:46.4938105Z U torch::CppFunction::~CppFunction() 2025-05-07T20:03:46.4938651Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:03:46.4939093Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:03:46.4939337Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:03:46.4939714Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:03:46.4939868Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:46.4940018Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:03:46.4940162Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:46.4940502Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:46.4940804Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:46.4941032Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:03:46.4941265Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:03:46.4941368Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:46.4941476Z w _ITM_registerTMCloneTable 2025-05-07T20:03:46.4941591Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:46.4941674Z w __gmon_start__ 2025-05-07T20:03:46.4941768Z w __pthread_key_create 2025-05-07T20:03:46.4941913Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:46.4942137Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:03:46.4942143Z 2025-05-07T20:03:46.4950791Z linux-vdso.so.1 (0x00007fffa8d9e000) 2025-05-07T20:03:46.4951177Z libc10.so => not found 2025-05-07T20:03:46.4952015Z libc10_cuda.so => not found 2025-05-07T20:03:46.4952152Z libtorch.so => not found 2025-05-07T20:03:46.4952293Z libtorch_cpu.so => not found 2025-05-07T20:03:46.4952419Z libtorch_cuda.so => not found 2025-05-07T20:03:46.4952525Z libcudart.so.11.0 => not found 2025-05-07T20:03:46.4952749Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007fb4f84c4000) 2025-05-07T20:03:46.4952993Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fb4f8496000) 2025-05-07T20:03:46.4953122Z libc.so.6 => /lib64/libc.so.6 (0x00007fb4f828e000) 2025-05-07T20:03:46.4953273Z /lib64/ld-linux-x86-64.so.2 (0x00007fb4f8831000) 2025-05-07T20:03:46.4953397Z libm.so.6 => /lib64/libm.so.6 (0x00007fb4f81b3000) 2025-05-07T20:03:46.4953402Z 2025-05-07T20:03:46.4953516Z [CHECK] Displaying ELF information: 2025-05-07T20:03:46.4953808Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:03:46.4953814Z 2025-05-07T20:03:46.4982953Z 2025-05-07T20:03:46.4983237Z Dynamic section at offset 0xcc670 contains 38 entries: 2025-05-07T20:03:46.4983371Z Tag Type Name/Value 2025-05-07T20:03:46.4983663Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:03:46.4983884Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:03:46.4984106Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:46.4984341Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:46.4984544Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:46.4984752Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.11.0] 2025-05-07T20:03:46.4984968Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:46.4985222Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:46.4985412Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:46.4985645Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:03:46.4985899Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_sparse_async_cumsum.so] 2025-05-07T20:03:46.4986084Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:03:46.4986199Z 0x000000000000000c (INIT) 0xc000 2025-05-07T20:03:46.4986481Z 0x000000000000000d (FINI) 0x23d1c 2025-05-07T20:03:46.4986616Z 0x0000000000000019 (INIT_ARRAY) 0xcc2c0 2025-05-07T20:03:46.4986761Z 0x000000000000001b (INIT_ARRAYSZ) 32 (bytes) 2025-05-07T20:03:46.4986867Z 0x000000000000001a (FINI_ARRAY) 0xcc2e0 2025-05-07T20:03:46.4986998Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:46.4987109Z 0x0000000000000004 (HASH) 0x238 2025-05-07T20:03:46.4987237Z 0x000000006ffffef5 (GNU_HASH) 0xff8 2025-05-07T20:03:46.4987345Z 0x0000000000000005 (STRTAB) 0x3788 2025-05-07T20:03:46.4987453Z 0x0000000000000006 (SYMTAB) 0x18e0 2025-05-07T20:03:46.4987631Z 0x000000000000000a (STRSZ) 24640 (bytes) 2025-05-07T20:03:46.4987805Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:46.4987925Z 0x0000000000000003 (PLTGOT) 0xcc910 2025-05-07T20:03:46.4988059Z 0x0000000000000002 (PLTRELSZ) 3912 (bytes) 2025-05-07T20:03:46.4988189Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:46.4988301Z 0x0000000000000017 (JMPREL) 0xab30 2025-05-07T20:03:46.4988406Z 0x0000000000000007 (RELA) 0x9b58 2025-05-07T20:03:46.4988553Z 0x0000000000000008 (RELASZ) 4056 (bytes) 2025-05-07T20:03:46.4988673Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:46.4988776Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:46.4988903Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:46.4989039Z 0x000000006ffffffe (VERNEED) 0x9a58 2025-05-07T20:03:46.4989148Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:03:46.4989260Z 0x000000006ffffff0 (VERSYM) 0x97c8 2025-05-07T20:03:46.4989385Z 0x000000006ffffff9 (RELACOUNT) 7 2025-05-07T20:03:46.4989490Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:46.4989495Z 2025-05-07T20:03:46.4989617Z ################################################################################ 2025-05-07T20:03:46.4989623Z 2025-05-07T20:03:46.4989627Z 2025-05-07T20:03:46.4989755Z ################################################################################ 2025-05-07T20:03:46.4990081Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:03:46.4990187Z [CHECK] Listing out library size: 2025-05-07T20:03:46.4990709Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:03:46.4990714Z 2025-05-07T20:03:46.4999122Z 8 ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:03:46.4999981Z 2025-05-07T20:03:46.5000478Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:03:46.5001055Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:46.5001060Z 2025-05-07T20:03:46.5433478Z GLIBC_2.2.5 2025-05-07T20:03:46.5433717Z GLIBC_2.3 2025-05-07T20:03:46.5433992Z GLIBC_2.14 2025-05-07T20:03:46.5437019Z 2025-05-07T20:03:46.5437034Z 2025-05-07T20:03:46.5438060Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:03:46.5438656Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:46.5438664Z 2025-05-07T20:03:46.5868157Z GLIBCXX_3.4 2025-05-07T20:03:46.5868267Z GLIBCXX_3.4.9 2025-05-07T20:03:46.5868388Z GLIBCXX_3.4.11 2025-05-07T20:03:46.5868479Z GLIBCXX_3.4.15 2025-05-07T20:03:46.5868566Z GLIBCXX_3.4.18 2025-05-07T20:03:46.5868654Z GLIBCXX_3.4.20 2025-05-07T20:03:46.5868760Z GLIBCXX_3.4.21 2025-05-07T20:03:46.5871581Z 2025-05-07T20:03:46.5871586Z 2025-05-07T20:03:46.5895021Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so > /tmp/tmp.syBFtQfCGh.symbols.txt 2025-05-07T20:03:46.5895042Z 2025-05-07T20:03:46.6281583Z 2025-05-07T20:03:46.6313009Z [CHECK] Total Number of symbols: 4263 2025-05-07T20:03:46.6341778Z [CHECK] Number of fbgemm symbols: 4 2025-05-07T20:03:46.6355675Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so > /tmp/tmp.xTAjmVaN00.usymbols.txt 2025-05-07T20:03:46.6355691Z 2025-05-07T20:03:46.6393161Z 2025-05-07T20:03:46.6419760Z [CHECK] Listing out undefined symbols (198 total): 2025-05-07T20:03:46.6435621Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:46.6436467Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:46.6436820Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:03:46.6437114Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:03:46.6437461Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:46.6437760Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:03:46.6438069Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:03:46.6438387Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:03:46.6438686Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:03:46.6438975Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:03:46.6439397Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:03:46.6439670Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:03:46.6439999Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:46.6440281Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:03:46.6440795Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:03:46.6441181Z U at::RecordFunction::currentThreadId() 2025-05-07T20:03:46.6441429Z U at::RecordFunction::end() 2025-05-07T20:03:46.6441558Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:03:46.6441700Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:03:46.6442003Z U at::Tensor fbgemm_gpu::reshape_vbe_offsets(at::Tensor const&, at::Tensor const&, long, int) 2025-05-07T20:03:46.6442298Z U at::Tensor fbgemm_gpu::reshape_vbe_offsets(at::Tensor const&, at::Tensor const&, long, int) 2025-05-07T20:03:46.6445201Z U at::_ops::clamp::call(at::Tensor const&, std::optional const&, std::optional const&) 2025-05-07T20:03:46.6445454Z U at::_ops::clone::call(at::Tensor const&, std::optional) 2025-05-07T20:03:46.6446077Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:46.6446257Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:03:46.6446438Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:03:46.6446563Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:03:46.6446709Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:03:46.6446851Z U at::sequence_number::get_and_increment() 2025-05-07T20:03:46.6446949Z U c10::AnyType::get() 2025-05-07T20:03:46.6447045Z U c10::BoolType::get() 2025-05-07T20:03:46.6447257Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:03:46.6447438Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:03:46.6447550Z U c10::Dispatcher::realSingleton() 2025-05-07T20:03:46.6448117Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:03:46.6448740Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:03:46.6449086Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:03:46.6449208Z U c10::Error::what() const 2025-05-07T20:03:46.6449311Z U c10::FloatType::get() 2025-05-07T20:03:46.6449417Z U c10::GradMode::is_enabled() 2025-05-07T20:03:46.6449567Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:03:46.6449754Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:03:46.6449872Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:03:46.6449981Z U c10::IValue::isBoolList() const 2025-05-07T20:03:46.6450115Z U c10::IValue::isDoubleList() const 2025-05-07T20:03:46.6450219Z U c10::IValue::isIntList() const 2025-05-07T20:03:46.6450326Z U c10::IValue::isSymIntList() const 2025-05-07T20:03:46.6450446Z U c10::IValue::isTensorList() const 2025-05-07T20:03:46.6450582Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:03:46.6450677Z U c10::IntType::get() 2025-05-07T20:03:46.6451144Z U c10::ListType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:46.6451308Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:03:46.6451423Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:03:46.6451567Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:03:46.6451692Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:03:46.6451900Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:46.6452186Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:03:46.6452291Z U c10::StringType::get() 2025-05-07T20:03:46.6452423Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:03:46.6452577Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:03:46.6452763Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:03:46.6452913Z U c10::SymFloat::operator/(c10::SymFloat const&) const 2025-05-07T20:03:46.6453311Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:03:46.6453448Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:03:46.6453573Z U c10::SymInt::operator c10::SymFloat() const 2025-05-07T20:03:46.6453723Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:03:46.6453834Z U c10::SymInt::promote_to_negative() 2025-05-07T20:03:46.6453958Z U c10::SymInt::sym_le(c10::SymInt const&) const 2025-05-07T20:03:46.6454085Z U c10::SymInt::toSymNode() const 2025-05-07T20:03:46.6454348Z U c10::SymIntType::get() 2025-05-07T20:03:46.6454475Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:03:46.6454581Z U c10::TensorType::get() 2025-05-07T20:03:46.6454729Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:03:46.6455149Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:46.6455706Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:46.6455961Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:03:46.6456446Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:46.6456799Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:03:46.6457376Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:46.6457734Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:03:46.6457937Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:03:46.6458063Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:03:46.6458212Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:03:46.6458600Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:03:46.6458723Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:03:46.6458885Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:03:46.6459059Z U c10::operator<<(std::ostream&, c10::SymFloat const&) 2025-05-07T20:03:46.6459205Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:03:46.6459406Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:03:46.6459543Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:03:46.6459802Z U fbgemm_gpu::config::is_feature_enabled(fbgemm_gpu::config::FeatureGateName const&) 2025-05-07T20:03:46.6460086Z U fbgemm_gpu::reshape_vbe_output(at::Tensor const&, long, at::Tensor const&, at::Tensor const&) 2025-05-07T20:03:46.6460204Z U free@GLIBC_2.2.5 2025-05-07T20:03:46.6460380Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:03:46.6460484Z U memcmp@GLIBC_2.2.5 2025-05-07T20:03:46.6460604Z U memcpy@GLIBC_2.14 2025-05-07T20:03:46.6460734Z U memmove@GLIBC_2.2.5 2025-05-07T20:03:46.6460835Z U memset@GLIBC_2.2.5 2025-05-07T20:03:46.6460992Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:03:46.6461143Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:46.6461248Z U realloc@GLIBC_2.2.5 2025-05-07T20:03:46.6461463Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:03:46.6461817Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:03:46.6462203Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:46.6462597Z U std::__cxx11::basic_string, std::allocator >::_M_append(char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:46.6463151Z U std::__cxx11::basic_string, std::allocator >::_M_assign(std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:46.6463536Z U std::__cxx11::basic_string, std::allocator >::_M_construct(unsigned long, char)@GLIBCXX_3.4.21 2025-05-07T20:03:46.6464131Z U std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:46.6464646Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:46.6465006Z U std::__cxx11::basic_string, std::allocator >::reserve(unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:46.6465587Z U std::__cxx11::basic_string, std::allocator >::swap(std::__cxx11::basic_string, std::allocator >&)@GLIBCXX_3.4.21 2025-05-07T20:03:46.6465960Z U std::__cxx11::basic_string, std::allocator >::~basic_string()@GLIBCXX_3.4.21 2025-05-07T20:03:46.6466315Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const@GLIBCXX_3.4.21 2025-05-07T20:03:46.6466644Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:03:46.6467025Z U std::__cxx11::basic_stringstream, std::allocator >::~basic_stringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:46.6467429Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:03:46.6467554Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:03:46.6467673Z U std::__throw_bad_array_new_length() 2025-05-07T20:03:46.6467844Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:46.6467994Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:46.6468171Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:03:46.6468331Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:03:46.6468470Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:03:46.6468714Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:03:46.6469085Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:03:46.6469699Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:03:46.6470220Z U std::basic_ostream >& std::operator<< >(std::basic_ostream >&, char const*)@GLIBCXX_3.4 2025-05-07T20:03:46.6470377Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:03:46.6470503Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:03:46.6470629Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:03:46.6470774Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:03:46.6470896Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:03:46.6471009Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:03:46.6471146Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:03:46.6471430Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:03:46.6471679Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:03:46.6471830Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:03:46.6472002Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:03:46.6472161Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:03:46.6472620Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:03:46.6472766Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:03:46.6472881Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:46.6473009Z U strcmp@GLIBC_2.2.5 2025-05-07T20:03:46.6473110Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:46.6473234Z U torch::CppFunction::~CppFunction() 2025-05-07T20:03:46.6473854Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:03:46.6474362Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:03:46.6474630Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:03:46.6474772Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:03:46.6475072Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:03:46.6475266Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:03:46.6475485Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:03:46.6475675Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:03:46.6476034Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:03:46.6476207Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:03:46.6476405Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:03:46.6476592Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:03:46.6476738Z U torch::autograd::Node::assign_parent() 2025-05-07T20:03:46.6476857Z U torch::autograd::Node::metadata() 2025-05-07T20:03:46.6476997Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:03:46.6477264Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:03:46.6477553Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:03:46.6477699Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:03:46.6477933Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:03:46.6478148Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:03:46.6480865Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:03:46.6481032Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:03:46.6481229Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:03:46.6481400Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:03:46.6482203Z U torch::autograd::profiler::record_function_enter_new(std::__cxx11::basic_string, std::allocator > const&, std::optional, std::allocator > > const&) 2025-05-07T20:03:46.6482381Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:03:46.6482802Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:03:46.6483185Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:03:46.6483335Z U typeinfo for c10::Error 2025-05-07T20:03:46.6483488Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:03:46.6483622Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:03:46.6483783Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:03:46.6483918Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:03:46.6484041Z U typeinfo for torch::autograd::Node 2025-05-07T20:03:46.6484214Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:46.6484383Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:03:46.6484549Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:03:46.6484734Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:46.6484841Z U vtable for c10::Error 2025-05-07T20:03:46.6485199Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:46.6485545Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:46.6485685Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:03:46.6485890Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:03:46.6486139Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:03:46.6486257Z U vtable for torch::autograd::Node 2025-05-07T20:03:46.6486464Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:03:46.6486587Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:46.6486718Z w _ITM_registerTMCloneTable 2025-05-07T20:03:46.6486851Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:46.6486947Z w __gmon_start__ 2025-05-07T20:03:46.6487066Z w __pthread_key_create 2025-05-07T20:03:46.6487182Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:03:46.6487299Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:03:46.6487465Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:46.6487720Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:03:46.6487728Z 2025-05-07T20:03:46.6487969Z linux-vdso.so.1 (0x00007fff7f342000) 2025-05-07T20:03:46.6489076Z libc10.so => not found 2025-05-07T20:03:46.6489983Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so (0x00007f3d14600000) 2025-05-07T20:03:46.6490116Z libtorch.so => not found 2025-05-07T20:03:46.6490970Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so (0x00007f3d151f8000) 2025-05-07T20:03:46.6491086Z libtorch_cpu.so => not found 2025-05-07T20:03:46.6491185Z libtorch_cuda.so => not found 2025-05-07T20:03:46.6491526Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f3d1439c000) 2025-05-07T20:03:46.6491708Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f3d151c8000) 2025-05-07T20:03:46.6491837Z libc.so.6 => /lib64/libc.so.6 (0x00007f3d14194000) 2025-05-07T20:03:46.6492034Z /lib64/ld-linux-x86-64.so.2 (0x00007f3d15207000) 2025-05-07T20:03:46.6492122Z libc10.so => not found 2025-05-07T20:03:46.6492217Z libc10_cuda.so => not found 2025-05-07T20:03:46.6492600Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so (0x00007f3d13c00000) 2025-05-07T20:03:46.6492702Z libtorch.so => not found 2025-05-07T20:03:46.6492798Z libtorch_cpu.so => not found 2025-05-07T20:03:46.6492993Z libtorch_cuda.so => not found 2025-05-07T20:03:46.6493170Z libcudart.so.11.0 => not found 2025-05-07T20:03:46.6493262Z libtorch.so => not found 2025-05-07T20:03:46.6493349Z libc10.so => not found 2025-05-07T20:03:46.6493464Z libtorch_cpu.so => not found 2025-05-07T20:03:46.6493559Z libtorch_cuda.so => not found 2025-05-07T20:03:46.6493684Z libm.so.6 => /lib64/libm.so.6 (0x00007f3d14925000) 2025-05-07T20:03:46.6493791Z libc10.so => not found 2025-05-07T20:03:46.6494140Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so (0x00007f3d15147000) 2025-05-07T20:03:46.6494229Z libtorch.so => not found 2025-05-07T20:03:46.6494322Z libtorch_cpu.so => not found 2025-05-07T20:03:46.6494433Z libtorch_cuda.so => not found 2025-05-07T20:03:46.6494528Z libtorch_cpu.so => not found 2025-05-07T20:03:46.6494623Z libtorch_cuda.so => not found 2025-05-07T20:03:46.6494734Z libtorch.so => not found 2025-05-07T20:03:46.6494748Z 2025-05-07T20:03:46.6494857Z [CHECK] Displaying ELF information: 2025-05-07T20:03:46.6495146Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:03:46.6495151Z 2025-05-07T20:03:46.6532433Z 2025-05-07T20:03:46.6533376Z Dynamic section at offset 0x721230 contains 38 entries: 2025-05-07T20:03:46.6533798Z Tag Type Name/Value 2025-05-07T20:03:46.6534365Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:03:46.6535044Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_common.so] 2025-05-07T20:03:46.6535606Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:46.6536217Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_config.so] 2025-05-07T20:03:46.6536813Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:46.6537755Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:46.6538349Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:46.6538938Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:46.6539477Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:46.6540085Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:03:46.6540900Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward_pt2.so] 2025-05-07T20:03:46.6541408Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:03:46.6541705Z 0x000000000000000c (INIT) 0x178000 2025-05-07T20:03:46.6541815Z 0x000000000000000d (FINI) 0x67a6cc 2025-05-07T20:03:46.6541939Z 0x0000000000000019 (INIT_ARRAY) 0x71cd78 2025-05-07T20:03:46.6542068Z 0x000000000000001b (INIT_ARRAYSZ) 256 (bytes) 2025-05-07T20:03:46.6542282Z 0x000000000000001a (FINI_ARRAY) 0x71ce78 2025-05-07T20:03:46.6542410Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:46.6542511Z 0x0000000000000004 (HASH) 0x200 2025-05-07T20:03:46.6542617Z 0x000000006ffffef5 (GNU_HASH) 0x6408 2025-05-07T20:03:46.6542733Z 0x0000000000000005 (STRTAB) 0x25358 2025-05-07T20:03:46.6542869Z 0x0000000000000006 (SYMTAB) 0xc398 2025-05-07T20:03:46.6543001Z 0x000000000000000a (STRSZ) 1180564 (bytes) 2025-05-07T20:03:46.6543111Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:46.6543234Z 0x0000000000000003 (PLTGOT) 0x7224d0 2025-05-07T20:03:46.6543359Z 0x0000000000000002 (PLTRELSZ) 20952 (bytes) 2025-05-07T20:03:46.6543460Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:46.6543583Z 0x0000000000000017 (JMPREL) 0x171e58 2025-05-07T20:03:46.6543689Z 0x0000000000000007 (RELA) 0x147960 2025-05-07T20:03:46.6543815Z 0x0000000000000008 (RELASZ) 173304 (bytes) 2025-05-07T20:03:46.6543984Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:46.6544099Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:46.6544217Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:46.6544329Z 0x000000006ffffffe (VERNEED) 0x147840 2025-05-07T20:03:46.6544452Z 0x000000006fffffff (VERNEEDNUM) 4 2025-05-07T20:03:46.6544562Z 0x000000006ffffff0 (VERSYM) 0x1456ec 2025-05-07T20:03:46.6544665Z 0x000000006ffffff9 (RELACOUNT) 34 2025-05-07T20:03:46.6544778Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:46.6544792Z 2025-05-07T20:03:46.6544901Z ################################################################################ 2025-05-07T20:03:46.6544906Z 2025-05-07T20:03:46.6544910Z 2025-05-07T20:03:46.6545017Z ################################################################################ 2025-05-07T20:03:46.6545289Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:03:46.6545391Z [CHECK] Listing out library size: 2025-05-07T20:03:46.6545642Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:03:46.6545646Z 2025-05-07T20:03:46.6553241Z 213 ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:03:46.6553247Z 2025-05-07T20:03:46.6554171Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:03:46.6554676Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_inference.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:46.6554683Z 2025-05-07T20:03:46.6946820Z GLIBC_2.2.5 2025-05-07T20:03:46.6947122Z GLIBC_2.3 2025-05-07T20:03:46.6947382Z GLIBC_2.14 2025-05-07T20:03:46.6947916Z 2025-05-07T20:03:46.6947936Z 2025-05-07T20:03:46.6949464Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:03:46.6950526Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_inference.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:46.6951108Z 2025-05-07T20:03:46.7331711Z GLIBCXX_3.4 2025-05-07T20:03:46.7332396Z GLIBCXX_3.4.9 2025-05-07T20:03:46.7333014Z GLIBCXX_3.4.11 2025-05-07T20:03:46.7333622Z GLIBCXX_3.4.14 2025-05-07T20:03:46.7334180Z GLIBCXX_3.4.18 2025-05-07T20:03:46.7334761Z GLIBCXX_3.4.20 2025-05-07T20:03:46.7335321Z GLIBCXX_3.4.21 2025-05-07T20:03:46.7335693Z 2025-05-07T20:03:46.7335706Z 2025-05-07T20:03:46.7352617Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_inference.so > /tmp/tmp.E3wpCzdhm8.symbols.txt 2025-05-07T20:03:46.7353132Z 2025-05-07T20:03:46.7700323Z 2025-05-07T20:03:46.7728875Z [CHECK] Total Number of symbols: 4944 2025-05-07T20:03:46.7753408Z [CHECK] Number of fbgemm symbols: 3788 2025-05-07T20:03:46.7770794Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_inference.so > /tmp/tmp.jPT8JQAn0M.usymbols.txt 2025-05-07T20:03:46.7771443Z 2025-05-07T20:03:46.7803803Z 2025-05-07T20:03:46.7829019Z [CHECK] Listing out undefined symbols (265 total): 2025-05-07T20:03:46.7842794Z U GOMP_parallel 2025-05-07T20:03:46.7844815Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:46.7847132Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:46.7848707Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:46.7849736Z U __cudaPopCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:46.7850326Z U __cudaPushCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:46.7850726Z U __cudaRegisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:46.7851301Z U __cudaRegisterFatBinaryEnd@libcudart.so.11.0 2025-05-07T20:03:46.7851743Z U __cudaRegisterFunction@libcudart.so.11.0 2025-05-07T20:03:46.7852101Z U __cudaRegisterVar@libcudart.so.11.0 2025-05-07T20:03:46.7852449Z U __cudaUnregisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:46.7852822Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:03:46.7853133Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:03:46.7853456Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:46.7853782Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:03:46.7854093Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:03:46.7854433Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:03:46.7854742Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:03:46.7855079Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:03:46.7855395Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:03:46.7855718Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:03:46.7856029Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:46.7856364Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:03:46.7856768Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:03:46.7857579Z U at::_ops::arange_start::call(c10::Scalar const&, c10::Scalar const&, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:46.7858757Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:46.7860122Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:46.7861017Z U at::_ops::fill__Scalar::call(at::Tensor&, c10::Scalar const&) 2025-05-07T20:03:46.7861762Z U at::_ops::scalar_tensor::call(c10::Scalar const&, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:46.7862509Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:03:46.7863073Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:03:46.7864275Z U at::_ops::to_dtype_layout::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, bool, bool, std::optional) 2025-05-07T20:03:46.7865498Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:46.7866178Z U at::detail::getCUDAHooks() 2025-05-07T20:03:46.7866524Z U at::detail::getHIPHooks() 2025-05-07T20:03:46.7866838Z U at::get_num_threads() 2025-05-07T20:03:46.7867157Z U at::get_thread_num() 2025-05-07T20:03:46.7867523Z U at::globalContext() 2025-05-07T20:03:46.7867824Z U at::in_parallel_region() 2025-05-07T20:03:46.7868161Z U at::init_num_threads() 2025-05-07T20:03:46.7868479Z U at::internal::set_thread_num(int) 2025-05-07T20:03:46.7868889Z U c10::AutogradMetaInterface::~AutogradMetaInterface() 2025-05-07T20:03:46.7869337Z U c10::BFloat16* at::TensorBase::data_ptr() const 2025-05-07T20:03:46.7869865Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:46.7870355Z U c10::ClassType::addMethod(torch::jit::Function*) 2025-05-07T20:03:46.7871016Z U c10::ClassType::getMethod(std::__cxx11::basic_string, std::allocator > const&) const 2025-05-07T20:03:46.7872058Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:03:46.7873023Z U c10::DictType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:46.7874176Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:03:46.7874802Z U c10::Error::what() const 2025-05-07T20:03:46.7875159Z U c10::GradMode::is_enabled() 2025-05-07T20:03:46.7875491Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:03:46.7875884Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:03:46.7876326Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:46.7876810Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:03:46.7877234Z U c10::IValue::is(c10::IValue const&) const 2025-05-07T20:03:46.7877600Z U c10::IValue::isTensorList() const 2025-05-07T20:03:46.7877999Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:03:46.7878363Z U c10::IntType::get() 2025-05-07T20:03:46.7879070Z U c10::ListType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:46.7879836Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:03:46.7880308Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:03:46.7880674Z U c10::NoneType::get() 2025-05-07T20:03:46.7881103Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:46.7881601Z U c10::ParallelGuard::ParallelGuard(bool) 2025-05-07T20:03:46.7881969Z U c10::ParallelGuard::~ParallelGuard() 2025-05-07T20:03:46.7882376Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:46.7882777Z U c10::StringType::get() 2025-05-07T20:03:46.7883128Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:03:46.7883802Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:03:46.7884582Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:03:46.7884963Z U c10::SymInt::promote_to_negative() 2025-05-07T20:03:46.7885355Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:03:46.7886049Z U c10::TensorImpl::set_autograd_meta(std::unique_ptr >) 2025-05-07T20:03:46.7886712Z U c10::TensorType::get() 2025-05-07T20:03:46.7887706Z U c10::TupleType::TupleType(std::vector, std::allocator > >, std::optional, std::shared_ptr) 2025-05-07T20:03:46.7888724Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:03:46.7889663Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:03:46.7891169Z U c10::_fastEqualsForContainer(c10::IValue const&, c10::IValue const&) 2025-05-07T20:03:46.7891819Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:03:46.7892269Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:03:46.7892624Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:03:46.7892990Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:03:46.7893342Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:03:46.7893711Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:03:46.7894205Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:03:46.7894665Z U c10::cuda::device_count() 2025-05-07T20:03:46.7895030Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:03:46.7895410Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:03:46.7895831Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:03:46.7896223Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:03:46.7896658Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:03:46.7897064Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:03:46.7897709Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:46.7898781Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:46.7900557Z U c10::detail::infer_schema::make_function_schema(std::__cxx11::basic_string, std::allocator >&&, std::__cxx11::basic_string, std::allocator >&&, c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:46.7901941Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:03:46.7902824Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:46.7903818Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:03:46.7904863Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:46.7905755Z U c10::getCustomClassTypeImpl(std::type_index const&) 2025-05-07T20:03:46.7906161Z U c10::get_default_dtype() 2025-05-07T20:03:46.7906652Z U c10::impl::ExcludeDispatchKeyGuard::ExcludeDispatchKeyGuard(c10::DispatchKeySet) 2025-05-07T20:03:46.7907266Z U c10::impl::ExcludeDispatchKeyGuard::~ExcludeDispatchKeyGuard() 2025-05-07T20:03:46.7907705Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:03:46.7908066Z U c10::impl::GPUTrace::haveState 2025-05-07T20:03:46.7908428Z U c10::impl::PyObjectSlot::~PyObjectSlot() 2025-05-07T20:03:46.7908829Z U c10::impl::device_guard_impl_registry 2025-05-07T20:03:46.7909444Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:03:46.7910087Z U c10::ivalue::Future::extractStorages(c10::IValue const&) 2025-05-07T20:03:46.7910521Z U c10::ivalue::Object::resizeObject(unsigned long) 2025-05-07T20:03:46.7911035Z U c10::ivalue::checkCustomClassType(c10::ClassType const*, c10::Type const*) 2025-05-07T20:03:46.7911623Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:03:46.7912082Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:03:46.7912524Z U c10::operator<<(std::ostream&, c10::FunctionSchema const&) 2025-05-07T20:03:46.7912929Z U c10::warn(c10::Warning const&) 2025-05-07T20:03:46.7913365Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:03:46.7913807Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:03:46.7914192Z U cudaDeviceSynchronize@libcudart.so.11.0 2025-05-07T20:03:46.7914571Z U cudaEventCreateWithFlags@libcudart.so.11.0 2025-05-07T20:03:46.7914963Z U cudaEventDestroy@libcudart.so.11.0 2025-05-07T20:03:46.7915323Z U cudaEventElapsedTime@libcudart.so.11.0 2025-05-07T20:03:46.7915689Z U cudaEventQuery@libcudart.so.11.0 2025-05-07T20:03:46.7916054Z U cudaEventRecord@libcudart.so.11.0 2025-05-07T20:03:46.7916401Z U cudaEventSynchronize@libcudart.so.11.0 2025-05-07T20:03:46.7916783Z U cudaGetErrorString@libcudart.so.11.0 2025-05-07T20:03:46.7917143Z U cudaGetLastError@libcudart.so.11.0 2025-05-07T20:03:46.7917506Z U cudaLaunchKernel@libcudart.so.11.0 2025-05-07T20:03:46.7917845Z U cudaStreamQuery@libcudart.so.11.0 2025-05-07T20:03:46.7918218Z U cudaStreamSynchronize@libcudart.so.11.0 2025-05-07T20:03:46.7918579Z U cudaStreamWaitEvent@libcudart.so.11.0 2025-05-07T20:03:46.7919584Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:46.7921324Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:46.7923095Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:46.7924870Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:46.7926465Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:46.7928164Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:46.7929731Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMFP8WithStrides(long, bool, bool, bool, long, long, int, int, bool) 2025-05-07T20:03:46.7931264Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:03:46.7932910Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:46.7934582Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:03:46.7936373Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:46.7938017Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMFP8WithStrides(long, bool, bool, bool, long, long, int, int, bool) 2025-05-07T20:03:46.7939680Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:03:46.7941441Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:46.7943081Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMFP8WithStrides(long, bool, bool, bool, long, long, int, int, bool) 2025-05-07T20:03:46.7944621Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:03:46.7946323Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:46.7948257Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:03:46.7950140Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:46.7952229Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMFP8WithStrides(long, bool, bool, bool, long, long, int, int, bool) 2025-05-07T20:03:46.7954045Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:03:46.7956039Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:46.7957934Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:46.7959811Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:46.7961758Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:46.7963676Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:46.7965686Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:46.7967619Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:46.7968786Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:03:46.7969237Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:46.7969686Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:03:46.7970084Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:46.7970780Z U linearize_cache_indices_cuda(at::Tensor const&, at::Tensor const&, at::Tensor const&, std::optional const&, long, long) 2025-05-07T20:03:46.7971487Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:03:46.7971940Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:03:46.7972396Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:46.7973245Z U lru_cache_populate_byte_cuda(at::Tensor, at::Tensor, long, at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor, long, at::Tensor, long, bool, std::optional) 2025-05-07T20:03:46.7974378Z U lxu_cache_lookup_cuda(at::Tensor, at::Tensor, long, bool, std::optional, std::optional, std::optional) 2025-05-07T20:03:46.7975031Z U memcpy@GLIBC_2.14 2025-05-07T20:03:46.7975331Z U memmove@GLIBC_2.2.5 2025-05-07T20:03:46.7975654Z U memset@GLIBC_2.2.5 2025-05-07T20:03:46.7975953Z U omp_get_num_threads 2025-05-07T20:03:46.7976273Z U omp_get_thread_num 2025-05-07T20:03:46.7976615Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:03:46.7977041Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:46.7977515Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:03:46.7978185Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:03:46.7979049Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:46.7979972Z U std::__cxx11::basic_string, std::allocator >::_M_append(char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:46.7981031Z U std::__cxx11::basic_string, std::allocator >::_M_assign(std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:46.7982089Z U std::__cxx11::basic_string, std::allocator >::_M_construct(unsigned long, char)@GLIBCXX_3.4.21 2025-05-07T20:03:46.7983025Z U std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:46.7984065Z U std::__cxx11::basic_string, std::allocator >::_M_mutate(unsigned long, unsigned long, char const*, unsigned long) 2025-05-07T20:03:46.7985154Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:46.7986178Z U std::__cxx11::basic_string, std::allocator >::find(char, unsigned long) const@GLIBCXX_3.4.21 2025-05-07T20:03:46.7987180Z U std::__cxx11::basic_string, std::allocator >::reserve(unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:46.7988092Z U std::__cxx11::basic_stringbuf, std::allocator >::_M_sync(char*, unsigned long, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:46.7988985Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const@GLIBCXX_3.4.21 2025-05-07T20:03:46.7989747Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:03:46.7990791Z U std::__cxx11::basic_stringstream, std::allocator >::~basic_stringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:46.7991950Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:03:46.7992599Z U std::__exception_ptr::exception_ptr::_M_addref() 2025-05-07T20:03:46.7993049Z U std::__exception_ptr::exception_ptr::_M_release() 2025-05-07T20:03:46.7993441Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:03:46.7993912Z U std::__throw_bad_array_new_length() 2025-05-07T20:03:46.7994295Z U std::__throw_bad_function_call()@GLIBCXX_3.4.14 2025-05-07T20:03:46.7994737Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:46.7995175Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:46.7995613Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:03:46.7996080Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:03:46.7996582Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:03:46.7997316Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:03:46.7998386Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:03:46.7999594Z U std::basic_ostream >& std::operator<< >(std::basic_ostream >&, char const*)@GLIBCXX_3.4 2025-05-07T20:03:46.8000442Z U std::condition_variable::condition_variable()@GLIBCXX_3.4.11 2025-05-07T20:03:46.8000978Z U std::condition_variable::notify_all()@GLIBCXX_3.4.11 2025-05-07T20:03:46.8001440Z U std::condition_variable::~condition_variable()@GLIBCXX_3.4.11 2025-05-07T20:03:46.8001897Z U std::current_exception()@CXXABI_1.3.3 2025-05-07T20:03:46.8002267Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:03:46.8002666Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:03:46.8003060Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:03:46.8003430Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:03:46.8003864Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:03:46.8004451Z U std::istream& std::istream::_M_extract(long&)@GLIBCXX_3.4.9 2025-05-07T20:03:46.8004880Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:03:46.8005228Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:03:46.8005830Z U std::logic_error::logic_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:03:46.8006468Z U std::logic_error::~logic_error()@GLIBCXX_3.4 2025-05-07T20:03:46.8006877Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:03:46.8007409Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:03:46.8007891Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:03:46.8008277Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:03:46.8008699Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:03:46.8009145Z U std::rethrow_exception(std::__exception_ptr::exception_ptr)@CXXABI_1.3.3 2025-05-07T20:03:46.8009594Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:46.8009931Z U strcmp@GLIBC_2.2.5 2025-05-07T20:03:46.8010219Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:46.8010557Z U torch::CppFunction::~CppFunction() 2025-05-07T20:03:46.8011345Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:03:46.8012502Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:03:46.8013328Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:03:46.8014678Z U torch::detail::class_base::class_base(std::__cxx11::basic_string, std::allocator > const&, std::__cxx11::basic_string, std::allocator > const&, std::__cxx11::basic_string, std::allocator >, std::type_info const&, std::type_info const&) 2025-05-07T20:03:46.8016147Z U torch::detail::class_base::withNewArguments(c10::FunctionSchema const&, std::initializer_list) 2025-05-07T20:03:46.8016960Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:03:46.8017779Z U torch::registerCustomClassMethod(std::unique_ptr >) 2025-05-07T20:03:46.8018427Z U torch::serialize::InputArchive::InputArchive() 2025-05-07T20:03:46.8018997Z U torch::serialize::InputArchive::load_from(char const*, unsigned long, std::optional) 2025-05-07T20:03:46.8021552Z U torch::serialize::InputArchive::read(std::__cxx11::basic_string, std::allocator > const&, at::Tensor&, bool) 2025-05-07T20:03:46.8022470Z U torch::serialize::OutputArchive::OutputArchive(std::shared_ptr) 2025-05-07T20:03:46.8023055Z U torch::serialize::OutputArchive::save_to(std::ostream&) 2025-05-07T20:03:46.8023777Z U torch::serialize::OutputArchive::write(std::__cxx11::basic_string, std::allocator > const&, at::Tensor const&, bool) 2025-05-07T20:03:46.8024473Z U typeinfo for c10::Error 2025-05-07T20:03:46.8024806Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:03:46.8025192Z U typeinfo for std::logic_error@GLIBCXX_3.4 2025-05-07T20:03:46.8025645Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:03:46.8026055Z U unsigned char* at::TensorBase::data_ptr() const 2025-05-07T20:03:46.8026559Z U unsigned char* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:46.8027020Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:46.8027454Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:03:46.8027890Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:46.8028255Z U vtable for c10::Error 2025-05-07T20:03:46.8028804Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:46.8029566Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:46.8030334Z U vtable for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:46.8030989Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:03:46.8031616Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:03:46.8032374Z U vtable for torch::autograd::AutogradMeta 2025-05-07T20:03:46.8032733Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:46.8033085Z w _ITM_registerTMCloneTable 2025-05-07T20:03:46.8033427Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:46.8033746Z w __gmon_start__ 2025-05-07T20:03:46.8034066Z w __pthread_key_create 2025-05-07T20:03:46.8034378Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:03:46.8034780Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:03:46.8035150Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:46.8035641Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:03:46.8035973Z 2025-05-07T20:03:46.8036151Z linux-vdso.so.1 (0x00007ffcc9073000) 2025-05-07T20:03:46.8036450Z libc10.so => not found 2025-05-07T20:03:46.8036727Z libc10_cuda.so => not found 2025-05-07T20:03:46.8037264Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so (0x00007f9234a00000) 2025-05-07T20:03:46.8038211Z fbgemm_gpu_tbe_cache.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_cache.so (0x00007f9233c00000) 2025-05-07T20:03:46.8038892Z libtorch.so => not found 2025-05-07T20:03:46.8039160Z libtorch_cpu.so => not found 2025-05-07T20:03:46.8039450Z libtorch_cuda.so => not found 2025-05-07T20:03:46.8039732Z libcudart.so.11.0 => not found 2025-05-07T20:03:46.8040087Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f923399c000) 2025-05-07T20:03:46.8040511Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f92429b9000) 2025-05-07T20:03:46.8040918Z libc.so.6 => /lib64/libc.so.6 (0x00007f9233794000) 2025-05-07T20:03:46.8041313Z /lib64/ld-linux-x86-64.so.2 (0x00007f92429ed000) 2025-05-07T20:03:46.8041639Z libc10.so => not found 2025-05-07T20:03:46.8042206Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so (0x00007f924293c000) 2025-05-07T20:03:46.8042767Z libtorch.so => not found 2025-05-07T20:03:46.8043036Z libtorch_cpu.so => not found 2025-05-07T20:03:46.8043315Z libtorch_cuda.so => not found 2025-05-07T20:03:46.8043634Z libm.so.6 => /lib64/libm.so.6 (0x00007f9234925000) 2025-05-07T20:03:46.8044092Z libtorch.so => not found 2025-05-07T20:03:46.8044352Z libc10.so => not found 2025-05-07T20:03:46.8044605Z libc10_cuda.so => not found 2025-05-07T20:03:46.8044864Z libtorch_cpu.so => not found 2025-05-07T20:03:46.8045141Z libtorch_cuda.so => not found 2025-05-07T20:03:46.8045439Z libcudart.so.11.0 => not found 2025-05-07T20:03:46.8045747Z libtorch_cpu.so => not found 2025-05-07T20:03:46.8046010Z libtorch_cuda.so => not found 2025-05-07T20:03:46.8046290Z libtorch.so => not found 2025-05-07T20:03:46.8046444Z 2025-05-07T20:03:46.8046549Z [CHECK] Displaying ELF information: 2025-05-07T20:03:46.8046988Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:03:46.8047312Z 2025-05-07T20:03:46.8047317Z 2025-05-07T20:03:46.8047492Z Dynamic section at offset 0xd445b20 contains 40 entries: 2025-05-07T20:03:46.8047851Z Tag Type Name/Value 2025-05-07T20:03:46.8048260Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:03:46.8048726Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:03:46.8049214Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm.so] 2025-05-07T20:03:46.8049691Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_cache.so] 2025-05-07T20:03:46.8050211Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:46.8050706Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:46.8051192Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:46.8051701Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.11.0] 2025-05-07T20:03:46.8052183Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:46.8052674Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:46.8053127Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:46.8053632Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:03:46.8054218Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_inference.so] 2025-05-07T20:03:46.8054707Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:03:46.8055113Z 0x000000000000000c (INIT) 0x198000 2025-05-07T20:03:46.8055428Z 0x000000000000000d (FINI) 0x7e57ec 2025-05-07T20:03:46.8055769Z 0x0000000000000019 (INIT_ARRAY) 0xd445f00 2025-05-07T20:03:46.8056102Z 0x000000000000001b (INIT_ARRAYSZ) 392 (bytes) 2025-05-07T20:03:46.8056455Z 0x000000000000001a (FINI_ARRAY) 0xd446088 2025-05-07T20:03:46.8056808Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:46.8057117Z 0x0000000000000004 (HASH) 0x238 2025-05-07T20:03:46.8057452Z 0x000000006ffffef5 (GNU_HASH) 0x6f70 2025-05-07T20:03:46.8057768Z 0x0000000000000005 (STRTAB) 0x2b828 2025-05-07T20:03:46.8058091Z 0x0000000000000006 (SYMTAB) 0xe890 2025-05-07T20:03:46.8058431Z 0x000000000000000a (STRSZ) 1358400 (bytes) 2025-05-07T20:03:46.8058787Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:46.8059133Z 0x0000000000000003 (PLTGOT) 0xd446de0 2025-05-07T20:03:46.8059470Z 0x0000000000000002 (PLTRELSZ) 15480 (bytes) 2025-05-07T20:03:46.8059834Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:46.8060142Z 0x0000000000000017 (JMPREL) 0x1941f0 2025-05-07T20:03:46.8060521Z 0x0000000000000007 (RELA) 0x179a60 2025-05-07T20:03:46.8060860Z 0x0000000000000008 (RELASZ) 108432 (bytes) 2025-05-07T20:03:46.8061220Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:46.8061534Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:46.8061837Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:46.8062165Z 0x000000006ffffffe (VERNEED) 0x179910 2025-05-07T20:03:46.8062469Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:03:46.8062778Z 0x000000006ffffff0 (VERSYM) 0x177268 2025-05-07T20:03:46.8063081Z 0x000000006ffffff9 (RELACOUNT) 79 2025-05-07T20:03:46.8063428Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:46.8063643Z 2025-05-07T20:03:46.8063753Z ################################################################################ 2025-05-07T20:03:46.8063977Z 2025-05-07T20:03:46.8063981Z 2025-05-07T20:03:46.8064093Z ################################################################################ 2025-05-07T20:03:46.8064598Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:03:46.8065079Z [CHECK] Listing out library size: 2025-05-07T20:03:46.8065529Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:03:46.8065899Z 2025-05-07T20:03:46.8066113Z 192 ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:03:46.8066439Z 2025-05-07T20:03:46.8066822Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:03:46.8067799Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_forward.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:46.8068374Z 2025-05-07T20:03:46.8937315Z GLIBC_2.2.5 2025-05-07T20:03:46.8937627Z GLIBC_2.3 2025-05-07T20:03:46.8937846Z GLIBC_2.14 2025-05-07T20:03:46.8937997Z 2025-05-07T20:03:46.8938031Z 2025-05-07T20:03:46.8938509Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:03:46.8939629Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_forward.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:46.8940321Z 2025-05-07T20:03:46.9935152Z GLIBCXX_3.4 2025-05-07T20:03:46.9935832Z GLIBCXX_3.4.9 2025-05-07T20:03:46.9936505Z GLIBCXX_3.4.20 2025-05-07T20:03:46.9937077Z GLIBCXX_3.4.21 2025-05-07T20:03:46.9938724Z 2025-05-07T20:03:46.9938740Z 2025-05-07T20:03:46.9958805Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_forward.so > /tmp/tmp.SbuJV5Vo8n.symbols.txt 2025-05-07T20:03:46.9959338Z 2025-05-07T20:03:47.0882565Z 2025-05-07T20:03:47.0925058Z [CHECK] Total Number of symbols: 12654 2025-05-07T20:03:47.0974559Z [CHECK] Number of fbgemm symbols: 5268 2025-05-07T20:03:47.0999521Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_forward.so > /tmp/tmp.IOhksndaqI.usymbols.txt 2025-05-07T20:03:47.1000059Z 2025-05-07T20:03:47.1056824Z 2025-05-07T20:03:47.1092575Z [CHECK] Listing out undefined symbols (183 total): 2025-05-07T20:03:47.1113976Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:47.1114628Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:47.1115018Z U __cudaPopCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:47.1115440Z U __cudaPushCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:47.1115860Z U __cudaRegisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:47.1116267Z U __cudaRegisterFatBinaryEnd@libcudart.so.11.0 2025-05-07T20:03:47.1116648Z U __cudaRegisterFunction@libcudart.so.11.0 2025-05-07T20:03:47.1117219Z U __cudaRegisterVar@libcudart.so.11.0 2025-05-07T20:03:47.1117604Z U __cudaUnregisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:47.1117988Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:03:47.1118313Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:03:47.1118638Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:47.1118967Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:03:47.1119283Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:03:47.1119619Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:03:47.1119939Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:03:47.1120356Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:03:47.1120744Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:03:47.1121061Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:03:47.1121372Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:47.1121705Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:03:47.1122106Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:03:47.1122517Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:03:47.1123072Z U at::Tensor fbgemm_gpu::reshape_vbe_offsets(at::Tensor const&, at::Tensor const&, long, int) 2025-05-07T20:03:47.1123787Z U at::Tensor fbgemm_gpu::reshape_vbe_offsets(at::Tensor const&, at::Tensor const&, long, int) 2025-05-07T20:03:47.1124442Z U at::Tensor::index(std::initializer_list) const 2025-05-07T20:03:47.1125068Z U at::Tensor::index_put_(std::initializer_list, at::Tensor const&) 2025-05-07T20:03:47.1126112Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:47.1127141Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:03:47.1127876Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:03:47.1128323Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:03:47.1128757Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:03:47.1129188Z U c10::BFloat16* at::TensorBase::data_ptr() const 2025-05-07T20:03:47.1129703Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:47.1130118Z U c10::BoolType::get() 2025-05-07T20:03:47.1130458Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:03:47.1130904Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:03:47.1131310Z U c10::Dispatcher::realSingleton() 2025-05-07T20:03:47.1132005Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:03:47.1133193Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:03:47.1134243Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:03:47.1134809Z U c10::Error::what() const 2025-05-07T20:03:47.1135105Z U c10::FloatType::get() 2025-05-07T20:03:47.1135463Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:03:47.1135875Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:47.1136368Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:03:47.1136721Z U c10::IntType::get() 2025-05-07T20:03:47.1137063Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:03:47.1137466Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:03:47.1137800Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:03:47.1138167Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:03:47.1138542Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:03:47.1138919Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:03:47.1139362Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:03:47.1140015Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:03:47.1140643Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:03:47.1141018Z U c10::SymInt::operator+=(c10::SymInt const&) 2025-05-07T20:03:47.1141371Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:03:47.1141733Z U c10::SymInt::promote_to_negative() 2025-05-07T20:03:47.1142062Z U c10::SymInt::sym_eq(c10::SymInt const&) const 2025-05-07T20:03:47.1142431Z U c10::SymInt::sym_ge(c10::SymInt const&) const 2025-05-07T20:03:47.1142779Z U c10::SymInt::sym_gt(c10::SymInt const&) const 2025-05-07T20:03:47.1143135Z U c10::SymInt::sym_le(c10::SymInt const&) const 2025-05-07T20:03:47.1143490Z U c10::SymInt::sym_ne(c10::SymInt const&) const 2025-05-07T20:03:47.1143812Z U c10::SymInt::toSymNode() const 2025-05-07T20:03:47.1144119Z U c10::SymIntType::get() 2025-05-07T20:03:47.1144404Z U c10::TensorType::get() 2025-05-07T20:03:47.1144720Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:03:47.1145610Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:03:47.1146496Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:03:47.1146848Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:03:47.1147169Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:03:47.1147536Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:03:47.1147870Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:03:47.1148189Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:03:47.1148640Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:03:47.1149078Z U c10::cuda::device_count() 2025-05-07T20:03:47.1149414Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:03:47.1149772Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:03:47.1150152Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:03:47.1150534Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:03:47.1150909Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:03:47.1151384Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:03:47.1152382Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:47.1153286Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:03:47.1154197Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:47.1155145Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:03:47.1156183Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:47.1156998Z U c10::get_default_dtype() 2025-05-07T20:03:47.1157323Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:03:47.1157678Z U c10::impl::GPUTrace::haveState 2025-05-07T20:03:47.1158264Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:03:47.1158939Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:03:47.1159378Z U c10::impl::device_guard_impl_registry 2025-05-07T20:03:47.1159728Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:03:47.1160121Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:03:47.1160505Z U c10::operator%(c10::SymInt const&, int) 2025-05-07T20:03:47.1160875Z U c10::operator*(c10::SymInt const&, long) 2025-05-07T20:03:47.1161247Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:03:47.1161592Z U c10::operator<(c10::SymInt const&, int) 2025-05-07T20:03:47.1161978Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:03:47.1162371Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:03:47.1162802Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:03:47.1163220Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:03:47.1163599Z U c10::warn(c10::Warning const&) 2025-05-07T20:03:47.1164142Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:03:47.1164554Z U cudaDeviceSynchronize@libcudart.so.11.0 2025-05-07T20:03:47.1164922Z U cudaEventCreateWithFlags@libcudart.so.11.0 2025-05-07T20:03:47.1165270Z U cudaEventDestroy@libcudart.so.11.0 2025-05-07T20:03:47.1165624Z U cudaEventElapsedTime@libcudart.so.11.0 2025-05-07T20:03:47.1165956Z U cudaEventQuery@libcudart.so.11.0 2025-05-07T20:03:47.1166293Z U cudaEventRecord@libcudart.so.11.0 2025-05-07T20:03:47.1166665Z U cudaEventSynchronize@libcudart.so.11.0 2025-05-07T20:03:47.1166999Z U cudaGetErrorString@libcudart.so.11.0 2025-05-07T20:03:47.1167340Z U cudaGetLastError@libcudart.so.11.0 2025-05-07T20:03:47.1167660Z U cudaLaunchKernel@libcudart.so.11.0 2025-05-07T20:03:47.1167995Z U cudaStreamQuery@libcudart.so.11.0 2025-05-07T20:03:47.1168326Z U cudaStreamSynchronize@libcudart.so.11.0 2025-05-07T20:03:47.1168681Z U cudaStreamWaitEvent@libcudart.so.11.0 2025-05-07T20:03:47.1169374Z U fbgemm_gpu::reshape_vbe_output(at::Tensor const&, long, at::Tensor const&, at::Tensor const&) 2025-05-07T20:03:47.1169879Z U float at::Tensor::item() const 2025-05-07T20:03:47.1170256Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:03:47.1170663Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:47.1171036Z U free@GLIBC_2.2.5 2025-05-07T20:03:47.1171154Z U int at::Tensor::item() const 2025-05-07T20:03:47.1171300Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:03:47.1171613Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:47.1171794Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:03:47.1171983Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:03:47.1172139Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:47.1172242Z U memcpy@GLIBC_2.14 2025-05-07T20:03:47.1172347Z U memset@GLIBC_2.2.5 2025-05-07T20:03:47.1172524Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:03:47.1172651Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:47.1173006Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:03:47.1173528Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:47.1174008Z U std::__cxx11::basic_string, std::allocator >::_M_append(char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:47.1174569Z U std::__cxx11::basic_string, std::allocator >::_M_assign(std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:47.1174959Z U std::__cxx11::basic_string, std::allocator >::_M_construct(unsigned long, char)@GLIBCXX_3.4.21 2025-05-07T20:03:47.1175387Z U std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:47.1175845Z U std::__cxx11::basic_string, std::allocator >::_M_mutate(unsigned long, unsigned long, char const*, unsigned long) 2025-05-07T20:03:47.1176364Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:47.1176733Z U std::__cxx11::basic_string, std::allocator >::append(char const*)@GLIBCXX_3.4.21 2025-05-07T20:03:47.1177067Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const@GLIBCXX_3.4.21 2025-05-07T20:03:47.1177215Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:47.1177379Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:47.1177589Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:03:47.1177837Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:03:47.1178206Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:03:47.1178788Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:03:47.1179323Z U std::basic_ostream >& std::operator<< >(std::basic_ostream >&, char const*)@GLIBCXX_3.4 2025-05-07T20:03:47.1179458Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:03:47.1179582Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:03:47.1179730Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:03:47.1179854Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:03:47.1179980Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:03:47.1180121Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:03:47.1180237Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:03:47.1180458Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:03:47.1180718Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:03:47.1180851Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:03:47.1180969Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:47.1181073Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:47.1181221Z U torch::CppFunction::~CppFunction() 2025-05-07T20:03:47.1181821Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:03:47.1182361Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:03:47.1182627Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:03:47.1182998Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:03:47.1183137Z U typeinfo for c10::Error 2025-05-07T20:03:47.1183297Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:47.1183469Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:03:47.1183651Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:47.1183762Z U vtable for c10::Error 2025-05-07T20:03:47.1184118Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:47.1184477Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:47.1184687Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:03:47.1184918Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:03:47.1185117Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:03:47.1185239Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:47.1185355Z w _ITM_registerTMCloneTable 2025-05-07T20:03:47.1185466Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:47.1185586Z w __gmon_start__ 2025-05-07T20:03:47.1185741Z w __pthread_key_create 2025-05-07T20:03:47.1185892Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:47.1186161Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:03:47.1186168Z 2025-05-07T20:03:47.1186283Z linux-vdso.so.1 (0x00007fffde4f7000) 2025-05-07T20:03:47.1186380Z libc10.so => not found 2025-05-07T20:03:47.1186505Z libc10_cuda.so => not found 2025-05-07T20:03:47.1205075Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so (0x00007f0e02e00000) 2025-05-07T20:03:47.1205305Z libtorch.so => not found 2025-05-07T20:03:47.1205432Z libtorch_cpu.so => not found 2025-05-07T20:03:47.1205535Z libtorch_cuda.so => not found 2025-05-07T20:03:47.1205637Z libcudart.so.11.0 => not found 2025-05-07T20:03:47.1205835Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f0e02b9c000) 2025-05-07T20:03:47.1205989Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f0e0fb18000) 2025-05-07T20:03:47.1206131Z libc.so.6 => /lib64/libc.so.6 (0x00007f0e02994000) 2025-05-07T20:03:47.1206285Z /lib64/ld-linux-x86-64.so.2 (0x00007f0e0fb4c000) 2025-05-07T20:03:47.1206381Z libc10.so => not found 2025-05-07T20:03:47.1206475Z libc10_cuda.so => not found 2025-05-07T20:03:47.1206838Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so (0x00007f0e02400000) 2025-05-07T20:03:47.1207437Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so (0x00007f0e0fb0b000) 2025-05-07T20:03:47.1207536Z libtorch.so => not found 2025-05-07T20:03:47.1207643Z libtorch_cpu.so => not found 2025-05-07T20:03:47.1207776Z libtorch_cuda.so => not found 2025-05-07T20:03:47.1207874Z libcudart.so.11.0 => not found 2025-05-07T20:03:47.1208003Z libm.so.6 => /lib64/libm.so.6 (0x00007f0e0fa2e000) 2025-05-07T20:03:47.1208125Z libc10.so => not found 2025-05-07T20:03:47.1208487Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so (0x00007f0e0f9b3000) 2025-05-07T20:03:47.1208633Z libtorch.so => not found 2025-05-07T20:03:47.1208774Z libtorch_cpu.so => not found 2025-05-07T20:03:47.1208894Z libtorch_cuda.so => not found 2025-05-07T20:03:47.1208986Z libtorch.so => not found 2025-05-07T20:03:47.1209074Z libc10.so => not found 2025-05-07T20:03:47.1209191Z libtorch_cpu.so => not found 2025-05-07T20:03:47.1209298Z libtorch_cuda.so => not found 2025-05-07T20:03:47.1209398Z libtorch_cpu.so => not found 2025-05-07T20:03:47.1209504Z libtorch_cuda.so => not found 2025-05-07T20:03:47.1209630Z libtorch.so => not found 2025-05-07T20:03:47.1209654Z 2025-05-07T20:03:47.1209761Z [CHECK] Displaying ELF information: 2025-05-07T20:03:47.1210027Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:03:47.1210033Z 2025-05-07T20:03:47.1234132Z 2025-05-07T20:03:47.1234885Z Dynamic section at offset 0xbf1a3c0 contains 39 entries: 2025-05-07T20:03:47.1235307Z Tag Type Name/Value 2025-05-07T20:03:47.1235910Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:03:47.1236508Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:03:47.1237152Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_common.so] 2025-05-07T20:03:47.1237746Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:47.1238394Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:47.1238600Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:47.1238811Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.11.0] 2025-05-07T20:03:47.1239034Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:47.1239233Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:47.1239582Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:47.1239834Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:03:47.1240094Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_forward.so] 2025-05-07T20:03:47.1240283Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:03:47.1240429Z 0x000000000000000c (INIT) 0x449000 2025-05-07T20:03:47.1240551Z 0x000000000000000d (FINI) 0x2257c8c 2025-05-07T20:03:47.1240671Z 0x0000000000000019 (INIT_ARRAY) 0xbf197b8 2025-05-07T20:03:47.1240803Z 0x000000000000001b (INIT_ARRAYSZ) 752 (bytes) 2025-05-07T20:03:47.1240942Z 0x000000000000001a (FINI_ARRAY) 0xbf19aa8 2025-05-07T20:03:47.1241063Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:47.1241173Z 0x0000000000000004 (HASH) 0x200 2025-05-07T20:03:47.1241310Z 0x000000006ffffef5 (GNU_HASH) 0x10760 2025-05-07T20:03:47.1241422Z 0x0000000000000005 (STRTAB) 0x6ec58 2025-05-07T20:03:47.1241537Z 0x0000000000000006 (SYMTAB) 0x249f0 2025-05-07T20:03:47.1241698Z 0x000000000000000a (STRSZ) 3684386 (bytes) 2025-05-07T20:03:47.1241819Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:47.1241940Z 0x0000000000000003 (PLTGOT) 0xbf1b670 2025-05-07T20:03:47.1242125Z 0x0000000000000002 (PLTRELSZ) 10392 (bytes) 2025-05-07T20:03:47.1242256Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:47.1242372Z 0x0000000000000017 (JMPREL) 0x4457a8 2025-05-07T20:03:47.1242486Z 0x0000000000000007 (RELA) 0x3f8858 2025-05-07T20:03:47.1242641Z 0x0000000000000008 (RELASZ) 315216 (bytes) 2025-05-07T20:03:47.1242762Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:47.1242864Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:47.1243012Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:47.1244862Z 0x000000006ffffffe (VERNEED) 0x3f8758 2025-05-07T20:03:47.1245013Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:03:47.1245129Z 0x000000006ffffff0 (VERSYM) 0x3f247a 2025-05-07T20:03:47.1245259Z 0x000000006ffffff9 (RELACOUNT) 136 2025-05-07T20:03:47.1245361Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:47.1245383Z 2025-05-07T20:03:47.1245504Z ################################################################################ 2025-05-07T20:03:47.1245510Z 2025-05-07T20:03:47.1245530Z 2025-05-07T20:03:47.1245645Z ################################################################################ 2025-05-07T20:03:47.1246001Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:03:47.1246106Z [CHECK] Listing out library size: 2025-05-07T20:03:47.1246470Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:03:47.1246477Z 2025-05-07T20:03:47.1256385Z 4 ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:03:47.1256398Z 2025-05-07T20:03:47.1256955Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:03:47.1257593Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:47.1257598Z 2025-05-07T20:03:47.1476926Z GLIBC_2.2.5 2025-05-07T20:03:47.1477190Z GLIBC_2.3 2025-05-07T20:03:47.1477864Z GLIBC_2.14 2025-05-07T20:03:47.1482778Z 2025-05-07T20:03:47.1482860Z 2025-05-07T20:03:47.1483498Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:03:47.1484369Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:47.1484390Z 2025-05-07T20:03:47.1710651Z GLIBCXX_3.4 2025-05-07T20:03:47.1711808Z GLIBCXX_3.4.9 2025-05-07T20:03:47.1712126Z GLIBCXX_3.4.11 2025-05-07T20:03:47.1712394Z GLIBCXX_3.4.15 2025-05-07T20:03:47.1712631Z GLIBCXX_3.4.18 2025-05-07T20:03:47.1712885Z GLIBCXX_3.4.20 2025-05-07T20:03:47.1713118Z GLIBCXX_3.4.21 2025-05-07T20:03:47.1713754Z 2025-05-07T20:03:47.1713804Z 2025-05-07T20:03:47.1740213Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so > /tmp/tmp.cUMHLTLcsp.symbols.txt 2025-05-07T20:03:47.1740295Z 2025-05-07T20:03:47.1920937Z 2025-05-07T20:03:47.1950674Z [CHECK] Total Number of symbols: 2656 2025-05-07T20:03:47.1978893Z [CHECK] Number of fbgemm symbols: 1 2025-05-07T20:03:47.1998389Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so > /tmp/tmp.gsRxylGUKN.usymbols.txt 2025-05-07T20:03:47.2000102Z 2025-05-07T20:03:47.2028491Z 2025-05-07T20:03:47.2061678Z [CHECK] Listing out undefined symbols (202 total): 2025-05-07T20:03:47.2079194Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:47.2079840Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:47.2080307Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:03:47.2080701Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:03:47.2081064Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:47.2081410Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:03:47.2081787Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:03:47.2082141Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:03:47.2082517Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:03:47.2082873Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:03:47.2083283Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:03:47.2084690Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:03:47.2085141Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:47.2085487Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:03:47.2085859Z U adjust_info_B_num_bits(int, int) 2025-05-07T20:03:47.2086291Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:03:47.2086764Z U at::RecordFunction::currentThreadId() 2025-05-07T20:03:47.2087134Z U at::RecordFunction::end() 2025-05-07T20:03:47.2087528Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:03:47.2087959Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:03:47.2088989Z U at::_ops::_sparse_coo_tensor_unsafe::call(at::Tensor const&, at::Tensor const&, c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:47.2090217Z U at::_ops::clamp::call(at::Tensor const&, std::optional const&, std::optional const&) 2025-05-07T20:03:47.2091514Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:47.2092905Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:47.2093839Z U at::_ops::fill__Scalar::call(at::Tensor&, c10::Scalar const&) 2025-05-07T20:03:47.2094356Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:03:47.2094934Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:03:47.2095447Z U at::_ops::unsqueeze::call(at::Tensor const&, long) 2025-05-07T20:03:47.2095921Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:03:47.2096342Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:03:47.2096797Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:03:47.2097308Z U at::sequence_number::get_and_increment() 2025-05-07T20:03:47.2097673Z U c10::AnyType::get() 2025-05-07T20:03:47.2097980Z U c10::BoolType::get() 2025-05-07T20:03:47.2098387Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:03:47.2098819Z U c10::Dispatcher::realSingleton() 2025-05-07T20:03:47.2099536Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:03:47.2100750Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:03:47.2101884Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:03:47.2102442Z U c10::Error::what() const 2025-05-07T20:03:47.2102767Z U c10::FloatType::get() 2025-05-07T20:03:47.2103067Z U c10::GradMode::is_enabled() 2025-05-07T20:03:47.2103401Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:03:47.2103792Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:03:47.2104171Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:03:47.2104529Z U c10::IValue::isBoolList() const 2025-05-07T20:03:47.2104852Z U c10::IValue::isIntList() const 2025-05-07T20:03:47.2105241Z U c10::IValue::isSymIntList() const 2025-05-07T20:03:47.2105598Z U c10::IValue::isTensorList() const 2025-05-07T20:03:47.2105980Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:03:47.2106356Z U c10::IntType::get() 2025-05-07T20:03:47.2107013Z U c10::ListType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:47.2107773Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:03:47.2108162Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:03:47.2108520Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:03:47.2108895Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:03:47.2109330Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:47.2109937Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:03:47.2110416Z U c10::StringType::get() 2025-05-07T20:03:47.2110777Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:03:47.2111196Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:03:47.2111857Z U c10::SymBool::guard_size_oblivious(char const*, long) const 2025-05-07T20:03:47.2112351Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:03:47.2112839Z U c10::SymFloat::operator/(c10::SymFloat const&) const 2025-05-07T20:03:47.2113568Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:03:47.2114308Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:03:47.2114707Z U c10::SymInt::operator c10::SymFloat() const 2025-05-07T20:03:47.2115128Z U c10::SymInt::operator*(c10::SymInt const&) const 2025-05-07T20:03:47.2115528Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:03:47.2115941Z U c10::SymInt::promote_to_negative() 2025-05-07T20:03:47.2116318Z U c10::SymInt::sym_eq(c10::SymInt const&) const 2025-05-07T20:03:47.2116742Z U c10::SymInt::sym_gt(c10::SymInt const&) const 2025-05-07T20:03:47.2117144Z U c10::SymInt::sym_le(c10::SymInt const&) const 2025-05-07T20:03:47.2117505Z U c10::SymInt::toSymNode() const 2025-05-07T20:03:47.2117960Z U c10::SymIntType::get() 2025-05-07T20:03:47.2118279Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:03:47.2118632Z U c10::TensorType::get() 2025-05-07T20:03:47.2118981Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:03:47.2119608Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:47.2120639Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:47.2121503Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:03:47.2122334Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:47.2123246Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:03:47.2124219Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:47.2125252Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:03:47.2125866Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:03:47.2126271Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:03:47.2126662Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:03:47.2127298Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:03:47.2127878Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:03:47.2128291Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:03:47.2128705Z U c10::operator<<(std::ostream&, c10::SymFloat const&) 2025-05-07T20:03:47.2129123Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:03:47.2129579Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:03:47.2129994Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:03:47.2130659Z U fbgemm_gpu::config::is_feature_enabled(fbgemm_gpu::config::FeatureGateName const&) 2025-05-07T20:03:47.2131130Z U free@GLIBC_2.2.5 2025-05-07T20:03:47.2131531Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:03:47.2131931Z U memcmp@GLIBC_2.2.5 2025-05-07T20:03:47.2132254Z U memcpy@GLIBC_2.14 2025-05-07T20:03:47.2132579Z U memmove@GLIBC_2.2.5 2025-05-07T20:03:47.2132877Z U memset@GLIBC_2.2.5 2025-05-07T20:03:47.2133444Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:03:47.2133878Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:47.2134257Z U realloc@GLIBC_2.2.5 2025-05-07T20:03:47.2134674Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:03:47.2135387Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:03:47.2136276Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:47.2137189Z U std::__cxx11::basic_string, std::allocator >::_M_append(char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:47.2138287Z U std::__cxx11::basic_string, std::allocator >::_M_assign(std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:47.2139378Z U std::__cxx11::basic_string, std::allocator >::_M_construct(unsigned long, char)@GLIBCXX_3.4.21 2025-05-07T20:03:47.2140305Z U std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:47.2141406Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:47.2142428Z U std::__cxx11::basic_string, std::allocator >::reserve(unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:47.2143462Z U std::__cxx11::basic_string, std::allocator >::swap(std::__cxx11::basic_string, std::allocator >&)@GLIBCXX_3.4.21 2025-05-07T20:03:47.2144501Z U std::__cxx11::basic_string, std::allocator >::~basic_string()@GLIBCXX_3.4.21 2025-05-07T20:03:47.2145314Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const@GLIBCXX_3.4.21 2025-05-07T20:03:47.2146175Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:03:47.2147171Z U std::__cxx11::basic_stringstream, std::allocator >::~basic_stringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:47.2148134Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:03:47.2148709Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:03:47.2149080Z U std::__throw_bad_array_new_length() 2025-05-07T20:03:47.2149440Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:47.2149860Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:47.2150290Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:03:47.2150698Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:03:47.2151100Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:03:47.2151635Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:03:47.2152549Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:03:47.2153665Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:03:47.2154909Z U std::basic_ostream >& std::operator<< >(std::basic_ostream >&, char const*)@GLIBCXX_3.4 2025-05-07T20:03:47.2155708Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:03:47.2156111Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:03:47.2156481Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:03:47.2156874Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:03:47.2157232Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:03:47.2157618Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:03:47.2157997Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:03:47.2158416Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:03:47.2158996Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:03:47.2159489Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:03:47.2159932Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:03:47.2160395Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:03:47.2161098Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:03:47.2161858Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:03:47.2162240Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:47.2162600Z U strcmp@GLIBC_2.2.5 2025-05-07T20:03:47.2162909Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:47.2163269Z U torch::CppFunction::~CppFunction() 2025-05-07T20:03:47.2164237Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:03:47.2165341Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:03:47.2166212Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:03:47.2166715Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:03:47.2167223Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:03:47.2167807Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:03:47.2168291Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:03:47.2168809Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:03:47.2169462Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:03:47.2170049Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:03:47.2170519Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:03:47.2171021Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:03:47.2171429Z U torch::autograd::Node::assign_parent() 2025-05-07T20:03:47.2171797Z U torch::autograd::Node::metadata() 2025-05-07T20:03:47.2172148Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:03:47.2172651Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:03:47.2173252Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:03:47.2173780Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:03:47.2174253Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:03:47.2174806Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:03:47.2177843Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:03:47.2180942Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:03:47.2181408Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:03:47.2181855Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:03:47.2183002Z U torch::autograd::profiler::record_function_enter_new(std::__cxx11::basic_string, std::allocator > const&, std::optional, std::allocator > > const&) 2025-05-07T20:03:47.2184110Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:03:47.2184812Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:03:47.2185739Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:03:47.2186365Z U typeinfo for c10::Error 2025-05-07T20:03:47.2186757Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:03:47.2187201Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:03:47.2187582Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:03:47.2188000Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:03:47.2188408Z U typeinfo for torch::autograd::Node 2025-05-07T20:03:47.2188825Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:47.2189262Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:03:47.2189722Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:03:47.2190154Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:47.2190724Z U vtable for c10::Error 2025-05-07T20:03:47.2191457Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:47.2192263Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:47.2192863Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:03:47.2193317Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:03:47.2193903Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:03:47.2194408Z U vtable for torch::autograd::Node 2025-05-07T20:03:47.2194816Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:03:47.2195266Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:47.2195613Z w _ITM_registerTMCloneTable 2025-05-07T20:03:47.2195973Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:47.2196365Z w __gmon_start__ 2025-05-07T20:03:47.2196664Z w __pthread_key_create 2025-05-07T20:03:47.2197016Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:03:47.2197366Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:03:47.2197786Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:47.2198342Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:03:47.2198767Z 2025-05-07T20:03:47.2198921Z linux-vdso.so.1 (0x00007ffe057fb000) 2025-05-07T20:03:47.2199248Z libc10.so => not found 2025-05-07T20:03:47.2199872Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so (0x00007fbcca1bb000) 2025-05-07T20:03:47.2200943Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007fbcc9400000) 2025-05-07T20:03:47.2201664Z libtorch.so => not found 2025-05-07T20:03:47.2201953Z libtorch_cpu.so => not found 2025-05-07T20:03:47.2202279Z libtorch_cuda.so => not found 2025-05-07T20:03:47.2202643Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007fbcc919c000) 2025-05-07T20:03:47.2203106Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fbcc9dd2000) 2025-05-07T20:03:47.2203497Z libc.so.6 => /lib64/libc.so.6 (0x00007fbcc8f94000) 2025-05-07T20:03:47.2203935Z /lib64/ld-linux-x86-64.so.2 (0x00007fbcca1ca000) 2025-05-07T20:03:47.2204284Z libtorch.so => not found 2025-05-07T20:03:47.2204578Z libc10.so => not found 2025-05-07T20:03:47.2204867Z libtorch_cpu.so => not found 2025-05-07T20:03:47.2205149Z libtorch_cuda.so => not found 2025-05-07T20:03:47.2205449Z libtorch.so => not found 2025-05-07T20:03:47.2205708Z libc10.so => not found 2025-05-07T20:03:47.2205980Z libc10_cuda.so => not found 2025-05-07T20:03:47.2206252Z libtorch_cpu.so => not found 2025-05-07T20:03:47.2206548Z libtorch_cuda.so => not found 2025-05-07T20:03:47.2206841Z libcudart.so.11.0 => not found 2025-05-07T20:03:47.2207337Z libm.so.6 => /lib64/libm.so.6 (0x00007fbcc9cf7000) 2025-05-07T20:03:47.2207611Z 2025-05-07T20:03:47.2207727Z [CHECK] Displaying ELF information: 2025-05-07T20:03:47.2208269Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:03:47.2208688Z 2025-05-07T20:03:47.2208692Z 2025-05-07T20:03:47.2208877Z Dynamic section at offset 0x3a5ba0 contains 38 entries: 2025-05-07T20:03:47.2209257Z Tag Type Name/Value 2025-05-07T20:03:47.2209699Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:03:47.2210219Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_config.so] 2025-05-07T20:03:47.2210790Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_utils.so] 2025-05-07T20:03:47.2211341Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:47.2211850Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:47.2212396Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:47.2212905Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:47.2213425Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:47.2213923Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:47.2214456Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:03:47.2215103Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward_split_host.so] 2025-05-07T20:03:47.2215698Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:03:47.2216128Z 0x000000000000000c (INIT) 0xb9000 2025-05-07T20:03:47.2216472Z 0x000000000000000d (FINI) 0x3491ac 2025-05-07T20:03:47.2216872Z 0x0000000000000019 (INIT_ARRAY) 0x3a2b18 2025-05-07T20:03:47.2217235Z 0x000000000000001b (INIT_ARRAYSZ) 304 (bytes) 2025-05-07T20:03:47.2217619Z 0x000000000000001a (FINI_ARRAY) 0x3a2c48 2025-05-07T20:03:47.2217992Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:47.2218334Z 0x0000000000000004 (HASH) 0x200 2025-05-07T20:03:47.2218701Z 0x000000006ffffef5 (GNU_HASH) 0x3b10 2025-05-07T20:03:47.2219046Z 0x0000000000000005 (STRTAB) 0x17278 2025-05-07T20:03:47.2219406Z 0x0000000000000006 (SYMTAB) 0x7960 2025-05-07T20:03:47.2219760Z 0x000000000000000a (STRSZ) 530122 (bytes) 2025-05-07T20:03:47.2220149Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:47.2220497Z 0x0000000000000003 (PLTGOT) 0x3a5e40 2025-05-07T20:03:47.2220892Z 0x0000000000000002 (PLTRELSZ) 14136 (bytes) 2025-05-07T20:03:47.2221272Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:47.2221595Z 0x0000000000000017 (JMPREL) 0xb5390 2025-05-07T20:03:47.2221954Z 0x0000000000000007 (RELA) 0x99f28 2025-05-07T20:03:47.2222318Z 0x0000000000000008 (RELASZ) 111720 (bytes) 2025-05-07T20:03:47.2222710Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:47.2223043Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:47.2223397Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:47.2223802Z 0x000000006ffffffe (VERNEED) 0x99e08 2025-05-07T20:03:47.2224139Z 0x000000006fffffff (VERNEEDNUM) 4 2025-05-07T20:03:47.2224491Z 0x000000006ffffff0 (VERSYM) 0x98942 2025-05-07T20:03:47.2224821Z 0x000000006ffffff9 (RELACOUNT) 40 2025-05-07T20:03:47.2225163Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:47.2225366Z 2025-05-07T20:03:47.2225485Z ################################################################################ 2025-05-07T20:03:47.2225723Z 2025-05-07T20:03:47.2225727Z 2025-05-07T20:03:47.2225894Z ################################################################################ 2025-05-07T20:03:47.2226448Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:03:47.2226997Z [CHECK] Listing out library size: 2025-05-07T20:03:47.2227471Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:03:47.2227878Z 2025-05-07T20:03:47.2228091Z 18 ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:03:47.2228412Z 2025-05-07T20:03:47.2228833Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:03:47.2229819Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_index_select.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:47.2230437Z 2025-05-07T20:03:47.2313867Z GLIBC_2.2.5 2025-05-07T20:03:47.2314524Z GLIBC_2.3 2025-05-07T20:03:47.2315122Z GLIBC_2.14 2025-05-07T20:03:47.2315462Z 2025-05-07T20:03:47.2315475Z 2025-05-07T20:03:47.2316261Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:03:47.2317336Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_index_select.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:47.2317993Z 2025-05-07T20:03:47.2431761Z GLIBCXX_3.4 2025-05-07T20:03:47.2432443Z GLIBCXX_3.4.9 2025-05-07T20:03:47.2433047Z GLIBCXX_3.4.11 2025-05-07T20:03:47.2433661Z GLIBCXX_3.4.15 2025-05-07T20:03:47.2434230Z GLIBCXX_3.4.18 2025-05-07T20:03:47.2434828Z GLIBCXX_3.4.20 2025-05-07T20:03:47.2435405Z GLIBCXX_3.4.21 2025-05-07T20:03:47.2435785Z 2025-05-07T20:03:47.2435799Z 2025-05-07T20:03:47.2454626Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_index_select.so > /tmp/tmp.B3RfqAm612.symbols.txt 2025-05-07T20:03:47.2456086Z 2025-05-07T20:03:47.2533479Z 2025-05-07T20:03:47.2559124Z [CHECK] Total Number of symbols: 1448 2025-05-07T20:03:47.2577725Z [CHECK] Number of fbgemm symbols: 213 2025-05-07T20:03:47.2595141Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_index_select.so > /tmp/tmp.ylbr7ntNUc.usymbols.txt 2025-05-07T20:03:47.2596630Z 2025-05-07T20:03:47.2617483Z 2025-05-07T20:03:47.2641884Z [CHECK] Listing out undefined symbols (277 total): 2025-05-07T20:03:47.2656842Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:47.2659368Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:47.2660933Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:47.2661966Z U __cudaPopCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:47.2663109Z U __cudaPushCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:47.2664192Z U __cudaRegisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:47.2664582Z U __cudaRegisterFatBinaryEnd@libcudart.so.11.0 2025-05-07T20:03:47.2664992Z U __cudaRegisterFunction@libcudart.so.11.0 2025-05-07T20:03:47.2665373Z U __cudaRegisterVar@libcudart.so.11.0 2025-05-07T20:03:47.2665885Z U __cudaUnregisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:47.2666281Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:03:47.2666615Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:03:47.2666965Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:47.2667285Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:03:47.2667630Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:03:47.2667970Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:03:47.2668292Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:03:47.2668634Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:03:47.2669814Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:03:47.2670194Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:03:47.2670515Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:03:47.2670860Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:47.2671193Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:03:47.2671648Z U adjust_info_B_num_bits(int, int) 2025-05-07T20:03:47.2672050Z U at::Context::deterministicAlgorithms() const 2025-05-07T20:03:47.2672482Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:03:47.2672922Z U at::RecordFunction::currentThreadId() 2025-05-07T20:03:47.2673285Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:03:47.2673678Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:03:47.2674140Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:03:47.2674557Z U at::TensorMaker::make_tensor() 2025-05-07T20:03:47.2674917Z U at::_ops::all::call(at::Tensor const&) 2025-05-07T20:03:47.2675299Z U at::_ops::concat::call(c10::ArrayRef, long) 2025-05-07T20:03:47.2675747Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:03:47.2676634Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:47.2677962Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:47.2678926Z U at::_ops::eq_Scalar::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:03:47.2679458Z U at::_ops::eq_Tensor::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:03:47.2679937Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:03:47.2680441Z U at::_ops::gt_Scalar::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:03:47.2681031Z U at::_ops::index_add::call(at::Tensor const&, long, at::Tensor const&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:03:47.2681685Z U at::_ops::index_select::call(at::Tensor const&, long, at::Tensor const&) 2025-05-07T20:03:47.2682151Z U at::_ops::max::call(at::Tensor const&) 2025-05-07T20:03:47.2682550Z U at::_ops::mul_Tensor::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:03:47.2683057Z U at::_ops::narrow::call(at::Tensor const&, long, c10::SymInt, c10::SymInt) 2025-05-07T20:03:47.2683551Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:03:47.2684224Z U at::_ops::split_with_sizes::call(at::Tensor const&, c10::ArrayRef, long) 2025-05-07T20:03:47.2684899Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:03:47.2685956Z U at::_ops::to_dtype_layout::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, bool, bool, std::optional) 2025-05-07T20:03:47.2686879Z U at::_ops::view::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:03:47.2687344Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:03:47.2688257Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:47.2689449Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:47.2690349Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:03:47.2690874Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:03:47.2691321Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:03:47.2691751Z U at::globalContext() 2025-05-07T20:03:47.2692118Z U at::has_internal_overlap(at::TensorBase const&) 2025-05-07T20:03:47.2692500Z U at::sequence_number::get_and_increment() 2025-05-07T20:03:47.2692877Z U bool at::Tensor::item() const 2025-05-07T20:03:47.2693230Z U c10::AnyType::get() 2025-05-07T20:03:47.2693597Z U c10::AutogradMetaInterface::~AutogradMetaInterface() 2025-05-07T20:03:47.2694102Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:47.2694525Z U c10::BoolType::get() 2025-05-07T20:03:47.2694909Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:03:47.2695376Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:03:47.2695784Z U c10::Dispatcher::realSingleton() 2025-05-07T20:03:47.2696542Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:03:47.2697792Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:03:47.2698975Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:03:47.2699594Z U c10::Error::what() const 2025-05-07T20:03:47.2699914Z U c10::GradMode::is_enabled() 2025-05-07T20:03:47.2700271Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:03:47.2700671Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:47.2701146Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:03:47.2701568Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:03:47.2701913Z U c10::IValue::isBoolList() const 2025-05-07T20:03:47.2702268Z U c10::IValue::isIntList() const 2025-05-07T20:03:47.2702602Z U c10::IValue::isSymIntList() const 2025-05-07T20:03:47.2702967Z U c10::IValue::isTensorList() const 2025-05-07T20:03:47.2703359Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:03:47.2703723Z U c10::IntType::get() 2025-05-07T20:03:47.2704605Z U c10::ListType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:47.2705354Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:03:47.2705776Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:03:47.2706194Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:03:47.2706550Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:03:47.2707106Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:03:47.2707819Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:47.2708226Z U c10::StringType::get() 2025-05-07T20:03:47.2708587Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:03:47.2709286Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:03:47.2710042Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:03:47.2710415Z U c10::SymInt::promote_to_negative() 2025-05-07T20:03:47.2710793Z U c10::SymInt::toSymNode() const 2025-05-07T20:03:47.2711127Z U c10::SymIntType::get() 2025-05-07T20:03:47.2711584Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:03:47.2712003Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:03:47.2712686Z U c10::TensorImpl::set_autograd_meta(std::unique_ptr >) 2025-05-07T20:03:47.2713434Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:47.2713809Z U c10::TensorType::get() 2025-05-07T20:03:47.2714243Z U c10::Type::isSubtypeOfExt(c10::Type const&, std::ostream*) const 2025-05-07T20:03:47.2714687Z U c10::Type::is_module() const 2025-05-07T20:03:47.2715032Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:03:47.2716014Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:03:47.2716990Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:03:47.2717390Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:03:47.2717770Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:03:47.2718129Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:03:47.2718502Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:03:47.2718862Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:03:47.2719430Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:03:47.2719940Z U c10::cuda::device_count() 2025-05-07T20:03:47.2720296Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:03:47.2720708Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:03:47.2721117Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:03:47.2721549Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:03:47.2721967Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:03:47.2722381Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:03:47.2723069Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:47.2724142Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:47.2725055Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:03:47.2725965Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:47.2727118Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:03:47.2728109Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:47.2729041Z U c10::impl::ExcludeDispatchKeyGuard::ExcludeDispatchKeyGuard(c10::DispatchKeySet) 2025-05-07T20:03:47.2729595Z U c10::impl::ExcludeDispatchKeyGuard::~ExcludeDispatchKeyGuard() 2025-05-07T20:03:47.2730048Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:03:47.2730393Z U c10::impl::GPUTrace::haveState 2025-05-07T20:03:47.2730926Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:03:47.2731541Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:03:47.2731967Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:03:47.2732401Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:03:47.2732786Z U c10::impl::device_guard_impl_registry 2025-05-07T20:03:47.2733143Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:03:47.2733537Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:03:47.2734138Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:03:47.2734754Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:03:47.2735132Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:03:47.2735543Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:03:47.2735936Z U c10::throwNullDataPtrError() 2025-05-07T20:03:47.2736262Z U c10::typeKindToString(c10::TypeKind) 2025-05-07T20:03:47.2736604Z U c10::warn(c10::Warning const&) 2025-05-07T20:03:47.2736914Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:03:47.2737328Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:03:47.2737733Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:03:47.2738101Z U cudaDeviceGetAttribute@libcudart.so.11.0 2025-05-07T20:03:47.2738505Z U cudaDeviceSynchronize@libcudart.so.11.0 2025-05-07T20:03:47.2738870Z U cudaEventCreateWithFlags@libcudart.so.11.0 2025-05-07T20:03:47.2739249Z U cudaEventDestroy@libcudart.so.11.0 2025-05-07T20:03:47.2739588Z U cudaEventElapsedTime@libcudart.so.11.0 2025-05-07T20:03:47.2739947Z U cudaEventQuery@libcudart.so.11.0 2025-05-07T20:03:47.2740279Z U cudaEventRecord@libcudart.so.11.0 2025-05-07T20:03:47.2740637Z U cudaEventSynchronize@libcudart.so.11.0 2025-05-07T20:03:47.2741002Z U cudaFuncSetAttribute@libcudart.so.11.0 2025-05-07T20:03:47.2741359Z U cudaGetDeviceProperties@libcudart.so.11.0 2025-05-07T20:03:47.2741741Z U cudaGetErrorString@libcudart.so.11.0 2025-05-07T20:03:47.2742082Z U cudaGetLastError@libcudart.so.11.0 2025-05-07T20:03:47.2742450Z U cudaLaunchKernel@libcudart.so.11.0 2025-05-07T20:03:47.2742785Z U cudaStreamQuery@libcudart.so.11.0 2025-05-07T20:03:47.2743156Z U cudaStreamSynchronize@libcudart.so.11.0 2025-05-07T20:03:47.2743507Z U cudaStreamWaitEvent@libcudart.so.11.0 2025-05-07T20:03:47.2743937Z U fbgemm_gpu::asynchronous_complete_cumsum_cpu(at::Tensor const&) 2025-05-07T20:03:47.2744430Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:47.2744787Z U free@GLIBC_2.2.5 2025-05-07T20:03:47.2745272Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:47.2745611Z U log2f@GLIBC_2.2.5 2025-05-07T20:03:47.2745929Z U long at::Tensor::item() const 2025-05-07T20:03:47.2746353Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:03:47.2746755Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:03:47.2747164Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:47.2747537Z U memcmp@GLIBC_2.2.5 2025-05-07T20:03:47.2747870Z U memcpy@GLIBC_2.14 2025-05-07T20:03:47.2748318Z U memmove@GLIBC_2.2.5 2025-05-07T20:03:47.2748696Z U memset@GLIBC_2.2.5 2025-05-07T20:03:47.2749037Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:03:47.2749462Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:47.2749826Z U realloc@GLIBC_2.2.5 2025-05-07T20:03:47.2750403Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:03:47.2751108Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:03:47.2752149Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:47.2753060Z U std::__cxx11::basic_string, std::allocator >::_M_append(char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:47.2754158Z U std::__cxx11::basic_string, std::allocator >::_M_assign(std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:47.2755234Z U std::__cxx11::basic_string, std::allocator >::_M_construct(unsigned long, char)@GLIBCXX_3.4.21 2025-05-07T20:03:47.2756150Z U std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:47.2757155Z U std::__cxx11::basic_string, std::allocator >::_M_mutate(unsigned long, unsigned long, char const*, unsigned long) 2025-05-07T20:03:47.2758290Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:47.2759468Z U std::__cxx11::basic_string, std::allocator >::swap(std::__cxx11::basic_string, std::allocator >&)@GLIBCXX_3.4.21 2025-05-07T20:03:47.2760494Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const@GLIBCXX_3.4.21 2025-05-07T20:03:47.2761338Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:03:47.2761946Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:03:47.2762314Z U std::__throw_bad_array_new_length() 2025-05-07T20:03:47.2762687Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:47.2763110Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:47.2763554Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:03:47.2763985Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:03:47.2764396Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:03:47.2764919Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:03:47.2765646Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:03:47.2766709Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:03:47.2767909Z U std::basic_ostream >& std::operator<< >(std::basic_ostream >&, char const*)@GLIBCXX_3.4 2025-05-07T20:03:47.2768720Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:03:47.2769136Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:03:47.2769495Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:03:47.2769875Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:03:47.2770004Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:03:47.2770126Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:03:47.2770267Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:03:47.2770466Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:03:47.2770711Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:03:47.2770868Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:03:47.2771115Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:03:47.2771261Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:03:47.2771703Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:03:47.2771875Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:03:47.2771992Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:47.2772098Z U strcmp@GLIBC_2.2.5 2025-05-07T20:03:47.2772203Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:47.2772358Z U torch::CppFunction::~CppFunction() 2025-05-07T20:03:47.2772952Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:03:47.2773468Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:03:47.2773741Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:03:47.2773870Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:03:47.2774198Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:03:47.2774387Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:03:47.2774599Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:03:47.2774821Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:03:47.2775174Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:03:47.2775333Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:03:47.2775553Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:03:47.2775736Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:03:47.2775872Z U torch::autograd::Node::assign_parent() 2025-05-07T20:03:47.2776035Z U torch::autograd::Node::metadata() 2025-05-07T20:03:47.2776180Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:03:47.2776435Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:03:47.2776730Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:03:47.2776879Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:03:47.2777101Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:03:47.2777459Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:03:47.2780147Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:03:47.2780327Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:03:47.2780483Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:03:47.2780651Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:03:47.2780829Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:03:47.2781242Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:03:47.2781604Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:03:47.2782178Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:03:47.2782314Z U typeinfo for c10::Error 2025-05-07T20:03:47.2782449Z U typeinfo for c10::Type 2025-05-07T20:03:47.2782595Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:03:47.2782735Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:03:47.2782872Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:03:47.2783020Z U typeinfo for torch::autograd::Node 2025-05-07T20:03:47.2783176Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:47.2783339Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:03:47.2783520Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:03:47.2783680Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:47.2783844Z U vtable for __cxxabiv1::__vmi_class_type_info@CXXABI_1.3 2025-05-07T20:03:47.2783971Z U vtable for c10::Error 2025-05-07T20:03:47.2784080Z U vtable for c10::ListType 2025-05-07T20:03:47.2784432Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:47.2784784Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:47.2785147Z U vtable for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:47.2785286Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:03:47.2785510Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:03:47.2785742Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:03:47.2785875Z U vtable for torch::autograd::AutogradMeta 2025-05-07T20:03:47.2786015Z U vtable for torch::autograd::Node 2025-05-07T20:03:47.2786219Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:03:47.2786357Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:47.2786489Z w _ITM_registerTMCloneTable 2025-05-07T20:03:47.2786600Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:47.2786714Z w __cxa_pure_virtual@CXXABI_1.3 2025-05-07T20:03:47.2786833Z w __gmon_start__ 2025-05-07T20:03:47.2786936Z w __pthread_key_create 2025-05-07T20:03:47.2787051Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:03:47.2787166Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:03:47.2787334Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:47.2787563Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:03:47.2787570Z 2025-05-07T20:03:47.2787677Z linux-vdso.so.1 (0x00007ffe73b95000) 2025-05-07T20:03:47.2787793Z libc10.so => not found 2025-05-07T20:03:47.2787893Z libc10_cuda.so => not found 2025-05-07T20:03:47.2788443Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007f6dd0efd000) 2025-05-07T20:03:47.2788929Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007f6dd0600000) 2025-05-07T20:03:47.2789027Z libtorch.so => not found 2025-05-07T20:03:47.2789128Z libtorch_cpu.so => not found 2025-05-07T20:03:47.2789230Z libtorch_cuda.so => not found 2025-05-07T20:03:47.2789350Z libcudart.so.11.0 => not found 2025-05-07T20:03:47.2789518Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f6dd039c000) 2025-05-07T20:03:47.2789649Z libm.so.6 => /lib64/libm.so.6 (0x00007f6dd02c1000) 2025-05-07T20:03:47.2789820Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f6dd234a000) 2025-05-07T20:03:47.2789975Z libc.so.6 => /lib64/libc.so.6 (0x00007f6dd00b9000) 2025-05-07T20:03:47.2790108Z /lib64/ld-linux-x86-64.so.2 (0x00007f6dd237e000) 2025-05-07T20:03:47.2790226Z libc10.so => not found 2025-05-07T20:03:47.2790326Z libc10_cuda.so => not found 2025-05-07T20:03:47.2790425Z libtorch.so => not found 2025-05-07T20:03:47.2790682Z libtorch_cpu.so => not found 2025-05-07T20:03:47.2790811Z libtorch_cuda.so => not found 2025-05-07T20:03:47.2791082Z libcudart.so.11.0 => not found 2025-05-07T20:03:47.2791181Z libtorch.so => not found 2025-05-07T20:03:47.2791353Z libc10.so => not found 2025-05-07T20:03:47.2791590Z libc10_cuda.so => not found 2025-05-07T20:03:47.2791694Z libtorch_cpu.so => not found 2025-05-07T20:03:47.2791801Z libtorch_cuda.so => not found 2025-05-07T20:03:47.2791929Z libcudart.so.11.0 => not found 2025-05-07T20:03:47.2791934Z 2025-05-07T20:03:47.2792048Z [CHECK] Displaying ELF information: 2025-05-07T20:03:47.2792310Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:03:47.2792318Z 2025-05-07T20:03:47.2792322Z 2025-05-07T20:03:47.2792518Z Dynamic section at offset 0x11a7a20 contains 41 entries: 2025-05-07T20:03:47.2792643Z Tag Type Name/Value 2025-05-07T20:03:47.2792842Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:03:47.2793073Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:03:47.2793387Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_sparse_async_cumsum.so] 2025-05-07T20:03:47.2793614Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_utils.so] 2025-05-07T20:03:47.2793816Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:47.2794017Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:47.2794216Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:47.2794422Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.11.0] 2025-05-07T20:03:47.2794670Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:47.2794889Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:03:47.2795081Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:47.2795278Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:47.2795486Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:03:47.2795718Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_index_select.so] 2025-05-07T20:03:47.2795908Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:03:47.2796020Z 0x000000000000000c (INIT) 0x4e000 2025-05-07T20:03:47.2796133Z 0x000000000000000d (FINI) 0x147b8c 2025-05-07T20:03:47.2796251Z 0x0000000000000019 (INIT_ARRAY) 0x11a6ca8 2025-05-07T20:03:47.2796391Z 0x000000000000001b (INIT_ARRAYSZ) 144 (bytes) 2025-05-07T20:03:47.2796512Z 0x000000000000001a (FINI_ARRAY) 0x11a6d38 2025-05-07T20:03:47.2796631Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:47.2796753Z 0x0000000000000004 (HASH) 0x238 2025-05-07T20:03:47.2796870Z 0x000000006ffffef5 (GNU_HASH) 0x2890 2025-05-07T20:03:47.2796980Z 0x0000000000000005 (STRTAB) 0xd658 2025-05-07T20:03:47.2797101Z 0x0000000000000006 (SYMTAB) 0x4e80 2025-05-07T20:03:47.2797233Z 0x000000000000000a (STRSZ) 223344 (bytes) 2025-05-07T20:03:47.2797349Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:47.2797466Z 0x0000000000000003 (PLTGOT) 0x11a7cf0 2025-05-07T20:03:47.2797611Z 0x0000000000000002 (PLTRELSZ) 11832 (bytes) 2025-05-07T20:03:47.2797721Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:47.2797833Z 0x0000000000000017 (JMPREL) 0x4ac08 2025-05-07T20:03:47.2797988Z 0x0000000000000007 (RELA) 0x44b90 2025-05-07T20:03:47.2798120Z 0x0000000000000008 (RELASZ) 24696 (bytes) 2025-05-07T20:03:47.2798239Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:47.2798349Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:47.2798473Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:47.2798591Z 0x000000006ffffffe (VERNEED) 0x44a20 2025-05-07T20:03:47.2798698Z 0x000000006fffffff (VERNEEDNUM) 6 2025-05-07T20:03:47.2798831Z 0x000000006ffffff0 (VERSYM) 0x43ec8 2025-05-07T20:03:47.2798938Z 0x000000006ffffff9 (RELACOUNT) 29 2025-05-07T20:03:47.2799038Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:47.2799043Z 2025-05-07T20:03:47.2799178Z ################################################################################ 2025-05-07T20:03:47.2799182Z 2025-05-07T20:03:47.2799186Z 2025-05-07T20:03:47.2799304Z ################################################################################ 2025-05-07T20:03:47.2799614Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:03:47.2799736Z [CHECK] Listing out library size: 2025-05-07T20:03:47.2800032Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:03:47.2800037Z 2025-05-07T20:03:47.2800291Z 1 ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:03:47.2800296Z 2025-05-07T20:03:47.2800731Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:03:47.2801255Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_embedding_inplace_ops.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:47.2801260Z 2025-05-07T20:03:47.2822919Z GLIBC_2.2.5 2025-05-07T20:03:47.2823328Z GLIBC_2.3 2025-05-07T20:03:47.2823582Z GLIBC_2.14 2025-05-07T20:03:47.2823621Z 2025-05-07T20:03:47.2823828Z 2025-05-07T20:03:47.2825171Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:03:47.2826918Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_embedding_inplace_ops.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:47.2826934Z 2025-05-07T20:03:47.2875528Z GLIBCXX_3.4 2025-05-07T20:03:47.2875800Z GLIBCXX_3.4.9 2025-05-07T20:03:47.2876029Z GLIBCXX_3.4.18 2025-05-07T20:03:47.2876250Z GLIBCXX_3.4.20 2025-05-07T20:03:47.2876471Z GLIBCXX_3.4.21 2025-05-07T20:03:47.2876829Z 2025-05-07T20:03:47.2876960Z 2025-05-07T20:03:47.2897990Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_embedding_inplace_ops.so > /tmp/tmp.H3xnJU0KR6.symbols.txt 2025-05-07T20:03:47.2898021Z 2025-05-07T20:03:47.2919410Z 2025-05-07T20:03:47.2947696Z [CHECK] Total Number of symbols: 345 2025-05-07T20:03:47.2959077Z [CHECK] Number of fbgemm symbols: 56 2025-05-07T20:03:47.2977964Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_embedding_inplace_ops.so > /tmp/tmp.sxABAnOHKb.usymbols.txt 2025-05-07T20:03:47.2978007Z 2025-05-07T20:03:47.2993786Z 2025-05-07T20:03:47.3019113Z [CHECK] Listing out undefined symbols (128 total): 2025-05-07T20:03:47.3034283Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:47.3035365Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:47.3035657Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:47.3036099Z U __cudaPopCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:47.3036525Z U __cudaPushCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:47.3036901Z U __cudaRegisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:47.3037552Z U __cudaRegisterFatBinaryEnd@libcudart.so.11.0 2025-05-07T20:03:47.3037956Z U __cudaRegisterFunction@libcudart.so.11.0 2025-05-07T20:03:47.3038298Z U __cudaRegisterVar@libcudart.so.11.0 2025-05-07T20:03:47.3038688Z U __cudaUnregisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:47.3038997Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:03:47.3039302Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:47.3039595Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:03:47.3039902Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:03:47.3040206Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:03:47.3040500Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:03:47.3040838Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:03:47.3040960Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:03:47.3041071Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:47.3041292Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:03:47.3041883Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:47.3042570Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:47.3042750Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:03:47.3042891Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:03:47.3042991Z U c10::IntType::get() 2025-05-07T20:03:47.3043159Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:03:47.3043298Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:03:47.3043514Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:47.3044090Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:03:47.3044244Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:03:47.3044352Z U c10::TensorType::get() 2025-05-07T20:03:47.3044472Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:03:47.3045162Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:03:47.3045299Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:03:47.3045421Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:03:47.3045571Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:03:47.3045686Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:03:47.3045805Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:03:47.3045941Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:03:47.3046181Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:03:47.3046294Z U c10::cuda::device_count() 2025-05-07T20:03:47.3046449Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:03:47.3046585Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:03:47.3046724Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:03:47.3046884Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:03:47.3047044Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:03:47.3047187Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:03:47.3047697Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:47.3047944Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:03:47.3048415Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:47.3048759Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:03:47.3048878Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:03:47.3048989Z U c10::impl::GPUTrace::haveState 2025-05-07T20:03:47.3049136Z U c10::impl::device_guard_impl_registry 2025-05-07T20:03:47.3049280Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:03:47.3049419Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:03:47.3049552Z U c10::warn(c10::Warning const&) 2025-05-07T20:03:47.3049737Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:03:47.3049895Z U cudaDeviceSynchronize@libcudart.so.11.0 2025-05-07T20:03:47.3050048Z U cudaEventCreateWithFlags@libcudart.so.11.0 2025-05-07T20:03:47.3050164Z U cudaEventDestroy@libcudart.so.11.0 2025-05-07T20:03:47.3050287Z U cudaEventElapsedTime@libcudart.so.11.0 2025-05-07T20:03:47.3050417Z U cudaEventQuery@libcudart.so.11.0 2025-05-07T20:03:47.3050533Z U cudaEventRecord@libcudart.so.11.0 2025-05-07T20:03:47.3050661Z U cudaEventSynchronize@libcudart.so.11.0 2025-05-07T20:03:47.3050797Z U cudaGetDeviceProperties@libcudart.so.11.0 2025-05-07T20:03:47.3050936Z U cudaGetErrorString@libcudart.so.11.0 2025-05-07T20:03:47.3051075Z U cudaGetLastError@libcudart.so.11.0 2025-05-07T20:03:47.3051252Z U cudaLaunchKernel@libcudart.so.11.0 2025-05-07T20:03:47.3051397Z U cudaStreamQuery@libcudart.so.11.0 2025-05-07T20:03:47.3051530Z U cudaStreamSynchronize@libcudart.so.11.0 2025-05-07T20:03:47.3051658Z U cudaStreamWaitEvent@libcudart.so.11.0 2025-05-07T20:03:47.3051822Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:47.3051993Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:03:47.3052146Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:47.3052246Z U memcpy@GLIBC_2.14 2025-05-07T20:03:47.3052371Z U memset@GLIBC_2.2.5 2025-05-07T20:03:47.3052527Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:03:47.3052650Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:47.3053002Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:03:47.3053372Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:47.3053931Z U std::__cxx11::basic_string, std::allocator >::_M_append(char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:47.3054494Z U std::__cxx11::basic_string, std::allocator >::_M_assign(std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:47.3054901Z U std::__cxx11::basic_string, std::allocator >::_M_construct(unsigned long, char)@GLIBCXX_3.4.21 2025-05-07T20:03:47.3055332Z U std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:47.3055786Z U std::__cxx11::basic_string, std::allocator >::_M_mutate(unsigned long, unsigned long, char const*, unsigned long) 2025-05-07T20:03:47.3056292Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:47.3056641Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const@GLIBCXX_3.4.21 2025-05-07T20:03:47.3057011Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:03:47.3057138Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:03:47.3057282Z U std::__throw_bad_array_new_length() 2025-05-07T20:03:47.3057431Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:47.3057576Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:47.3057767Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:03:47.3058028Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:03:47.3058376Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:03:47.3058968Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:03:47.3059483Z U std::basic_ostream >& std::operator<< >(std::basic_ostream >&, char const*)@GLIBCXX_3.4 2025-05-07T20:03:47.3059680Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:03:47.3059809Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:03:47.3059936Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:03:47.3060080Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:03:47.3060202Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:03:47.3060326Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:03:47.3060538Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:03:47.3060778Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:03:47.3060911Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:03:47.3061030Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:47.3061156Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:47.3061287Z U torch::CppFunction::~CppFunction() 2025-05-07T20:03:47.3061873Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:03:47.3062350Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:03:47.3062612Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:03:47.3063150Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:03:47.3063375Z U unsigned char* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:47.3063574Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:47.3063884Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:03:47.3064047Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:47.3064406Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:47.3064763Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:47.3065106Z U vtable for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:47.3065310Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:03:47.3065555Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:03:47.3065675Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:47.3065792Z w _ITM_registerTMCloneTable 2025-05-07T20:03:47.3065922Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:47.3066019Z w __gmon_start__ 2025-05-07T20:03:47.3066131Z w __pthread_key_create 2025-05-07T20:03:47.3066303Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:47.3066577Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:03:47.3066584Z 2025-05-07T20:03:47.3081486Z linux-vdso.so.1 (0x00007ffdeadb6000) 2025-05-07T20:03:47.3081662Z libtorch.so => not found 2025-05-07T20:03:47.3081780Z libc10.so => not found 2025-05-07T20:03:47.3081883Z libc10_cuda.so => not found 2025-05-07T20:03:47.3081990Z libtorch_cpu.so => not found 2025-05-07T20:03:47.3082349Z libtorch_cuda.so => not found 2025-05-07T20:03:47.3082469Z libcudart.so.11.0 => not found 2025-05-07T20:03:47.3082661Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007fd5e7779000) 2025-05-07T20:03:47.3082945Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fd5e774b000) 2025-05-07T20:03:47.3083119Z libc.so.6 => /lib64/libc.so.6 (0x00007fd5e7543000) 2025-05-07T20:03:47.3083269Z /lib64/ld-linux-x86-64.so.2 (0x00007fd5e7a30000) 2025-05-07T20:03:47.3083429Z libm.so.6 => /lib64/libm.so.6 (0x00007fd5e7468000) 2025-05-07T20:03:47.3086225Z 2025-05-07T20:03:47.3086419Z [CHECK] Displaying ELF information: 2025-05-07T20:03:47.3087939Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:03:47.3087968Z 2025-05-07T20:03:47.3116865Z 2025-05-07T20:03:47.3117138Z Dynamic section at offset 0x4b598 contains 37 entries: 2025-05-07T20:03:47.3117360Z Tag Type Name/Value 2025-05-07T20:03:47.3117608Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:47.3117818Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:03:47.3118970Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:03:47.3119667Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:47.3120087Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:47.3120311Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.11.0] 2025-05-07T20:03:47.3120551Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:47.3120761Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:47.3120962Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:47.3121221Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:03:47.3121495Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_embedding_inplace_ops.so] 2025-05-07T20:03:47.3121742Z 0x000000000000000c (INIT) 0xe000 2025-05-07T20:03:47.3121901Z 0x000000000000000d (FINI) 0x2b16c 2025-05-07T20:03:47.3122028Z 0x0000000000000019 (INIT_ARRAY) 0x4b240 2025-05-07T20:03:47.3122166Z 0x000000000000001b (INIT_ARRAYSZ) 40 (bytes) 2025-05-07T20:03:47.3122287Z 0x000000000000001a (FINI_ARRAY) 0x4b268 2025-05-07T20:03:47.3122439Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:47.3122563Z 0x0000000000000004 (HASH) 0x238 2025-05-07T20:03:47.3122690Z 0x000000006ffffef5 (GNU_HASH) 0x1208 2025-05-07T20:03:47.3122949Z 0x0000000000000005 (STRTAB) 0x3da0 2025-05-07T20:03:47.3123058Z 0x0000000000000006 (SYMTAB) 0x1d30 2025-05-07T20:03:47.3123198Z 0x000000000000000a (STRSZ) 30770 (bytes) 2025-05-07T20:03:47.3123432Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:47.3123572Z 0x0000000000000003 (PLTGOT) 0x4b838 2025-05-07T20:03:47.3123712Z 0x0000000000000002 (PLTRELSZ) 4272 (bytes) 2025-05-07T20:03:47.3123830Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:47.3123966Z 0x0000000000000017 (JMPREL) 0xc9c8 2025-05-07T20:03:47.3124075Z 0x0000000000000007 (RELA) 0xb9a8 2025-05-07T20:03:47.3124200Z 0x0000000000000008 (RELASZ) 4128 (bytes) 2025-05-07T20:03:47.3124343Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:47.3124500Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:47.3124628Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:47.3124744Z 0x000000006ffffffe (VERNEED) 0xb888 2025-05-07T20:03:47.3124881Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:03:47.3124998Z 0x000000006ffffff0 (VERSYM) 0xb5d2 2025-05-07T20:03:47.3125111Z 0x000000006ffffff9 (RELACOUNT) 10 2025-05-07T20:03:47.3125246Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:47.3125266Z 2025-05-07T20:03:47.3125390Z ################################################################################ 2025-05-07T20:03:47.3125433Z 2025-05-07T20:03:47.3125437Z 2025-05-07T20:03:47.3127089Z ################################################################################ 2025-05-07T20:03:47.3127420Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:03:47.3127524Z [CHECK] Listing out library size: 2025-05-07T20:03:47.3127823Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:03:47.3127828Z 2025-05-07T20:03:47.3137412Z 497 ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:03:47.3137493Z 2025-05-07T20:03:47.3138052Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:03:47.3138631Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:47.3138647Z 2025-05-07T20:03:47.4938727Z GLIBC_2.2.5 2025-05-07T20:03:47.4939047Z GLIBC_2.3 2025-05-07T20:03:47.4939342Z GLIBC_2.14 2025-05-07T20:03:47.4939362Z 2025-05-07T20:03:47.4939376Z 2025-05-07T20:03:47.4940725Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:03:47.4941732Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:47.4941737Z 2025-05-07T20:03:47.6741305Z GLIBCXX_3.4 2025-05-07T20:03:47.6742126Z GLIBCXX_3.4.9 2025-05-07T20:03:47.6742307Z GLIBCXX_3.4.11 2025-05-07T20:03:47.6742409Z GLIBCXX_3.4.14 2025-05-07T20:03:47.6742512Z GLIBCXX_3.4.15 2025-05-07T20:03:47.6742619Z GLIBCXX_3.4.18 2025-05-07T20:03:47.6742704Z GLIBCXX_3.4.20 2025-05-07T20:03:47.6742789Z GLIBCXX_3.4.21 2025-05-07T20:03:47.6742799Z 2025-05-07T20:03:47.6744544Z 2025-05-07T20:03:47.6759807Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward.so > /tmp/tmp.p7H3lfEOyO.symbols.txt 2025-05-07T20:03:47.6759833Z 2025-05-07T20:03:47.8546821Z 2025-05-07T20:03:47.8622373Z [CHECK] Total Number of symbols: 12207 2025-05-07T20:03:47.8708642Z [CHECK] Number of fbgemm symbols: 2031 2025-05-07T20:03:47.8723950Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward.so > /tmp/tmp.b07IZYab2p.usymbols.txt 2025-05-07T20:03:47.8725530Z 2025-05-07T20:03:47.8793048Z 2025-05-07T20:03:47.8820423Z [CHECK] Listing out undefined symbols (298 total): 2025-05-07T20:03:47.8833946Z U GOMP_parallel 2025-05-07T20:03:47.8835777Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:47.8838138Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:47.8839720Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:47.8840782Z U __cudaPopCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:47.8841955Z U __cudaPushCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:47.8842544Z U __cudaRegisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:47.8843260Z U __cudaRegisterFatBinaryEnd@libcudart.so.11.0 2025-05-07T20:03:47.8843641Z U __cudaRegisterFunction@libcudart.so.11.0 2025-05-07T20:03:47.8844019Z U __cudaRegisterVar@libcudart.so.11.0 2025-05-07T20:03:47.8844377Z U __cudaUnregisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:47.8844758Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:03:47.8845078Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:03:47.8845415Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:47.8845759Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:03:47.8846074Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:03:47.8846480Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:03:47.8846850Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:03:47.8847185Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:03:47.8847500Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:03:47.8847831Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:03:47.8848141Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:03:47.8848473Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:47.8848821Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:03:47.8849152Z U at::CUDAGeneratorImpl::device_type() 2025-05-07T20:03:47.8849571Z U at::CUDAGeneratorImpl::philox_cuda_state(unsigned long) 2025-05-07T20:03:47.8849989Z U at::Context::deterministicAlgorithms() const 2025-05-07T20:03:47.8850428Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:03:47.8850823Z U at::RecordFunction::currentThreadId() 2025-05-07T20:03:47.8851204Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:03:47.8851602Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:03:47.8852033Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:03:47.8852657Z U at::_ops::clamp::call(at::Tensor const&, std::optional const&, std::optional const&) 2025-05-07T20:03:47.8853222Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:03:47.8854251Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:47.8855662Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:47.8856668Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:03:47.8857721Z U at::_ops::sparse_coo_tensor_indices_size::call(at::Tensor const&, at::Tensor const&, c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:47.8858991Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:03:47.8859563Z U at::_ops::unsqueeze::call(at::Tensor const&, long) 2025-05-07T20:03:47.8859972Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:03:47.8860726Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:47.8861835Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:47.8862706Z U at::cuda::detail::getDefaultCUDAGenerator(signed char) 2025-05-07T20:03:47.8863140Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:03:47.8863507Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:03:47.8864104Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:03:47.8864479Z U at::get_num_threads() 2025-05-07T20:03:47.8864828Z U at::get_thread_num() 2025-05-07T20:03:47.8865169Z U at::globalContext() 2025-05-07T20:03:47.8865484Z U at::in_parallel_region() 2025-05-07T20:03:47.8865840Z U at::init_num_threads() 2025-05-07T20:03:47.8866202Z U at::internal::set_thread_num(int) 2025-05-07T20:03:47.8866617Z U at::sequence_number::get_and_increment() 2025-05-07T20:03:47.8867034Z U at::tensor(c10::ArrayRef, c10::TensorOptions const&) 2025-05-07T20:03:47.8867510Z U at::toAccumulateType(c10::ScalarType, bool) 2025-05-07T20:03:47.8867890Z U c10::AnyType::get() 2025-05-07T20:03:47.8868300Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:47.8868752Z U c10::BoolType::get() 2025-05-07T20:03:47.8869115Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:03:47.8869588Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:03:47.8870031Z U c10::Dispatcher::realSingleton() 2025-05-07T20:03:47.8870772Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:03:47.8872344Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:03:47.8873501Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:03:47.8874106Z U c10::Error::what() const 2025-05-07T20:03:47.8874459Z U c10::FloatType::get() 2025-05-07T20:03:47.8874798Z U c10::GeneratorImpl::device() const 2025-05-07T20:03:47.8875181Z U c10::GradMode::is_enabled() 2025-05-07T20:03:47.8875519Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:03:47.8875975Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:03:47.8876469Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:47.8876948Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:03:47.8877387Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:03:47.8877749Z U c10::IValue::isBoolList() const 2025-05-07T20:03:47.8878229Z U c10::IValue::isIntList() const 2025-05-07T20:03:47.8878555Z U c10::IValue::isSymIntList() const 2025-05-07T20:03:47.8878915Z U c10::IValue::isTensorList() const 2025-05-07T20:03:47.8879299Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:03:47.8879646Z U c10::IntType::get() 2025-05-07T20:03:47.8880027Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:03:47.8880426Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:03:47.8901489Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:03:47.8902127Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:03:47.8902625Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:47.8903282Z U c10::ParallelGuard::ParallelGuard(bool) 2025-05-07T20:03:47.8903842Z U c10::ParallelGuard::~ParallelGuard() 2025-05-07T20:03:47.8904407Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:03:47.8904977Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:47.8905383Z U c10::StringType::get() 2025-05-07T20:03:47.8905754Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:03:47.8906189Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:03:47.8906905Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:03:47.8907610Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:03:47.8908058Z U c10::SymInt::promote_to_negative() 2025-05-07T20:03:47.8908399Z U c10::SymInt::toSymNode() const 2025-05-07T20:03:47.8908759Z U c10::SymIntType::get() 2025-05-07T20:03:47.8909132Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:03:47.8909564Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:03:47.8909995Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:47.8910379Z U c10::TensorType::get() 2025-05-07T20:03:47.8910750Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:03:47.8912007Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:03:47.8913035Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:03:47.8913462Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:03:47.8913837Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:03:47.8914230Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:03:47.8914588Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:03:47.8914981Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:03:47.8915496Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:03:47.8915978Z U c10::cuda::device_count() 2025-05-07T20:03:47.8916362Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:03:47.8916762Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:03:47.8917257Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:03:47.8917677Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:03:47.8918129Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:03:47.8918561Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:03:47.8919236Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:47.8920338Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:47.8921274Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:03:47.8922163Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:47.8923161Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:03:47.8924388Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:47.8925146Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:03:47.8925480Z U c10::impl::GPUTrace::haveState 2025-05-07T20:03:47.8925996Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:03:47.8926602Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:03:47.8927042Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:03:47.8927447Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:03:47.8927873Z U c10::impl::device_guard_impl_registry 2025-05-07T20:03:47.8928233Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:03:47.8928615Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:03:47.8929246Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:03:47.8929840Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:03:47.8930233Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:03:47.8930614Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:03:47.8931025Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:03:47.8931452Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:03:47.8931818Z U c10::operator<=(c10::SymInt const&, int) 2025-05-07T20:03:47.8932199Z U c10::operator>(c10::SymInt const&, int) 2025-05-07T20:03:47.8932557Z U c10::operator>=(c10::SymInt const&, int) 2025-05-07T20:03:47.8932929Z U c10::throwNullDataPtrError() 2025-05-07T20:03:47.8933255Z U c10::warn(c10::Warning const&) 2025-05-07T20:03:47.8933612Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:03:47.8934046Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:03:47.8934470Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:03:47.8934834Z U cudaDeviceGetAttribute@libcudart.so.11.0 2025-05-07T20:03:47.8935185Z U cudaDeviceSynchronize@libcudart.so.11.0 2025-05-07T20:03:47.8935573Z U cudaEventCreateWithFlags@libcudart.so.11.0 2025-05-07T20:03:47.8935918Z U cudaEventDestroy@libcudart.so.11.0 2025-05-07T20:03:47.8936281Z U cudaEventElapsedTime@libcudart.so.11.0 2025-05-07T20:03:47.8936631Z U cudaEventQuery@libcudart.so.11.0 2025-05-07T20:03:47.8936946Z U cudaEventRecord@libcudart.so.11.0 2025-05-07T20:03:47.8937269Z U cudaEventSynchronize@libcudart.so.11.0 2025-05-07T20:03:47.8937606Z U cudaFuncSetAttribute@libcudart.so.11.0 2025-05-07T20:03:47.8937963Z U cudaGetDeviceProperties@libcudart.so.11.0 2025-05-07T20:03:47.8938301Z U cudaGetErrorString@libcudart.so.11.0 2025-05-07T20:03:47.8938645Z U cudaGetLastError@libcudart.so.11.0 2025-05-07T20:03:47.8938986Z U cudaLaunchKernel@libcudart.so.11.0 2025-05-07T20:03:47.8939296Z U cudaStreamQuery@libcudart.so.11.0 2025-05-07T20:03:47.8939656Z U cudaStreamSynchronize@libcudart.so.11.0 2025-05-07T20:03:47.8939998Z U cudaStreamWaitEvent@libcudart.so.11.0 2025-05-07T20:03:47.8940933Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:47.8942074Z U fbgemm::SparseAdaGradSignature::Type fbgemm::GenerateSparseAdaGrad(int, bool, int, bool) 2025-05-07T20:03:47.8942633Z U fbgemm::fbgemmAlignedFree(void*) 2025-05-07T20:03:47.8943030Z U fbgemm_gpu::asynchronous_complete_cumsum_gpu(at::Tensor const&) 2025-05-07T20:03:47.8943431Z U float at::Tensor::item() const 2025-05-07T20:03:47.8943799Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:03:47.8944196Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:47.8944543Z U free@GLIBC_2.2.5 2025-05-07T20:03:47.8944860Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:03:47.8945221Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:47.8945674Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:03:47.8946104Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:03:47.8946491Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:47.8946835Z U memcmp@GLIBC_2.2.5 2025-05-07T20:03:47.8947113Z U memcpy@GLIBC_2.14 2025-05-07T20:03:47.8947389Z U memmove@GLIBC_2.2.5 2025-05-07T20:03:47.8947662Z U memset@GLIBC_2.2.5 2025-05-07T20:03:47.8947957Z U omp_get_num_threads 2025-05-07T20:03:47.8948221Z U omp_get_thread_num 2025-05-07T20:03:47.8948560Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:03:47.8948917Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:47.8949457Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:03:47.8950170Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:03:47.8950846Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, long const*, long*, int, int, int, CUstream_st*) 2025-05-07T20:03:47.8951874Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:03:47.8952640Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:03:47.8953418Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, long const*, long*, int, int, int, CUstream_st*) 2025-05-07T20:03:47.8953972Z U realloc@GLIBC_2.2.5 2025-05-07T20:03:47.8954660Z U split_embedding_codegen_forward_cpu(at::Tensor, at::Tensor, at::Tensor, c10::SymInt, at::Tensor, at::Tensor, at::Tensor, long, at::Tensor, long) 2025-05-07T20:03:47.8955656Z U split_embedding_codegen_grad_indice_weights_cpu(at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor) 2025-05-07T20:03:47.8956298Z U sqrt@GLIBC_2.2.5 2025-05-07T20:03:47.8956592Z U sqrtf@GLIBC_2.2.5 2025-05-07T20:03:47.8957035Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:03:47.8957701Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:03:47.8958571Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:47.8959560Z U std::__cxx11::basic_string, std::allocator >::_M_append(char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:47.8960641Z U std::__cxx11::basic_string, std::allocator >::_M_assign(std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:47.8961715Z U std::__cxx11::basic_string, std::allocator >::_M_construct(unsigned long, char)@GLIBCXX_3.4.21 2025-05-07T20:03:47.8962636Z U std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:47.8963618Z U std::__cxx11::basic_string, std::allocator >::_M_mutate(unsigned long, unsigned long, char const*, unsigned long) 2025-05-07T20:03:47.8964716Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:47.8965693Z U std::__cxx11::basic_string, std::allocator >::append(char const*)@GLIBCXX_3.4.21 2025-05-07T20:03:47.8966789Z U std::__cxx11::basic_string, std::allocator >::swap(std::__cxx11::basic_string, std::allocator >&)@GLIBCXX_3.4.21 2025-05-07T20:03:47.8967795Z U std::__cxx11::basic_string, std::allocator >::~basic_string()@GLIBCXX_3.4.21 2025-05-07T20:03:47.8968561Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const@GLIBCXX_3.4.21 2025-05-07T20:03:47.8969395Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:03:47.8970031Z U std::__exception_ptr::exception_ptr::_M_addref() 2025-05-07T20:03:47.8970624Z U std::__exception_ptr::exception_ptr::_M_release() 2025-05-07T20:03:47.8971016Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:03:47.8971372Z U std::__throw_bad_array_new_length() 2025-05-07T20:03:47.8971757Z U std::__throw_bad_function_call()@GLIBCXX_3.4.14 2025-05-07T20:03:47.8972144Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:47.8972556Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:47.8972987Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:03:47.8973415Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:03:47.8973810Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:03:47.8974309Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:03:47.8975044Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:03:47.8976094Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:03:47.8977295Z U std::basic_ostream >& std::operator<< >(std::basic_ostream >&, char const*)@GLIBCXX_3.4 2025-05-07T20:03:47.8978051Z U std::current_exception()@CXXABI_1.3.3 2025-05-07T20:03:47.8978427Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:03:47.8978790Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:03:47.8979146Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:03:47.8979496Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:03:47.8979856Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:03:47.8980210Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:03:47.8980554Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:03:47.8980962Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:03:47.8981528Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:03:47.8982023Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:03:47.8982436Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:03:47.8982868Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:03:47.8983354Z U std::rethrow_exception(std::__exception_ptr::exception_ptr)@CXXABI_1.3.3 2025-05-07T20:03:47.8984353Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:03:47.8985020Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:03:47.8985395Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:47.8985684Z U strcmp@GLIBC_2.2.5 2025-05-07T20:03:47.8985971Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:47.8986258Z U torch::CppFunction::~CppFunction() 2025-05-07T20:03:47.8987045Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:03:47.8988125Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:03:47.8988898Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:03:47.8989376Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:03:47.8989869Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:03:47.8990442Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:03:47.8991374Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:03:47.8991892Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:03:47.8992563Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:03:47.8993180Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:03:47.8993653Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:03:47.8994153Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:03:47.8994644Z U torch::autograd::Node::assign_parent() 2025-05-07T20:03:47.8995006Z U torch::autograd::Node::metadata() 2025-05-07T20:03:47.8995371Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:03:47.8995873Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:03:47.8996519Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:03:47.8997046Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:03:47.8997532Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:03:47.8998076Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:03:47.9001149Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:03:47.9004211Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:03:47.9004635Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:03:47.9005071Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:03:47.9005501Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:03:47.9006210Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:03:47.9007105Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:03:47.9008129Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:03:47.9008892Z U typeinfo for c10::Error 2025-05-07T20:03:47.9009231Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:03:47.9009620Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:03:47.9009972Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:03:47.9010351Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:03:47.9010724Z U typeinfo for torch::autograd::Node 2025-05-07T20:03:47.9011990Z U void internal::csr2csc(internal::HyperCompressedSparseColumn&, int, at::TensorAccessor const&, at::TensorAccessor const&, at::TensorAccessor const&, long, int const*, long) 2025-05-07T20:03:47.9014166Z U void internal::csr2csc(internal::HyperCompressedSparseColumn&, int, at::TensorAccessor const&, at::TensorAccessor const&, at::TensorAccessor const&, long, int const*, long) 2025-05-07T20:03:47.9015515Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:47.9015906Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:03:47.9016359Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:03:47.9016777Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:47.9017126Z U vtable for c10::Error 2025-05-07T20:03:47.9017660Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:47.9018396Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:47.9019145Z U vtable for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:47.9019705Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:03:47.9020119Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:03:47.9020646Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:03:47.9021090Z U vtable for torch::autograd::Node 2025-05-07T20:03:47.9021464Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:03:47.9021857Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:47.9022154Z w _ITM_registerTMCloneTable 2025-05-07T20:03:47.9022499Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:47.9022782Z w __gmon_start__ 2025-05-07T20:03:47.9023059Z w __pthread_key_create 2025-05-07T20:03:47.9023348Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:03:47.9023673Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:03:47.9024035Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:47.9024489Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:03:47.9024824Z 2025-05-07T20:03:47.9024943Z linux-vdso.so.1 (0x00007ffe76fd6000) 2025-05-07T20:03:47.9025242Z libc10.so => not found 2025-05-07T20:03:47.9025482Z libc10_cuda.so => not found 2025-05-07T20:03:47.9026105Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so (0x00007f6795000000) 2025-05-07T20:03:47.9027158Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007f67b5a07000) 2025-05-07T20:03:47.9027867Z libtorch.so => not found 2025-05-07T20:03:47.9028342Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so (0x00007f6794a00000) 2025-05-07T20:03:47.9029206Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007f6794000000) 2025-05-07T20:03:47.9029828Z libtorch_cpu.so => not found 2025-05-07T20:03:47.9030105Z libtorch_cuda.so => not found 2025-05-07T20:03:47.9030379Z libcudart.so.11.0 => not found 2025-05-07T20:03:47.9030696Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f6793d9c000) 2025-05-07T20:03:47.9031071Z libm.so.6 => /lib64/libm.so.6 (0x00007f6795325000) 2025-05-07T20:03:47.9031502Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f67b59d5000) 2025-05-07T20:03:47.9032067Z libc.so.6 => /lib64/libc.so.6 (0x00007f6793b94000) 2025-05-07T20:03:47.9032521Z /lib64/ld-linux-x86-64.so.2 (0x00007f67b5b0e000) 2025-05-07T20:03:47.9032866Z libc10.so => not found 2025-05-07T20:03:47.9033108Z libc10_cuda.so => not found 2025-05-07T20:03:47.9033748Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so (0x00007f67b59ca000) 2025-05-07T20:03:47.9034412Z libtorch.so => not found 2025-05-07T20:03:47.9034670Z libtorch_cpu.so => not found 2025-05-07T20:03:47.9034960Z libtorch_cuda.so => not found 2025-05-07T20:03:47.9035234Z libcudart.so.11.0 => not found 2025-05-07T20:03:47.9035519Z libc10.so => not found 2025-05-07T20:03:47.9035796Z libc10_cuda.so => not found 2025-05-07T20:03:47.9036086Z libtorch.so => not found 2025-05-07T20:03:47.9036338Z libtorch_cpu.so => not found 2025-05-07T20:03:47.9036614Z libtorch_cuda.so => not found 2025-05-07T20:03:47.9036892Z libcudart.so.11.0 => not found 2025-05-07T20:03:47.9037162Z libc10.so => not found 2025-05-07T20:03:47.9037689Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so (0x00007f67952aa000) 2025-05-07T20:03:47.9038255Z libtorch.so => not found 2025-05-07T20:03:47.9038526Z libtorch_cpu.so => not found 2025-05-07T20:03:47.9038798Z libtorch_cuda.so => not found 2025-05-07T20:03:47.9039088Z libtorch.so => not found 2025-05-07T20:03:47.9039331Z libc10.so => not found 2025-05-07T20:03:47.9039583Z libc10_cuda.so => not found 2025-05-07T20:03:47.9039851Z libtorch_cpu.so => not found 2025-05-07T20:03:47.9040142Z libtorch_cuda.so => not found 2025-05-07T20:03:47.9040430Z libcudart.so.11.0 => not found 2025-05-07T20:03:47.9040700Z libtorch.so => not found 2025-05-07T20:03:47.9040956Z libc10.so => not found 2025-05-07T20:03:47.9041227Z libtorch_cpu.so => not found 2025-05-07T20:03:47.9041487Z libtorch_cuda.so => not found 2025-05-07T20:03:47.9041775Z libtorch_cpu.so => not found 2025-05-07T20:03:47.9042039Z libtorch_cuda.so => not found 2025-05-07T20:03:47.9042312Z libtorch.so => not found 2025-05-07T20:03:47.9042468Z 2025-05-07T20:03:47.9042627Z [CHECK] Displaying ELF information: 2025-05-07T20:03:47.9043084Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:03:47.9043464Z 2025-05-07T20:03:47.9043486Z 2025-05-07T20:03:47.9043647Z Dynamic section at offset 0x1effce18 contains 43 entries: 2025-05-07T20:03:47.9044139Z Tag Type Name/Value 2025-05-07T20:03:47.9044540Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:03:47.9045015Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:03:47.9045529Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_common.so] 2025-05-07T20:03:47.9046160Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_sparse_async_cumsum.so] 2025-05-07T20:03:47.9046680Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:47.9047156Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm.so] 2025-05-07T20:03:47.9047639Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_utils.so] 2025-05-07T20:03:47.9048158Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:47.9048650Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:47.9049132Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.11.0] 2025-05-07T20:03:47.9049632Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:47.9050094Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:03:47.9050567Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:47.9051025Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:47.9051508Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:03:47.9052073Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward.so] 2025-05-07T20:03:47.9052585Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:03:47.9052977Z 0x000000000000000c (INIT) 0x573000 2025-05-07T20:03:47.9053302Z 0x000000000000000d (FINI) 0x31fa10c 2025-05-07T20:03:47.9053644Z 0x0000000000000019 (INIT_ARRAY) 0x1effb908 2025-05-07T20:03:47.9053983Z 0x000000000000001b (INIT_ARRAYSZ) 1824 (bytes) 2025-05-07T20:03:47.9054332Z 0x000000000000001a (FINI_ARRAY) 0x1effc028 2025-05-07T20:03:47.9054707Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:47.9055025Z 0x0000000000000004 (HASH) 0x238 2025-05-07T20:03:47.9055348Z 0x000000006ffffef5 (GNU_HASH) 0xf0b8 2025-05-07T20:03:47.9055655Z 0x0000000000000005 (STRTAB) 0x67300 2025-05-07T20:03:47.9055974Z 0x0000000000000006 (SYMTAB) 0x1fa80 2025-05-07T20:03:47.9056306Z 0x000000000000000a (STRSZ) 4903735 (bytes) 2025-05-07T20:03:47.9056665Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:47.9056993Z 0x0000000000000003 (PLTGOT) 0x1effe108 2025-05-07T20:03:47.9057352Z 0x0000000000000002 (PLTRELSZ) 49656 (bytes) 2025-05-07T20:03:47.9057698Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:47.9058000Z 0x0000000000000017 (JMPREL) 0x566b38 2025-05-07T20:03:47.9058327Z 0x0000000000000007 (RELA) 0x51a728 2025-05-07T20:03:47.9058650Z 0x0000000000000008 (RELASZ) 312336 (bytes) 2025-05-07T20:03:47.9059005Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:47.9059312Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:47.9059633Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:47.9059962Z 0x000000006ffffffe (VERNEED) 0x51a598 2025-05-07T20:03:47.9060286Z 0x000000006fffffff (VERNEEDNUM) 6 2025-05-07T20:03:47.9060743Z 0x000000006ffffff0 (VERSYM) 0x514638 2025-05-07T20:03:47.9061059Z 0x000000006ffffff9 (RELACOUNT) 557 2025-05-07T20:03:47.9061368Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:47.9061559Z 2025-05-07T20:03:47.9061667Z ################################################################################ 2025-05-07T20:03:47.9061892Z 2025-05-07T20:03:47.9061896Z 2025-05-07T20:03:47.9062005Z ################################################################################ 2025-05-07T20:03:47.9062535Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:03:47.9063033Z [CHECK] Listing out library size: 2025-05-07T20:03:47.9063534Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:03:47.9063946Z 2025-05-07T20:03:47.9064193Z 76 ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:03:47.9064549Z 2025-05-07T20:03:47.9064959Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:03:47.9065971Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:47.9066566Z 2025-05-07T20:03:47.9237104Z GLIBC_2.2.5 2025-05-07T20:03:47.9237762Z GLIBC_2.3 2025-05-07T20:03:47.9238349Z GLIBC_2.14 2025-05-07T20:03:47.9238501Z 2025-05-07T20:03:47.9238505Z 2025-05-07T20:03:47.9238985Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:03:47.9240147Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:47.9240821Z 2025-05-07T20:03:47.9507699Z GLIBCXX_3.4 2025-05-07T20:03:47.9508376Z GLIBCXX_3.4.9 2025-05-07T20:03:47.9509003Z GLIBCXX_3.4.11 2025-05-07T20:03:47.9509587Z GLIBCXX_3.4.18 2025-05-07T20:03:47.9510181Z GLIBCXX_3.4.20 2025-05-07T20:03:47.9510745Z GLIBCXX_3.4.21 2025-05-07T20:03:47.9511098Z 2025-05-07T20:03:47.9511138Z 2025-05-07T20:03:47.9529392Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so > /tmp/tmp.crDv50RiKD.symbols.txt 2025-05-07T20:03:47.9529960Z 2025-05-07T20:03:47.9763145Z 2025-05-07T20:03:47.9792382Z [CHECK] Total Number of symbols: 1597 2025-05-07T20:03:47.9812781Z [CHECK] Number of fbgemm symbols: 228 2025-05-07T20:03:47.9829670Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so > /tmp/tmp.RTUf3Lusb4.usymbols.txt 2025-05-07T20:03:47.9830824Z 2025-05-07T20:03:47.9852645Z 2025-05-07T20:03:47.9877262Z [CHECK] Listing out undefined symbols (184 total): 2025-05-07T20:03:47.9894174Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:47.9896577Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:47.9898300Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:47.9899346Z U __cudaPopCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:47.9900529Z U __cudaPushCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:47.9901685Z U __cudaRegisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:47.9902206Z U __cudaRegisterFatBinaryEnd@libcudart.so.11.0 2025-05-07T20:03:47.9902584Z U __cudaRegisterFunction@libcudart.so.11.0 2025-05-07T20:03:47.9902952Z U __cudaRegisterVar@libcudart.so.11.0 2025-05-07T20:03:47.9903354Z U __cudaUnregisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:47.9903673Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:03:47.9903975Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:47.9904426Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:03:47.9904725Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:03:47.9905028Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:03:47.9905329Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:47.9905639Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:03:47.9905941Z U at::CUDAGeneratorImpl::device_type() 2025-05-07T20:03:47.9906314Z U at::CUDAGeneratorImpl::philox_cuda_state(unsigned long) 2025-05-07T20:03:47.9906709Z U at::Context::deterministicAlgorithms() const 2025-05-07T20:03:47.9907169Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:03:47.9907650Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:03:47.9908438Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:47.9909688Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:47.9910590Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:03:47.9911150Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:03:47.9912323Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:47.9913501Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:47.9914348Z U at::cuda::detail::getDefaultCUDAGenerator(signed char) 2025-05-07T20:03:47.9914755Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:03:47.9915085Z U at::globalContext() 2025-05-07T20:03:47.9915487Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:47.9915904Z U c10::BoolType::get() 2025-05-07T20:03:47.9916258Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:03:47.9916671Z U c10::FloatType::get() 2025-05-07T20:03:47.9916980Z U c10::GeneratorImpl::device() const 2025-05-07T20:03:47.9917424Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:47.9917895Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:03:47.9918266Z U c10::IntType::get() 2025-05-07T20:03:47.9918662Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:03:47.9919054Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:03:47.9919463Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:47.9919895Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:03:47.9920355Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:03:47.9921070Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:03:47.9921741Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:03:47.9922158Z U c10::SymInt::promote_to_negative() 2025-05-07T20:03:47.9922507Z U c10::SymIntType::get() 2025-05-07T20:03:47.9922904Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:03:47.9923388Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:47.9923774Z U c10::TensorType::get() 2025-05-07T20:03:47.9924257Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:03:47.9925153Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:03:47.9926087Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:03:47.9926469Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:03:47.9927901Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:03:47.9928299Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:03:47.9928619Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:03:47.9928929Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:03:47.9929372Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:03:47.9929802Z U c10::cuda::device_count() 2025-05-07T20:03:47.9930127Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:03:47.9930474Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:03:47.9930840Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:03:47.9931193Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:03:47.9931580Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:03:47.9931949Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:03:47.9932630Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:47.9933449Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:03:47.9934252Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:47.9935121Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:03:47.9936149Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:47.9936909Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:03:47.9937215Z U c10::impl::GPUTrace::haveState 2025-05-07T20:03:47.9937565Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:03:47.9937959Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:03:47.9938346Z U c10::impl::device_guard_impl_registry 2025-05-07T20:03:47.9938681Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:03:47.9939044Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:03:47.9939577Z U c10::throwNullDataPtrError() 2025-05-07T20:03:47.9939680Z U c10::warn(c10::Warning const&) 2025-05-07T20:03:47.9939785Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:03:47.9939984Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:03:47.9940102Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:03:47.9940236Z U cudaDeviceGetAttribute@libcudart.so.11.0 2025-05-07T20:03:47.9940369Z U cudaDeviceSynchronize@libcudart.so.11.0 2025-05-07T20:03:47.9940521Z U cudaEventCreateWithFlags@libcudart.so.11.0 2025-05-07T20:03:47.9940665Z U cudaEventDestroy@libcudart.so.11.0 2025-05-07T20:03:47.9940790Z U cudaEventElapsedTime@libcudart.so.11.0 2025-05-07T20:03:47.9940914Z U cudaEventQuery@libcudart.so.11.0 2025-05-07T20:03:47.9941028Z U cudaEventRecord@libcudart.so.11.0 2025-05-07T20:03:47.9941149Z U cudaEventSynchronize@libcudart.so.11.0 2025-05-07T20:03:47.9941279Z U cudaFuncSetAttribute@libcudart.so.11.0 2025-05-07T20:03:47.9941417Z U cudaGetDeviceProperties@libcudart.so.11.0 2025-05-07T20:03:47.9941541Z U cudaGetErrorString@libcudart.so.11.0 2025-05-07T20:03:47.9941831Z U cudaGetLastError@libcudart.so.11.0 2025-05-07T20:03:47.9942051Z U cudaLaunchKernel@libcudart.so.11.0 2025-05-07T20:03:47.9942192Z U cudaStreamQuery@libcudart.so.11.0 2025-05-07T20:03:47.9942318Z U cudaStreamSynchronize@libcudart.so.11.0 2025-05-07T20:03:47.9942464Z U cudaStreamWaitEvent@libcudart.so.11.0 2025-05-07T20:03:47.9944664Z U embedding_ops::split_embedding_backward_codegen_find_long_segments(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, int, bool) 2025-05-07T20:03:47.9944883Z U fbgemm_gpu::asynchronous_complete_cumsum_gpu(at::Tensor const&) 2025-05-07T20:03:47.9945009Z U float at::Tensor::item() const 2025-05-07T20:03:47.9945149Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:03:47.9945331Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:47.9945455Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:03:47.9945596Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:47.9945786Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:03:47.9945924Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:03:47.9946070Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:47.9946180Z U memcpy@GLIBC_2.14 2025-05-07T20:03:47.9946299Z U memset@GLIBC_2.2.5 2025-05-07T20:03:47.9946450Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:03:47.9946576Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:47.9946932Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:03:47.9947243Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:03:47.9947573Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:03:47.9947904Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:03:47.9948245Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:03:47.9948636Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:47.9949045Z U std::__cxx11::basic_string, std::allocator >::_M_append(char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:47.9949602Z U std::__cxx11::basic_string, std::allocator >::_M_assign(std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:47.9949996Z U std::__cxx11::basic_string, std::allocator >::_M_construct(unsigned long, char)@GLIBCXX_3.4.21 2025-05-07T20:03:47.9950398Z U std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:47.9950852Z U std::__cxx11::basic_string, std::allocator >::_M_mutate(unsigned long, unsigned long, char const*, unsigned long) 2025-05-07T20:03:47.9951501Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:47.9951880Z U std::__cxx11::basic_string, std::allocator >::append(char const*)@GLIBCXX_3.4.21 2025-05-07T20:03:47.9952217Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const@GLIBCXX_3.4.21 2025-05-07T20:03:47.9952587Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:03:47.9952704Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:03:47.9952832Z U std::__throw_bad_array_new_length() 2025-05-07T20:03:47.9952972Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:47.9953117Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:47.9953305Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:03:47.9953438Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:03:47.9953682Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:03:47.9954051Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:03:47.9954632Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:03:47.9955169Z U std::basic_ostream >& std::operator<< >(std::basic_ostream >&, char const*)@GLIBCXX_3.4 2025-05-07T20:03:47.9955311Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:03:47.9955443Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:03:47.9955570Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:03:47.9955710Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:03:47.9955822Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:03:47.9955940Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:03:47.9956136Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:03:47.9956369Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:03:47.9956491Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:03:47.9956612Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:47.9956713Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:47.9956838Z U torch::CppFunction::~CppFunction() 2025-05-07T20:03:47.9957444Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:03:47.9957954Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:03:47.9958219Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:03:47.9958595Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:03:47.9959148Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:03:47.9960732Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:03:47.9962186Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:03:47.9963567Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:03:47.9964980Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:03:47.9966259Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:03:47.9967510Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:03:47.9969328Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:03:47.9971192Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:03:47.9972956Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:03:47.9974789Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:03:47.9976543Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:03:47.9978536Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:03:47.9980474Z U void embedding_ops::split_embedding_backward_count_unique_indices_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int) 2025-05-07T20:03:47.9980637Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:47.9980808Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:03:47.9980965Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:47.9981314Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:47.9981648Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:47.9981990Z U vtable for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:47.9982205Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:03:47.9982427Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:03:47.9982567Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:47.9982675Z w _ITM_registerTMCloneTable 2025-05-07T20:03:47.9982791Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:47.9982879Z w __gmon_start__ 2025-05-07T20:03:47.9982973Z w __pthread_key_create 2025-05-07T20:03:47.9983094Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:03:47.9983204Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:03:47.9983356Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:47.9983620Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:03:47.9983652Z 2025-05-07T20:03:47.9983798Z linux-vdso.so.1 (0x00007fff237f5000) 2025-05-07T20:03:47.9983915Z libc10.so => not found 2025-05-07T20:03:47.9984026Z libc10_cuda.so => not found 2025-05-07T20:03:47.9984602Z fbgemm_gpu_tbe_training_backward.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward.so (0x00007fe4dcc00000) 2025-05-07T20:03:47.9984694Z libtorch.so => not found 2025-05-07T20:03:47.9984809Z libtorch_cpu.so => not found 2025-05-07T20:03:47.9984903Z libtorch_cuda.so => not found 2025-05-07T20:03:47.9984998Z libcudart.so.11.0 => not found 2025-05-07T20:03:47.9985167Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007fe4dc99c000) 2025-05-07T20:03:47.9985313Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fe5021e2000) 2025-05-07T20:03:47.9985437Z libc.so.6 => /lib64/libc.so.6 (0x00007fe4dc794000) 2025-05-07T20:03:47.9985563Z /lib64/ld-linux-x86-64.so.2 (0x00007fe502216000) 2025-05-07T20:03:47.9985654Z libc10.so => not found 2025-05-07T20:03:47.9985747Z libc10_cuda.so => not found 2025-05-07T20:03:47.9986219Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so (0x00007fe4dc400000) 2025-05-07T20:03:47.9986784Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007fe5020dd000) 2025-05-07T20:03:47.9986876Z libtorch.so => not found 2025-05-07T20:03:47.9987229Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so (0x00007fe4dbe00000) 2025-05-07T20:03:47.9987696Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007fe4db400000) 2025-05-07T20:03:47.9987793Z libtorch_cpu.so => not found 2025-05-07T20:03:47.9987893Z libtorch_cuda.so => not found 2025-05-07T20:03:47.9988044Z libcudart.so.11.0 => not found 2025-05-07T20:03:47.9988185Z libm.so.6 => /lib64/libm.so.6 (0x00007fe4dc6b9000) 2025-05-07T20:03:47.9988292Z libc10.so => not found 2025-05-07T20:03:47.9988397Z libc10_cuda.so => not found 2025-05-07T20:03:47.9988870Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so (0x00007fe5020ce000) 2025-05-07T20:03:47.9988977Z libtorch.so => not found 2025-05-07T20:03:47.9989086Z libtorch_cpu.so => not found 2025-05-07T20:03:47.9989216Z libtorch_cuda.so => not found 2025-05-07T20:03:47.9989323Z libcudart.so.11.0 => not found 2025-05-07T20:03:47.9989423Z libc10.so => not found 2025-05-07T20:03:47.9989526Z libc10_cuda.so => not found 2025-05-07T20:03:47.9989653Z libtorch.so => not found 2025-05-07T20:03:47.9989758Z libtorch_cpu.so => not found 2025-05-07T20:03:47.9989868Z libtorch_cuda.so => not found 2025-05-07T20:03:47.9990008Z libcudart.so.11.0 => not found 2025-05-07T20:03:47.9990106Z libc10.so => not found 2025-05-07T20:03:47.9990794Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so (0x00007fe4dc63e000) 2025-05-07T20:03:47.9990911Z libtorch.so => not found 2025-05-07T20:03:47.9991053Z libtorch_cpu.so => not found 2025-05-07T20:03:47.9991166Z libtorch_cuda.so => not found 2025-05-07T20:03:47.9991340Z libtorch.so => not found 2025-05-07T20:03:47.9991482Z libc10.so => not found 2025-05-07T20:03:47.9991668Z libc10_cuda.so => not found 2025-05-07T20:03:47.9991781Z libtorch_cpu.so => not found 2025-05-07T20:03:47.9991900Z libtorch_cuda.so => not found 2025-05-07T20:03:47.9992046Z libcudart.so.11.0 => not found 2025-05-07T20:03:47.9992154Z libtorch.so => not found 2025-05-07T20:03:47.9992257Z libc10.so => not found 2025-05-07T20:03:47.9992402Z libtorch_cpu.so => not found 2025-05-07T20:03:47.9992513Z libtorch_cuda.so => not found 2025-05-07T20:03:47.9992624Z libtorch_cpu.so => not found 2025-05-07T20:03:47.9992733Z libtorch_cuda.so => not found 2025-05-07T20:03:47.9992870Z libtorch.so => not found 2025-05-07T20:03:47.9992909Z 2025-05-07T20:03:47.9993028Z [CHECK] Displaying ELF information: 2025-05-07T20:03:47.9993360Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:03:47.9993365Z 2025-05-07T20:03:47.9993370Z 2025-05-07T20:03:47.9993568Z Dynamic section at offset 0x4be4eb0 contains 39 entries: 2025-05-07T20:03:47.9993701Z Tag Type Name/Value 2025-05-07T20:03:47.9993907Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:03:47.9994144Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:03:47.9994415Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_training_backward.so] 2025-05-07T20:03:47.9994624Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:47.9994864Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:47.9995078Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:47.9995296Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.11.0] 2025-05-07T20:03:47.9995537Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:47.9995742Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:47.9995945Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:47.9996169Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:03:47.9996477Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward_gwd.so] 2025-05-07T20:03:47.9996670Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:03:47.9996797Z 0x000000000000000c (INIT) 0xac000 2025-05-07T20:03:47.9996952Z 0x000000000000000d (FINI) 0x622c9c 2025-05-07T20:03:47.9997121Z 0x0000000000000019 (INIT_ARRAY) 0x4be5a40 2025-05-07T20:03:47.9997261Z 0x000000000000001b (INIT_ARRAYSZ) 200 (bytes) 2025-05-07T20:03:47.9997426Z 0x000000000000001a (FINI_ARRAY) 0x4be5b08 2025-05-07T20:03:47.9997558Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:47.9997677Z 0x0000000000000004 (HASH) 0x238 2025-05-07T20:03:47.9997806Z 0x000000006ffffef5 (GNU_HASH) 0x2ad0 2025-05-07T20:03:47.9997950Z 0x0000000000000005 (STRTAB) 0xea78 2025-05-07T20:03:47.9998068Z 0x0000000000000006 (SYMTAB) 0x54a8 2025-05-07T20:03:47.9998216Z 0x000000000000000a (STRSZ) 591335 (bytes) 2025-05-07T20:03:47.9998367Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:47.9998496Z 0x0000000000000003 (PLTGOT) 0x4be6160 2025-05-07T20:03:47.9998640Z 0x0000000000000002 (PLTRELSZ) 11232 (bytes) 2025-05-07T20:03:47.9998788Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:47.9998915Z 0x0000000000000017 (JMPREL) 0xa86d8 2025-05-07T20:03:47.9999033Z 0x0000000000000007 (RELA) 0x9fe10 2025-05-07T20:03:47.9999178Z 0x0000000000000008 (RELASZ) 35016 (bytes) 2025-05-07T20:03:47.9999332Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:47.9999444Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:47.9999602Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:47.9999756Z 0x000000006ffffffe (VERNEED) 0x9fce0 2025-05-07T20:03:47.9999873Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:03:47.9999997Z 0x000000006ffffff0 (VERSYM) 0x9f060 2025-05-07T20:03:48.0000119Z 0x000000006ffffff9 (RELACOUNT) 52 2025-05-07T20:03:48.0000253Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:48.0000258Z 2025-05-07T20:03:48.0000390Z ################################################################################ 2025-05-07T20:03:48.0000395Z 2025-05-07T20:03:48.0000400Z 2025-05-07T20:03:48.0000550Z ################################################################################ 2025-05-07T20:03:48.0000941Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:03:48.0001056Z [CHECK] Listing out library size: 2025-05-07T20:03:48.0001380Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:03:48.0001408Z 2025-05-07T20:03:48.0001684Z 176 ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:03:48.0001688Z 2025-05-07T20:03:48.0002137Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:03:48.0002712Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:48.0002716Z 2025-05-07T20:03:48.0577170Z GLIBC_2.2.5 2025-05-07T20:03:48.0577606Z GLIBC_2.3 2025-05-07T20:03:48.0578117Z GLIBC_2.14 2025-05-07T20:03:48.0578155Z 2025-05-07T20:03:48.0578160Z 2025-05-07T20:03:48.0578695Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:03:48.0579403Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:48.0579409Z 2025-05-07T20:03:48.1154026Z GLIBCXX_3.4 2025-05-07T20:03:48.1154288Z GLIBCXX_3.4.9 2025-05-07T20:03:48.1154582Z GLIBCXX_3.4.11 2025-05-07T20:03:48.1154815Z GLIBCXX_3.4.18 2025-05-07T20:03:48.1155044Z GLIBCXX_3.4.20 2025-05-07T20:03:48.1155277Z GLIBCXX_3.4.21 2025-05-07T20:03:48.1155294Z 2025-05-07T20:03:48.1155308Z 2025-05-07T20:03:48.1176578Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so > /tmp/tmp.fBgg8InHxF.symbols.txt 2025-05-07T20:03:48.1722505Z 2025-05-07T20:03:48.1722545Z 2025-05-07T20:03:48.1756007Z [CHECK] Total Number of symbols: 3621 2025-05-07T20:03:48.1792619Z [CHECK] Number of fbgemm symbols: 456 2025-05-07T20:03:48.1810119Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so > /tmp/tmp.Lp2W6bNEkB.usymbols.txt 2025-05-07T20:03:48.1810140Z 2025-05-07T20:03:48.1839511Z 2025-05-07T20:03:48.1867964Z [CHECK] Listing out undefined symbols (191 total): 2025-05-07T20:03:48.1890226Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:48.1891410Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:48.1892000Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:48.1892350Z U __cudaPopCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:48.1892772Z U __cudaPushCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:48.1893189Z U __cudaRegisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:48.1893578Z U __cudaRegisterFatBinaryEnd@libcudart.so.11.0 2025-05-07T20:03:48.1893975Z U __cudaRegisterFunction@libcudart.so.11.0 2025-05-07T20:03:48.1894333Z U __cudaRegisterVar@libcudart.so.11.0 2025-05-07T20:03:48.1894908Z U __cudaUnregisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:48.1895270Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:03:48.1895601Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:48.1895918Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:03:48.1896246Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:03:48.1896574Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:03:48.1896909Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:48.1897248Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:03:48.1897562Z U adjust_info_B_num_bits(int, int) 2025-05-07T20:03:48.1898099Z U at::CUDAGeneratorImpl::device_type() 2025-05-07T20:03:48.1898645Z U at::CUDAGeneratorImpl::philox_cuda_state(unsigned long) 2025-05-07T20:03:48.1899064Z U at::Context::deterministicAlgorithms() const 2025-05-07T20:03:48.1899493Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:03:48.1899916Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:03:48.1900733Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:48.1901983Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:48.1902880Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:03:48.1903467Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:03:48.1904306Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:48.1905381Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:48.1906187Z U at::cuda::detail::getDefaultCUDAGenerator(signed char) 2025-05-07T20:03:48.1906585Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:03:48.1906958Z U at::globalContext() 2025-05-07T20:03:48.1907361Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:48.1907770Z U c10::BoolType::get() 2025-05-07T20:03:48.1908128Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:03:48.1908466Z U c10::FloatType::get() 2025-05-07T20:03:48.1908797Z U c10::GeneratorImpl::device() const 2025-05-07T20:03:48.1909194Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:48.1909591Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:03:48.1909928Z U c10::IntType::get() 2025-05-07T20:03:48.1910266Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:03:48.1910652Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:03:48.1911008Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:48.1911518Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:03:48.1912112Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:03:48.1912577Z U c10::SymBool::guard_size_oblivious(char const*, long) const 2025-05-07T20:03:48.1913042Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:03:48.1913746Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:03:48.1914433Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:03:48.1914842Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:03:48.1915208Z U c10::SymInt::promote_to_negative() 2025-05-07T20:03:48.1915577Z U c10::SymInt::sym_eq(c10::SymInt const&) const 2025-05-07T20:03:48.1915946Z U c10::SymInt::sym_gt(c10::SymInt const&) const 2025-05-07T20:03:48.1916336Z U c10::SymInt::sym_le(c10::SymInt const&) const 2025-05-07T20:03:48.1916821Z U c10::SymInt::toSymNode() const 2025-05-07T20:03:48.1917145Z U c10::SymIntType::get() 2025-05-07T20:03:48.1917542Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:03:48.1917963Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:48.1918358Z U c10::TensorType::get() 2025-05-07T20:03:48.1918682Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:03:48.1919652Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:03:48.1920668Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:03:48.1921058Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:03:48.1921403Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:03:48.1921769Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:03:48.1922133Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:03:48.1922484Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:03:48.1922975Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:03:48.1923452Z U c10::cuda::device_count() 2025-05-07T20:03:48.1923803Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:03:48.1924325Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:03:48.1924692Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:03:48.1925072Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:03:48.1925480Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:03:48.1925864Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:03:48.1926564Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:48.1927362Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:03:48.1928165Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:48.1929031Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:03:48.1929972Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:48.1930736Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:03:48.1931049Z U c10::impl::GPUTrace::haveState 2025-05-07T20:03:48.1931398Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:03:48.1931806Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:03:48.1932253Z U c10::impl::device_guard_impl_registry 2025-05-07T20:03:48.1932592Z U c10::operator+(c10::SymInt const&, int) 2025-05-07T20:03:48.1932914Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:03:48.1933271Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:03:48.1933631Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:03:48.1933997Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:03:48.1934355Z U c10::throwNullDataPtrError() 2025-05-07T20:03:48.1934652Z U c10::warn(c10::Warning const&) 2025-05-07T20:03:48.1934992Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:03:48.1935393Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:03:48.1935802Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:03:48.1936148Z U cudaDeviceGetAttribute@libcudart.so.11.0 2025-05-07T20:03:48.1936492Z U cudaDeviceSynchronize@libcudart.so.11.0 2025-05-07T20:03:48.1936853Z U cudaEventCreateWithFlags@libcudart.so.11.0 2025-05-07T20:03:48.1937188Z U cudaEventDestroy@libcudart.so.11.0 2025-05-07T20:03:48.1937522Z U cudaEventElapsedTime@libcudart.so.11.0 2025-05-07T20:03:48.1937841Z U cudaEventQuery@libcudart.so.11.0 2025-05-07T20:03:48.1938162Z U cudaEventRecord@libcudart.so.11.0 2025-05-07T20:03:48.1938475Z U cudaEventSynchronize@libcudart.so.11.0 2025-05-07T20:03:48.1938821Z U cudaFuncSetAttribute@libcudart.so.11.0 2025-05-07T20:03:48.1939168Z U cudaGetDeviceProperties@libcudart.so.11.0 2025-05-07T20:03:48.1939506Z U cudaGetErrorString@libcudart.so.11.0 2025-05-07T20:03:48.1939827Z U cudaGetLastError@libcudart.so.11.0 2025-05-07T20:03:48.1940318Z U cudaLaunchKernel@libcudart.so.11.0 2025-05-07T20:03:48.1940685Z U cudaStreamQuery@libcudart.so.11.0 2025-05-07T20:03:48.1941021Z U cudaStreamSynchronize@libcudart.so.11.0 2025-05-07T20:03:48.1941375Z U cudaStreamWaitEvent@libcudart.so.11.0 2025-05-07T20:03:48.1944011Z U embedding_ops::split_embedding_backward_codegen_find_long_segments(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, int, bool) 2025-05-07T20:03:48.1946502Z U fbgemm_gpu::asynchronous_complete_cumsum_gpu(at::Tensor const&) 2025-05-07T20:03:48.1946939Z U float at::Tensor::item() const 2025-05-07T20:03:48.1947291Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:03:48.1947700Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:48.1948088Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:03:48.1948468Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:48.1948884Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:03:48.1949308Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:03:48.1949706Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:48.1950058Z U memcpy@GLIBC_2.14 2025-05-07T20:03:48.1950345Z U memset@GLIBC_2.2.5 2025-05-07T20:03:48.1950676Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:03:48.1951097Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:48.1951770Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:03:48.1952601Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:03:48.1953348Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, long const*, long*, int, int, int, CUstream_st*) 2025-05-07T20:03:48.1954099Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:03:48.1954914Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:03:48.1955717Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, long const*, long*, int, int, int, CUstream_st*) 2025-05-07T20:03:48.1956494Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:03:48.1957352Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:48.1958262Z U std::__cxx11::basic_string, std::allocator >::_M_append(char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:48.1959316Z U std::__cxx11::basic_string, std::allocator >::_M_assign(std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:48.1960370Z U std::__cxx11::basic_string, std::allocator >::_M_construct(unsigned long, char)@GLIBCXX_3.4.21 2025-05-07T20:03:48.1961304Z U std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:48.1962263Z U std::__cxx11::basic_string, std::allocator >::_M_mutate(unsigned long, unsigned long, char const*, unsigned long) 2025-05-07T20:03:48.1963358Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:48.1964461Z U std::__cxx11::basic_string, std::allocator >::append(char const*)@GLIBCXX_3.4.21 2025-05-07T20:03:48.1965184Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const@GLIBCXX_3.4.21 2025-05-07T20:03:48.1965966Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:03:48.1966710Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:03:48.1967026Z U std::__throw_bad_array_new_length() 2025-05-07T20:03:48.1967383Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:48.1967748Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:48.1968159Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:03:48.1968555Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:03:48.1969028Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:03:48.1969691Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:03:48.1970645Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:03:48.1971804Z U std::basic_ostream >& std::operator<< >(std::basic_ostream >&, char const*)@GLIBCXX_3.4 2025-05-07T20:03:48.1972514Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:03:48.1972842Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:03:48.1973183Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:03:48.1973502Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:03:48.1973826Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:03:48.1974181Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:03:48.1975814Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:03:48.1976327Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:03:48.1976768Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:03:48.1977101Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:48.1977404Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:48.1977703Z U torch::CppFunction::~CppFunction() 2025-05-07T20:03:48.1978477Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:03:48.1979549Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:03:48.1980338Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:03:48.1981041Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:03:48.1981992Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:03:48.1984482Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:03:48.1988773Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:03:48.1993007Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:03:48.1997000Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:03:48.2000893Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:03:48.2005149Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:03:48.2008856Z U void embedding_ops::split_embedding_backward_count_unique_indices_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int) 2025-05-07T20:03:48.2010833Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:48.2011302Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:03:48.2011761Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:48.2012421Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:48.2013284Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:48.2014102Z U vtable for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:48.2014804Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:03:48.2015362Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:03:48.2015833Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:48.2016171Z w _ITM_registerTMCloneTable 2025-05-07T20:03:48.2016631Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:48.2016937Z w __gmon_start__ 2025-05-07T20:03:48.2017229Z w __pthread_key_create 2025-05-07T20:03:48.2017569Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:03:48.2017908Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:03:48.2018311Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:48.2018829Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:03:48.2019220Z 2025-05-07T20:03:48.2019370Z linux-vdso.so.1 (0x00007ffd4cb5f000) 2025-05-07T20:03:48.2019689Z libc10.so => not found 2025-05-07T20:03:48.2019942Z libc10_cuda.so => not found 2025-05-07T20:03:48.2020721Z fbgemm_gpu_tbe_training_backward.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward.so (0x00007ff226400000) 2025-05-07T20:03:48.2021474Z libtorch.so => not found 2025-05-07T20:03:48.2021765Z libtorch_cpu.so => not found 2025-05-07T20:03:48.2022064Z libtorch_cuda.so => not found 2025-05-07T20:03:48.2022333Z libcudart.so.11.0 => not found 2025-05-07T20:03:48.2022672Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007ff22619c000) 2025-05-07T20:03:48.2023071Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007ff2521f4000) 2025-05-07T20:03:48.2023448Z libc.so.6 => /lib64/libc.so.6 (0x00007ff225f94000) 2025-05-07T20:03:48.2023793Z /lib64/ld-linux-x86-64.so.2 (0x00007ff252228000) 2025-05-07T20:03:48.2024164Z libc10.so => not found 2025-05-07T20:03:48.2024437Z libc10_cuda.so => not found 2025-05-07T20:03:48.2025075Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so (0x00007ff225c00000) 2025-05-07T20:03:48.2026185Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007ff225e91000) 2025-05-07T20:03:48.2026917Z libtorch.so => not found 2025-05-07T20:03:48.2027425Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so (0x00007ff225600000) 2025-05-07T20:03:48.2028328Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007ff224c00000) 2025-05-07T20:03:48.2028995Z libtorch_cpu.so => not found 2025-05-07T20:03:48.2029292Z libtorch_cuda.so => not found 2025-05-07T20:03:48.2029569Z libcudart.so.11.0 => not found 2025-05-07T20:03:48.2029881Z libm.so.6 => /lib64/libm.so.6 (0x00007ff225525000) 2025-05-07T20:03:48.2030199Z libc10.so => not found 2025-05-07T20:03:48.2030465Z libc10_cuda.so => not found 2025-05-07T20:03:48.2031064Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so (0x00007ff2521e3000) 2025-05-07T20:03:48.2031951Z libtorch.so => not found 2025-05-07T20:03:48.2032216Z libtorch_cpu.so => not found 2025-05-07T20:03:48.2032521Z libtorch_cuda.so => not found 2025-05-07T20:03:48.2032810Z libcudart.so.11.0 => not found 2025-05-07T20:03:48.2033080Z libc10.so => not found 2025-05-07T20:03:48.2033344Z libc10_cuda.so => not found 2025-05-07T20:03:48.2033615Z libtorch.so => not found 2025-05-07T20:03:48.2033886Z libtorch_cpu.so => not found 2025-05-07T20:03:48.2034161Z libtorch_cuda.so => not found 2025-05-07T20:03:48.2034435Z libcudart.so.11.0 => not found 2025-05-07T20:03:48.2034738Z libc10.so => not found 2025-05-07T20:03:48.2035254Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so (0x00007ff252164000) 2025-05-07T20:03:48.2035821Z libtorch.so => not found 2025-05-07T20:03:48.2036094Z libtorch_cpu.so => not found 2025-05-07T20:03:48.2036379Z libtorch_cuda.so => not found 2025-05-07T20:03:48.2036649Z libtorch.so => not found 2025-05-07T20:03:48.2036907Z libc10.so => not found 2025-05-07T20:03:48.2037151Z libc10_cuda.so => not found 2025-05-07T20:03:48.2037428Z libtorch_cpu.so => not found 2025-05-07T20:03:48.2037697Z libtorch_cuda.so => not found 2025-05-07T20:03:48.2037992Z libcudart.so.11.0 => not found 2025-05-07T20:03:48.2038256Z libtorch.so => not found 2025-05-07T20:03:48.2038522Z libc10.so => not found 2025-05-07T20:03:48.2038769Z libtorch_cpu.so => not found 2025-05-07T20:03:48.2039043Z libtorch_cuda.so => not found 2025-05-07T20:03:48.2039333Z libtorch_cpu.so => not found 2025-05-07T20:03:48.2039596Z libtorch_cuda.so => not found 2025-05-07T20:03:48.2039876Z libtorch.so => not found 2025-05-07T20:03:48.2040034Z 2025-05-07T20:03:48.2040150Z [CHECK] Displaying ELF information: 2025-05-07T20:03:48.2040661Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:03:48.2041064Z 2025-05-07T20:03:48.2041068Z 2025-05-07T20:03:48.2041269Z Dynamic section at offset 0xafd93a8 contains 39 entries: 2025-05-07T20:03:48.2041667Z Tag Type Name/Value 2025-05-07T20:03:48.2042095Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:03:48.2042598Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:03:48.2043185Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_training_backward.so] 2025-05-07T20:03:48.2043752Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:48.2044379Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:48.2044892Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:48.2045430Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.11.0] 2025-05-07T20:03:48.2045930Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:48.2046398Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:48.2046881Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:48.2047345Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:03:48.2047925Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward_vbe.so] 2025-05-07T20:03:48.2048444Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:03:48.2048840Z 0x000000000000000c (INIT) 0x196000 2025-05-07T20:03:48.2049172Z 0x000000000000000d (FINI) 0xef6a5c 2025-05-07T20:03:48.2049485Z 0x0000000000000019 (INIT_ARRAY) 0xafd8638 2025-05-07T20:03:48.2049856Z 0x000000000000001b (INIT_ARRAYSZ) 680 (bytes) 2025-05-07T20:03:48.2050179Z 0x000000000000001a (FINI_ARRAY) 0xafd88e0 2025-05-07T20:03:48.2050516Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:48.2050823Z 0x0000000000000004 (HASH) 0x238 2025-05-07T20:03:48.2051144Z 0x000000006ffffef5 (GNU_HASH) 0x4ac8 2025-05-07T20:03:48.2051470Z 0x0000000000000005 (STRTAB) 0x1f3f8 2025-05-07T20:03:48.2051768Z 0x0000000000000006 (SYMTAB) 0xa068 2025-05-07T20:03:48.2052117Z 0x000000000000000a (STRSZ) 1414051 (bytes) 2025-05-07T20:03:48.2052455Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:48.2052799Z 0x0000000000000003 (PLTGOT) 0xafd9658 2025-05-07T20:03:48.2053141Z 0x0000000000000002 (PLTRELSZ) 17880 (bytes) 2025-05-07T20:03:48.2053518Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:48.2053836Z 0x0000000000000017 (JMPREL) 0x191540 2025-05-07T20:03:48.2054167Z 0x0000000000000007 (RELA) 0x17a518 2025-05-07T20:03:48.2054521Z 0x0000000000000008 (RELASZ) 94248 (bytes) 2025-05-07T20:03:48.2054860Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:48.2055192Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:48.2055509Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:48.2055859Z 0x000000006ffffffe (VERNEED) 0x17a3e8 2025-05-07T20:03:48.2056183Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:03:48.2056507Z 0x000000006ffffff0 (VERSYM) 0x17879c 2025-05-07T20:03:48.2056823Z 0x000000006ffffff9 (RELACOUNT) 156 2025-05-07T20:03:48.2057141Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:48.2057328Z 2025-05-07T20:03:48.2057461Z ################################################################################ 2025-05-07T20:03:48.2057675Z 2025-05-07T20:03:48.2057681Z 2025-05-07T20:03:48.2057795Z ################################################################################ 2025-05-07T20:03:48.2058326Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:03:48.2058832Z [CHECK] Listing out library size: 2025-05-07T20:03:48.2059362Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:03:48.2059759Z 2025-05-07T20:03:48.2060020Z 31 ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:03:48.2060358Z 2025-05-07T20:03:48.2060777Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:03:48.2061804Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:48.2062409Z 2025-05-07T20:03:48.2155286Z GLIBC_2.2.5 2025-05-07T20:03:48.2155882Z GLIBC_2.3 2025-05-07T20:03:48.2156742Z GLIBC_2.14 2025-05-07T20:03:48.2157086Z 2025-05-07T20:03:48.2157099Z 2025-05-07T20:03:48.2158493Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:03:48.2161954Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:48.2162630Z 2025-05-07T20:03:48.2310787Z GLIBCXX_3.4 2025-05-07T20:03:48.2311106Z GLIBCXX_3.4.9 2025-05-07T20:03:48.2311651Z GLIBCXX_3.4.11 2025-05-07T20:03:48.2311862Z GLIBCXX_3.4.15 2025-05-07T20:03:48.2312081Z GLIBCXX_3.4.18 2025-05-07T20:03:48.2312357Z GLIBCXX_3.4.20 2025-05-07T20:03:48.2312577Z GLIBCXX_3.4.21 2025-05-07T20:03:48.2312711Z 2025-05-07T20:03:48.2312715Z 2025-05-07T20:03:48.2332661Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so > /tmp/tmp.VzHHtRN1zq.symbols.txt 2025-05-07T20:03:48.2334286Z 2025-05-07T20:03:48.2443430Z 2025-05-07T20:03:48.2470238Z [CHECK] Total Number of symbols: 1779 2025-05-07T20:03:48.2485973Z [CHECK] Number of fbgemm symbols: 94 2025-05-07T20:03:48.2505692Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so > /tmp/tmp.sdaI0OxUCF.usymbols.txt 2025-05-07T20:03:48.2507309Z 2025-05-07T20:03:48.2523328Z 2025-05-07T20:03:48.2549770Z [CHECK] Listing out undefined symbols (276 total): 2025-05-07T20:03:48.2563748Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:48.2587628Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:48.2589367Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:48.2589922Z U __cudaPopCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:48.2590680Z U __cudaPushCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:48.2591329Z U __cudaRegisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:48.2591725Z U __cudaRegisterFatBinaryEnd@libcudart.so.11.0 2025-05-07T20:03:48.2592116Z U __cudaRegisterFunction@libcudart.so.11.0 2025-05-07T20:03:48.2592488Z U __cudaRegisterVar@libcudart.so.11.0 2025-05-07T20:03:48.2592877Z U __cudaUnregisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:48.2593229Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:03:48.2593576Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:03:48.2593876Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:48.2594200Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:03:48.2594515Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:03:48.2594851Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:03:48.2595275Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:03:48.2595588Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:03:48.2595889Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:03:48.2596169Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:03:48.2596567Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:03:48.2596851Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:48.2597160Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:03:48.2597449Z U adjust_info_B_num_bits(int, int) 2025-05-07T20:03:48.2597799Z U at::Context::deterministicAlgorithms() const 2025-05-07T20:03:48.2598193Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:03:48.2598574Z U at::RecordFunction::currentThreadId() 2025-05-07T20:03:48.2598900Z U at::RecordFunction::end() 2025-05-07T20:03:48.2599210Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:03:48.2599629Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:03:48.2600088Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:03:48.2600530Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:03:48.2601337Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:48.2602573Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:48.2603457Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:03:48.2604165Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:48.2605243Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:48.2606015Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:03:48.2606376Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:03:48.2606722Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:03:48.2607066Z U at::globalContext() 2025-05-07T20:03:48.2607361Z U at::sequence_number::get_and_increment() 2025-05-07T20:03:48.2607666Z U c10::AnyType::get() 2025-05-07T20:03:48.2608081Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:48.2608459Z U c10::BoolType::get() 2025-05-07T20:03:48.2608787Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:03:48.2609198Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:03:48.2609568Z U c10::Dispatcher::realSingleton() 2025-05-07T20:03:48.2610261Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:03:48.2611418Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:03:48.2612458Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:03:48.2613024Z U c10::Error::what() const 2025-05-07T20:03:48.2613489Z U c10::FloatType::get() 2025-05-07T20:03:48.2613852Z U c10::GradMode::is_enabled() 2025-05-07T20:03:48.2614164Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:03:48.2614546Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:48.2615185Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:03:48.2615583Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:03:48.2616006Z U c10::IValue::isBoolList() const 2025-05-07T20:03:48.2616328Z U c10::IValue::isIntList() const 2025-05-07T20:03:48.2616668Z U c10::IValue::isSymIntList() const 2025-05-07T20:03:48.2616998Z U c10::IValue::isTensorList() const 2025-05-07T20:03:48.2617378Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:03:48.2617745Z U c10::IntType::get() 2025-05-07T20:03:48.2618110Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:03:48.2618579Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:03:48.2618924Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:03:48.2619274Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:03:48.2619722Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:48.2620337Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:03:48.2620885Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:48.2621242Z U c10::StringType::get() 2025-05-07T20:03:48.2621590Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:03:48.2621982Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:03:48.2622407Z U c10::SymBool::guard_size_oblivious(char const*, long) const 2025-05-07T20:03:48.2622836Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:03:48.2623241Z U c10::SymFloat::operator/(c10::SymFloat const&) const 2025-05-07T20:03:48.2623915Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:03:48.2624668Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:03:48.2625037Z U c10::SymInt::operator c10::SymFloat() const 2025-05-07T20:03:48.2625393Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:03:48.2625741Z U c10::SymInt::promote_to_negative() 2025-05-07T20:03:48.2626083Z U c10::SymInt::sym_eq(c10::SymInt const&) const 2025-05-07T20:03:48.2626430Z U c10::SymInt::sym_gt(c10::SymInt const&) const 2025-05-07T20:03:48.2626819Z U c10::SymInt::sym_le(c10::SymInt const&) const 2025-05-07T20:03:48.2627162Z U c10::SymInt::toSymNode() const 2025-05-07T20:03:48.2627467Z U c10::SymIntType::get() 2025-05-07T20:03:48.2627824Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:03:48.2628189Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:03:48.2628572Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:48.2628929Z U c10::TensorType::get() 2025-05-07T20:03:48.2629362Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:03:48.2630239Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:03:48.2631116Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:03:48.2631526Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:03:48.2632043Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:03:48.2632394Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:03:48.2632757Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:03:48.2633140Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:03:48.2633614Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:03:48.2634070Z U c10::cuda::device_count() 2025-05-07T20:03:48.2634418Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:03:48.2634788Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:03:48.2635170Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:03:48.2635564Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:03:48.2635961Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:03:48.2636374Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:03:48.2637089Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:48.2638142Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:48.2639018Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:03:48.2639884Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:48.2640825Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:03:48.2641855Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:48.2642662Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:03:48.2643000Z U c10::impl::GPUTrace::haveState 2025-05-07T20:03:48.2643530Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:03:48.2644267Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:03:48.2644699Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:03:48.2645108Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:03:48.2645500Z U c10::impl::device_guard_impl_registry 2025-05-07T20:03:48.2645847Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:03:48.2646222Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:03:48.2646938Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:03:48.2647487Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:03:48.2647844Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:03:48.2648197Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:03:48.2648577Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:03:48.2648977Z U c10::operator<<(std::ostream&, c10::SymFloat const&) 2025-05-07T20:03:48.2649339Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:03:48.2649679Z U c10::throwNullDataPtrError() 2025-05-07T20:03:48.2650150Z U c10::warn(c10::Warning const&) 2025-05-07T20:03:48.2650470Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:03:48.2650861Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:03:48.2651275Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:03:48.2651627Z U cudaDeviceGetAttribute@libcudart.so.11.0 2025-05-07T20:03:48.2652129Z U cudaDeviceSynchronize@libcudart.so.11.0 2025-05-07T20:03:48.2652500Z U cudaEventCreateWithFlags@libcudart.so.11.0 2025-05-07T20:03:48.2652839Z U cudaEventDestroy@libcudart.so.11.0 2025-05-07T20:03:48.2653197Z U cudaEventElapsedTime@libcudart.so.11.0 2025-05-07T20:03:48.2653711Z U cudaEventQuery@libcudart.so.11.0 2025-05-07T20:03:48.2654056Z U cudaEventRecord@libcudart.so.11.0 2025-05-07T20:03:48.2654414Z U cudaEventSynchronize@libcudart.so.11.0 2025-05-07T20:03:48.2654761Z U cudaFuncSetAttribute@libcudart.so.11.0 2025-05-07T20:03:48.2655168Z U cudaGetDeviceProperties@libcudart.so.11.0 2025-05-07T20:03:48.2655551Z U cudaGetErrorString@libcudart.so.11.0 2025-05-07T20:03:48.2655904Z U cudaGetLastError@libcudart.so.11.0 2025-05-07T20:03:48.2656237Z U cudaLaunchKernel@libcudart.so.11.0 2025-05-07T20:03:48.2656579Z U cudaStreamQuery@libcudart.so.11.0 2025-05-07T20:03:48.2656924Z U cudaStreamSynchronize@libcudart.so.11.0 2025-05-07T20:03:48.2657285Z U cudaStreamWaitEvent@libcudart.so.11.0 2025-05-07T20:03:48.2659720Z U embedding_ops::split_embedding_backward_codegen_find_long_segments(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, int, bool) 2025-05-07T20:03:48.2662255Z U fbgemm_gpu::config::is_feature_enabled(fbgemm_gpu::config::FeatureGateName const&) 2025-05-07T20:03:48.2662755Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:03:48.2663163Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:48.2663526Z U free@GLIBC_2.2.5 2025-05-07T20:03:48.2663836Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:03:48.2664329Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:48.2664732Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:03:48.2665150Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:03:48.2665592Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:48.2665947Z U memcmp@GLIBC_2.2.5 2025-05-07T20:03:48.2666228Z U memcpy@GLIBC_2.14 2025-05-07T20:03:48.2666490Z U memmove@GLIBC_2.2.5 2025-05-07T20:03:48.2666776Z U memset@GLIBC_2.2.5 2025-05-07T20:03:48.2667097Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:03:48.2667474Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:48.2668006Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:03:48.2668921Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:03:48.2669470Z U realloc@GLIBC_2.2.5 2025-05-07T20:03:48.2669870Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:03:48.2670546Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:03:48.2671459Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:48.2672386Z U std::__cxx11::basic_string, std::allocator >::_M_append(char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:48.2673443Z U std::__cxx11::basic_string, std::allocator >::_M_assign(std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:48.2674456Z U std::__cxx11::basic_string, std::allocator >::_M_construct(unsigned long, char)@GLIBCXX_3.4.21 2025-05-07T20:03:48.2675366Z U std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:48.2676401Z U std::__cxx11::basic_string, std::allocator >::_M_mutate(unsigned long, unsigned long, char const*, unsigned long) 2025-05-07T20:03:48.2677476Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:48.2678431Z U std::__cxx11::basic_string, std::allocator >::append(char const*)@GLIBCXX_3.4.21 2025-05-07T20:03:48.2679237Z U std::__cxx11::basic_string, std::allocator >::reserve(unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:48.2680257Z U std::__cxx11::basic_string, std::allocator >::swap(std::__cxx11::basic_string, std::allocator >&)@GLIBCXX_3.4.21 2025-05-07T20:03:48.2681232Z U std::__cxx11::basic_string, std::allocator >::~basic_string()@GLIBCXX_3.4.21 2025-05-07T20:03:48.2682011Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const@GLIBCXX_3.4.21 2025-05-07T20:03:48.2682789Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:03:48.2683590Z U std::__cxx11::basic_stringstream, std::allocator >::~basic_stringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:48.2684560Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:03:48.2684672Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:03:48.2684813Z U std::__throw_bad_array_new_length() 2025-05-07T20:03:48.2684968Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:48.2685102Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:48.2685270Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:03:48.2685414Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:03:48.2685553Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:03:48.2685780Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:03:48.2686132Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:03:48.2686701Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:03:48.2687206Z U std::basic_ostream >& std::operator<< >(std::basic_ostream >&, char const*)@GLIBCXX_3.4 2025-05-07T20:03:48.2687336Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:03:48.2687450Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:03:48.2687611Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:03:48.2687729Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:03:48.2687841Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:03:48.2687965Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:03:48.2688075Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:03:48.2688253Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:03:48.2688480Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:03:48.2688645Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:03:48.2688834Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:03:48.2688960Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:03:48.2689396Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:03:48.2689534Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:03:48.2689638Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:48.2689737Z U strcmp@GLIBC_2.2.5 2025-05-07T20:03:48.2689835Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:48.2689951Z U torch::CppFunction::~CppFunction() 2025-05-07T20:03:48.2690701Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:03:48.2691336Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:03:48.2691592Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:03:48.2691732Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:03:48.2692032Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:03:48.2692217Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:03:48.2692432Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:03:48.2692618Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:03:48.2693043Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:03:48.2693197Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:03:48.2693383Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:03:48.2693565Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:03:48.2693697Z U torch::autograd::Node::assign_parent() 2025-05-07T20:03:48.2693809Z U torch::autograd::Node::metadata() 2025-05-07T20:03:48.2693946Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:03:48.2694201Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:03:48.2694464Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:03:48.2694604Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:03:48.2694831Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:03:48.2695049Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:03:48.2697767Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:03:48.2698012Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:03:48.2698173Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:03:48.2698338Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:03:48.2699130Z U torch::autograd::profiler::record_function_enter_new(std::__cxx11::basic_string, std::allocator > const&, std::optional, std::allocator > > const&) 2025-05-07T20:03:48.2699295Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:03:48.2699706Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:03:48.2700071Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:03:48.2700644Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:03:48.2700748Z U typeinfo for c10::Error 2025-05-07T20:03:48.2700907Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:03:48.2701032Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:03:48.2701155Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:03:48.2701282Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:03:48.2701413Z U typeinfo for torch::autograd::Node 2025-05-07T20:03:48.2702864Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:03:48.2704356Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:03:48.2705599Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:03:48.2706907Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:03:48.2708158Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:03:48.2709405Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:03:48.2709605Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:48.2709766Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:03:48.2709916Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:03:48.2710061Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:48.2710171Z U vtable for c10::Error 2025-05-07T20:03:48.2710494Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:48.2710797Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:48.2711124Z U vtable for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:48.2711308Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:03:48.2711510Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:03:48.2711896Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:03:48.2712011Z U vtable for torch::autograd::Node 2025-05-07T20:03:48.2712191Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:03:48.2712412Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:48.2712522Z w _ITM_registerTMCloneTable 2025-05-07T20:03:48.2712659Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:48.2712772Z w __gmon_start__ 2025-05-07T20:03:48.2712870Z w __pthread_key_create 2025-05-07T20:03:48.2712980Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:03:48.2713101Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:03:48.2713245Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:48.2713503Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:03:48.2713510Z 2025-05-07T20:03:48.2713678Z linux-vdso.so.1 (0x00007ffee9bf9000) 2025-05-07T20:03:48.2713769Z libc10.so => not found 2025-05-07T20:03:48.2713863Z libc10_cuda.so => not found 2025-05-07T20:03:48.2714451Z fbgemm_gpu_tbe_training_backward.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward.so (0x00007ff53ee00000) 2025-05-07T20:03:48.2714543Z libtorch.so => not found 2025-05-07T20:03:48.2714643Z libtorch_cpu.so => not found 2025-05-07T20:03:48.2714758Z libtorch_cuda.so => not found 2025-05-07T20:03:48.2714854Z libcudart.so.11.0 => not found 2025-05-07T20:03:48.2715017Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007ff53eb9c000) 2025-05-07T20:03:48.2715164Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007ff5614c3000) 2025-05-07T20:03:48.2716539Z libc.so.6 => /lib64/libc.so.6 (0x00007ff53e994000) 2025-05-07T20:03:48.2716674Z /lib64/ld-linux-x86-64.so.2 (0x00007ff5614f7000) 2025-05-07T20:03:48.2716769Z libc10.so => not found 2025-05-07T20:03:48.2716873Z libc10_cuda.so => not found 2025-05-07T20:03:48.2717348Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so (0x00007ff53e600000) 2025-05-07T20:03:48.2717886Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007ff53e891000) 2025-05-07T20:03:48.2717988Z libtorch.so => not found 2025-05-07T20:03:48.2718332Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so (0x00007ff53e000000) 2025-05-07T20:03:48.2718846Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007ff53d600000) 2025-05-07T20:03:48.2718955Z libtorch_cpu.so => not found 2025-05-07T20:03:48.2719050Z libtorch_cuda.so => not found 2025-05-07T20:03:48.2719144Z libcudart.so.11.0 => not found 2025-05-07T20:03:48.2719274Z libm.so.6 => /lib64/libm.so.6 (0x00007ff53df25000) 2025-05-07T20:03:48.2719374Z libc10.so => not found 2025-05-07T20:03:48.2719460Z libc10_cuda.so => not found 2025-05-07T20:03:48.2719894Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so (0x00007ff5614b2000) 2025-05-07T20:03:48.2720004Z libtorch.so => not found 2025-05-07T20:03:48.2720096Z libtorch_cpu.so => not found 2025-05-07T20:03:48.2720194Z libtorch_cuda.so => not found 2025-05-07T20:03:48.2720296Z libcudart.so.11.0 => not found 2025-05-07T20:03:48.2720395Z libc10.so => not found 2025-05-07T20:03:48.2720489Z libc10_cuda.so => not found 2025-05-07T20:03:48.2720581Z libtorch.so => not found 2025-05-07T20:03:48.2720687Z libtorch_cpu.so => not found 2025-05-07T20:03:48.2720776Z libtorch_cuda.so => not found 2025-05-07T20:03:48.2720871Z libcudart.so.11.0 => not found 2025-05-07T20:03:48.2720955Z libc10.so => not found 2025-05-07T20:03:48.2721313Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so (0x00007ff561433000) 2025-05-07T20:03:48.2721403Z libtorch.so => not found 2025-05-07T20:03:48.2721493Z libtorch_cpu.so => not found 2025-05-07T20:03:48.2721603Z libtorch_cuda.so => not found 2025-05-07T20:03:48.2721688Z libtorch.so => not found 2025-05-07T20:03:48.2721773Z libc10.so => not found 2025-05-07T20:03:48.2721877Z libc10_cuda.so => not found 2025-05-07T20:03:48.2722007Z libtorch_cpu.so => not found 2025-05-07T20:03:48.2722101Z libtorch_cuda.so => not found 2025-05-07T20:03:48.2722192Z libcudart.so.11.0 => not found 2025-05-07T20:03:48.2722297Z libtorch.so => not found 2025-05-07T20:03:48.2722381Z libc10.so => not found 2025-05-07T20:03:48.2722469Z libtorch_cpu.so => not found 2025-05-07T20:03:48.2722563Z libtorch_cuda.so => not found 2025-05-07T20:03:48.2722673Z libtorch_cpu.so => not found 2025-05-07T20:03:48.2722768Z libtorch_cuda.so => not found 2025-05-07T20:03:48.2722854Z libtorch.so => not found 2025-05-07T20:03:48.2722860Z 2025-05-07T20:03:48.2722982Z [CHECK] Displaying ELF information: 2025-05-07T20:03:48.2723275Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:03:48.2723280Z 2025-05-07T20:03:48.2723284Z 2025-05-07T20:03:48.2723449Z Dynamic section at offset 0x1e2beb0 contains 39 entries: 2025-05-07T20:03:48.2723573Z Tag Type Name/Value 2025-05-07T20:03:48.2723767Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:03:48.2724082Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:03:48.2724326Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_training_backward.so] 2025-05-07T20:03:48.2724503Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:48.2724718Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:48.2724902Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:48.2725099Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.11.0] 2025-05-07T20:03:48.2725284Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:48.2725468Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:48.2725648Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:48.2725843Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:03:48.2726126Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward_dense.so] 2025-05-07T20:03:48.2726332Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:03:48.2726434Z 0x000000000000000c (INIT) 0x73000 2025-05-07T20:03:48.2726549Z 0x000000000000000d (FINI) 0x26732c 2025-05-07T20:03:48.2726672Z 0x0000000000000019 (INIT_ARRAY) 0x1e2b390 2025-05-07T20:03:48.2726785Z 0x000000000000001b (INIT_ARRAYSZ) 184 (bytes) 2025-05-07T20:03:48.2726899Z 0x000000000000001a (FINI_ARRAY) 0x1e2b448 2025-05-07T20:03:48.2727008Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:48.2727114Z 0x0000000000000004 (HASH) 0x238 2025-05-07T20:03:48.2727216Z 0x000000006ffffef5 (GNU_HASH) 0x2dd0 2025-05-07T20:03:48.2727315Z 0x0000000000000005 (STRTAB) 0x10020 2025-05-07T20:03:48.2727423Z 0x0000000000000006 (SYMTAB) 0x5940 2025-05-07T20:03:48.2727544Z 0x000000000000000a (STRSZ) 353303 (bytes) 2025-05-07T20:03:48.2727650Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:48.2727762Z 0x0000000000000003 (PLTGOT) 0x1e2c160 2025-05-07T20:03:48.2727894Z 0x0000000000000002 (PLTRELSZ) 13176 (bytes) 2025-05-07T20:03:48.2727993Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:48.2728093Z 0x0000000000000017 (JMPREL) 0x6efd8 2025-05-07T20:03:48.2728209Z 0x0000000000000007 (RELA) 0x67370 2025-05-07T20:03:48.2728325Z 0x0000000000000008 (RELASZ) 31848 (bytes) 2025-05-07T20:03:48.2728434Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:48.2728536Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:48.2728646Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:48.2728752Z 0x000000006ffffffe (VERNEED) 0x67220 2025-05-07T20:03:48.2728881Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:03:48.2728997Z 0x000000006ffffff0 (VERSYM) 0x66438 2025-05-07T20:03:48.2729097Z 0x000000006ffffff9 (RELACOUNT) 43 2025-05-07T20:03:48.2729188Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:48.2729193Z 2025-05-07T20:03:48.2729319Z ################################################################################ 2025-05-07T20:03:48.2729323Z 2025-05-07T20:03:48.2729327Z 2025-05-07T20:03:48.2729427Z ################################################################################ 2025-05-07T20:03:48.2729643Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:03:48.2729749Z [CHECK] Listing out library size: 2025-05-07T20:03:48.2729952Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:03:48.2729957Z 2025-05-07T20:03:48.2730109Z 40 ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:03:48.2730114Z 2025-05-07T20:03:48.2730608Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:03:48.2731019Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_py.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:48.2731023Z 2025-05-07T20:03:48.3031549Z GLIBC_2.2.5 2025-05-07T20:03:48.3032205Z GLIBC_2.3 2025-05-07T20:03:48.3032311Z GLIBC_2.14 2025-05-07T20:03:48.3032354Z 2025-05-07T20:03:48.3032359Z 2025-05-07T20:03:48.3032827Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:03:48.3033318Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_py.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:48.3033324Z 2025-05-07T20:03:48.3391746Z GLIBCXX_3.4 2025-05-07T20:03:48.3392269Z GLIBCXX_3.4.9 2025-05-07T20:03:48.3392619Z GLIBCXX_3.4.11 2025-05-07T20:03:48.3393503Z GLIBCXX_3.4.14 2025-05-07T20:03:48.3394322Z GLIBCXX_3.4.15 2025-05-07T20:03:48.3394547Z GLIBCXX_3.4.18 2025-05-07T20:03:48.3394909Z GLIBCXX_3.4.19 2025-05-07T20:03:48.3395124Z GLIBCXX_3.4.20 2025-05-07T20:03:48.3395343Z GLIBCXX_3.4.21 2025-05-07T20:03:48.3395362Z 2025-05-07T20:03:48.3395374Z 2025-05-07T20:03:48.3411303Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_py.so > /tmp/tmp.5Xk1beD8eK.symbols.txt 2025-05-07T20:03:48.3411511Z 2025-05-07T20:03:48.3712874Z 2025-05-07T20:03:48.3738690Z [CHECK] Total Number of symbols: 6310 2025-05-07T20:03:48.3772901Z [CHECK] Number of fbgemm symbols: 4411 2025-05-07T20:03:48.3795035Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_py.so > /tmp/tmp.NzLiqiv7MO.usymbols.txt 2025-05-07T20:03:48.3795063Z 2025-05-07T20:03:48.3828857Z 2025-05-07T20:03:48.3854004Z [CHECK] Listing out undefined symbols (492 total): 2025-05-07T20:03:48.3874920Z U GOMP_parallel 2025-05-07T20:03:48.3875539Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:48.3876043Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:48.3876370Z U VTT for std::basic_ifstream >@GLIBCXX_3.4 2025-05-07T20:03:48.3876676Z U VTT for std::basic_ofstream >@GLIBCXX_3.4 2025-05-07T20:03:48.3876839Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:48.3876994Z U __assert_fail@GLIBC_2.2.5 2025-05-07T20:03:48.3877194Z U __cudaPopCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:48.3877416Z U __cudaPushCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:48.3877591Z U __cudaRegisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:48.3878014Z U __cudaRegisterFatBinaryEnd@libcudart.so.11.0 2025-05-07T20:03:48.3878222Z U __cudaRegisterFunction@libcudart.so.11.0 2025-05-07T20:03:48.3878414Z U __cudaRegisterVar@libcudart.so.11.0 2025-05-07T20:03:48.3878599Z U __cudaUnregisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:48.3878749Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:03:48.3878935Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:03:48.3879088Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:48.3879242Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:03:48.3879449Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:03:48.3879586Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:03:48.3879796Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:03:48.3879997Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:03:48.3880212Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:03:48.3880393Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:03:48.3880522Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:03:48.3880646Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:48.3880753Z U __once_proxy@GLIBCXX_3.4.11 2025-05-07T20:03:48.3880857Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:03:48.3881119Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:03:48.3881255Z U at::RecordFunction::currentThreadId() 2025-05-07T20:03:48.3881381Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:03:48.3881523Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:03:48.3881652Z U at::SplitUntil32Bit::begin() const 2025-05-07T20:03:48.3881765Z U at::SplitUntil32Bit::end() const 2025-05-07T20:03:48.3882041Z U at::SplitUntil32Bit::iterator::operator*() const 2025-05-07T20:03:48.3882189Z U at::SplitUntil32Bit::iterator::operator++() 2025-05-07T20:03:48.3882459Z U at::Tensor::index(std::initializer_list) const 2025-05-07T20:03:48.3882690Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:03:48.3882878Z U at::TensorIteratorBase::build(at::TensorIteratorConfig&) 2025-05-07T20:03:48.3883048Z U at::TensorIteratorBase::can_use_32bit_indexing() const 2025-05-07T20:03:48.3883182Z U at::TensorIteratorBase::data_ptr(long) const 2025-05-07T20:03:48.3883332Z U at::TensorIteratorBase::is_contiguous() const 2025-05-07T20:03:48.3883454Z U at::TensorIteratorBase::numel() const 2025-05-07T20:03:48.3883604Z U at::TensorIteratorBase::with_32bit_indexing() const 2025-05-07T20:03:48.3883832Z U at::TensorIteratorConfig::add_borrowed_input(at::TensorBase const&) 2025-05-07T20:03:48.3884046Z U at::TensorIteratorConfig::add_borrowed_output(at::TensorBase const&) 2025-05-07T20:03:48.3884158Z U at::TensorMaker::make_tensor() 2025-05-07T20:03:48.3884310Z U at::_ops::_is_all_true::call(at::Tensor const&) 2025-05-07T20:03:48.3884466Z U at::_ops::_unique::call(at::Tensor const&, bool, bool) 2025-05-07T20:03:48.3884706Z U at::_ops::add_Tensor::call(at::Tensor const&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:03:48.3884944Z U at::_ops::add__Tensor::call(at::Tensor&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:03:48.3885061Z U at::_ops::all::call(at::Tensor const&) 2025-05-07T20:03:48.3885399Z U at::_ops::baddbmm::call(at::Tensor const&, at::Tensor const&, at::Tensor const&, c10::Scalar const&, c10::Scalar const&) 2025-05-07T20:03:48.3885736Z U at::_ops::broadcast_to::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:03:48.3885918Z U at::_ops::cat::call(c10::IListRef const&, long) 2025-05-07T20:03:48.3886123Z U at::_ops::cat_out::call(c10::IListRef const&, long, at::Tensor&) 2025-05-07T20:03:48.3886303Z U at::_ops::clamp_max::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:03:48.3886497Z U at::_ops::clone::call(at::Tensor const&, std::optional) 2025-05-07T20:03:48.3886669Z U at::_ops::contiguous::call(at::Tensor const&, c10::MemoryFormat) 2025-05-07T20:03:48.3886940Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:03:48.3887153Z U at::_ops::cumsum::call(at::Tensor const&, long, std::optional) 2025-05-07T20:03:48.3887471Z U at::_ops::diff::call(at::Tensor const&, long, long, std::optional const&, std::optional const&) 2025-05-07T20:03:48.3887661Z U at::_ops::div_Scalar::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:03:48.3888202Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:48.3888848Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:48.3889000Z U at::_ops::fill__Scalar::call(at::Tensor&, c10::Scalar const&) 2025-05-07T20:03:48.3889174Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:03:48.3889306Z U at::_ops::floor::call(at::Tensor const&) 2025-05-07T20:03:48.3889806Z U at::_ops::full::call(c10::ArrayRef, c10::Scalar const&, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:48.3889986Z U at::_ops::ge_Scalar::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:03:48.3890308Z U at::_ops::index_put_::call(at::Tensor&, c10::List > const&, at::Tensor const&, bool) 2025-05-07T20:03:48.3890729Z U at::_ops::index_select::call(at::Tensor const&, long, at::Tensor const&) 2025-05-07T20:03:48.3891064Z U at::_ops::item::call(at::Tensor const&) 2025-05-07T20:03:48.3891259Z U at::_ops::le_Tensor::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:03:48.3891454Z U at::_ops::max::call(at::Tensor const&) 2025-05-07T20:03:48.3891623Z U at::_ops::mul_Tensor::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:03:48.3892204Z U at::_ops::ones_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:48.3892408Z U at::_ops::permute::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:03:48.3892925Z U at::_ops::range::call(c10::Scalar const&, c10::Scalar const&, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:48.3893119Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:03:48.3893439Z U at::_ops::resize_::call(at::Tensor const&, c10::ArrayRef, std::optional) 2025-05-07T20:03:48.3893611Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:03:48.3894076Z U at::_ops::set__source_Storage_storage_offset::call(at::Tensor&, c10::Storage, c10::SymInt, c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:48.3894487Z U at::_ops::slice_Tensor::call(at::Tensor const&, long, std::optional, std::optional, c10::SymInt) 2025-05-07T20:03:48.3894635Z U at::_ops::sort::call(at::Tensor const&, long, bool) 2025-05-07T20:03:48.3894882Z U at::_ops::split_sizes::call(at::Tensor const&, c10::ArrayRef, long) 2025-05-07T20:03:48.3895031Z U at::_ops::squeeze_dim::call(at::Tensor const&, long) 2025-05-07T20:03:48.3895263Z U at::_ops::sub_Tensor::call(at::Tensor const&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:03:48.3895469Z U at::_ops::sum::call(at::Tensor const&, std::optional) 2025-05-07T20:03:48.3895731Z U at::_ops::tensor_split_indices::call(at::Tensor const&, c10::ArrayRef, long) 2025-05-07T20:03:48.3896039Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:03:48.3896692Z U at::_ops::to_dtype_layout::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, bool, bool, std::optional) 2025-05-07T20:03:48.3896870Z U at::_ops::transpose_int::call(at::Tensor const&, long, long) 2025-05-07T20:03:48.3897169Z U at::_ops::unique_consecutive::call(at::Tensor const&, bool, bool, std::optional) 2025-05-07T20:03:48.3897447Z U at::_ops::unsqueeze::call(at::Tensor const&, long) 2025-05-07T20:03:48.3897613Z U at::_ops::view::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:03:48.3897767Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:03:48.3897889Z U at::_ops::zero_::call(at::Tensor&) 2025-05-07T20:03:48.3898331Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:48.3898903Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:48.3899209Z U at::checkScalarTypes(char const*, at::TensorArg const&, c10::ArrayRef) 2025-05-07T20:03:48.3899333Z U at::cuda::getCurrentCUDABlasHandle() 2025-05-07T20:03:48.3899454Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:03:48.3899598Z U at::cuda::getDeviceProperties(signed char) 2025-05-07T20:03:48.3899728Z U at::cuda::get_p2p_access(signed char, signed char) 2025-05-07T20:03:48.3900044Z U at::detail::computeStorageNbytes(c10::ArrayRef, c10::ArrayRef, unsigned long, unsigned long) 2025-05-07T20:03:48.3900184Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:03:48.3900328Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:03:48.3900425Z U at::get_num_threads() 2025-05-07T20:03:48.3900541Z U at::get_thread_num() 2025-05-07T20:03:48.3900637Z U at::in_parallel_region() 2025-05-07T20:03:48.3900727Z U at::init_num_threads() 2025-05-07T20:03:48.3900940Z U at::internal::OpaqueOptionalTensorRef::~OpaqueOptionalTensorRef() 2025-05-07T20:03:48.3901046Z U at::internal::set_thread_num(int) 2025-05-07T20:03:48.3901265Z U at::native::_rowwise_prune(at::Tensor const&, at::Tensor const&, c10::ScalarType) 2025-05-07T20:03:48.3901807Z U at::native::empty_like(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:48.3902415Z U at::native::empty_meta_symint(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:48.3902665Z U at::native::resize_(at::Tensor const&, c10::ArrayRef, std::optional) 2025-05-07T20:03:48.3902811Z U at::print(std::ostream&, at::Tensor const&, long) 2025-05-07T20:03:48.3902932Z U at::sequence_number::get_and_increment() 2025-05-07T20:03:48.3903082Z U at::tensor(c10::ArrayRef, c10::TensorOptions const&) 2025-05-07T20:03:48.3903205Z U bool at::Tensor::item() const 2025-05-07T20:03:48.3903327Z U bool* at::TensorBase::data_ptr() const 2025-05-07T20:03:48.3903468Z U bool* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:48.3903572Z U c10::AnyType::get() 2025-05-07T20:03:48.3903725Z U c10::AutogradMetaInterface::~AutogradMetaInterface() 2025-05-07T20:03:48.3903889Z U c10::BFloat16* at::TensorBase::data_ptr() const 2025-05-07T20:03:48.3904097Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:48.3904184Z U c10::BoolType::get() 2025-05-07T20:03:48.3904284Z U c10::DeviceObjType::get() 2025-05-07T20:03:48.3904472Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:03:48.3904632Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:03:48.3904745Z U c10::Dispatcher::realSingleton() 2025-05-07T20:03:48.3905243Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:03:48.3905825Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:03:48.3906220Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:03:48.3906353Z U c10::Error::what() const 2025-05-07T20:03:48.3906443Z U c10::FloatType::get() 2025-05-07T20:03:48.3906544Z U c10::GradMode::is_enabled() 2025-05-07T20:03:48.3906670Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:03:48.3906805Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:03:48.3906965Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:48.3907127Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:03:48.3907237Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:03:48.3907336Z U c10::IValue::isBoolList() const 2025-05-07T20:03:48.3907459Z U c10::IValue::isIntList() const 2025-05-07T20:03:48.3907566Z U c10::IValue::isSymIntList() const 2025-05-07T20:03:48.3907672Z U c10::IValue::isTensorList() const 2025-05-07T20:03:48.3907808Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:03:48.3907911Z U c10::InferenceMode::is_enabled() 2025-05-07T20:03:48.3908006Z U c10::IntType::get() 2025-05-07T20:03:48.3908448Z U c10::ListType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:48.3908607Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:03:48.3908723Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:03:48.3908837Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:03:48.3908984Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:03:48.3909187Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:48.3909308Z U c10::ParallelGuard::ParallelGuard(bool) 2025-05-07T20:03:48.3909431Z U c10::ParallelGuard::~ParallelGuard() 2025-05-07T20:03:48.3909529Z U c10::ScalarTypeType::get() 2025-05-07T20:03:48.3909792Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:03:48.3910094Z U c10::SmallVectorBase::mallocForGrow(unsigned long, unsigned long, unsigned long&) 2025-05-07T20:03:48.3910239Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:48.3910341Z U c10::StringType::get() 2025-05-07T20:03:48.3910486Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:03:48.3910639Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:03:48.3910775Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:03:48.3911171Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:03:48.3911391Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:03:48.3911547Z U c10::SymInt::operator%(c10::SymInt const&) const 2025-05-07T20:03:48.3911882Z U c10::SymInt::operator*(c10::SymInt const&) const 2025-05-07T20:03:48.3912017Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:03:48.3912136Z U c10::SymInt::promote_to_negative() 2025-05-07T20:03:48.3912287Z U c10::SymInt::sym_eq(c10::SymInt const&) const 2025-05-07T20:03:48.3912427Z U c10::SymInt::sym_ne(c10::SymInt const&) const 2025-05-07T20:03:48.3912638Z U c10::SymInt::toSymNode() const 2025-05-07T20:03:48.3912760Z U c10::SymIntType::get() 2025-05-07T20:03:48.3913005Z U c10::SymbolicShapeMeta::init_is_channels_last_3d_contiguous() const 2025-05-07T20:03:48.3913236Z U c10::SymbolicShapeMeta::init_is_channels_last_contiguous() const 2025-05-07T20:03:48.3913399Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:03:48.3913523Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:03:48.3913969Z U c10::TensorImpl::set_autograd_meta(std::unique_ptr >) 2025-05-07T20:03:48.3914137Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:48.3914289Z U c10::TensorImpl::throw_storage_access_error() const 2025-05-07T20:03:48.3914392Z U c10::TensorType::get() 2025-05-07T20:03:48.3915184Z U c10::TupleType::TupleType(std::vector, std::allocator > >, std::optional, std::shared_ptr) 2025-05-07T20:03:48.3915377Z U c10::Type::isSubtypeOfExt(c10::Type const&, std::ostream*) const 2025-05-07T20:03:48.3915486Z U c10::Type::is_module() const 2025-05-07T20:03:48.3915622Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:03:48.3916327Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:03:48.3916458Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:03:48.3916638Z U c10::cuda::CUDAKernelLaunchRegistry::get_singleton_ref() 2025-05-07T20:03:48.3916928Z U c10::cuda::CUDAKernelLaunchRegistry::get_uvm_assertions_ptr_for_current_device() 2025-05-07T20:03:48.3917265Z U c10::cuda::CUDAKernelLaunchRegistry::insert(char const*, char const*, unsigned int, char const*, int) 2025-05-07T20:03:48.3917396Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:03:48.3917516Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:03:48.3917631Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:03:48.3917766Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:03:48.3918021Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:03:48.3918266Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:03:48.3918378Z U c10::cuda::current_device() 2025-05-07T20:03:48.3918597Z U c10::cuda::device_count() 2025-05-07T20:03:48.3918727Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:03:48.3918856Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:03:48.3918997Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:03:48.3919124Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:03:48.3919269Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:03:48.3919386Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:03:48.3919806Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:48.3920298Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:48.3920532Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:03:48.3920986Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:48.3921365Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:03:48.3922102Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:48.3922368Z U c10::impl::ExcludeDispatchKeyGuard::ExcludeDispatchKeyGuard(c10::DispatchKeySet) 2025-05-07T20:03:48.3922573Z U c10::impl::ExcludeDispatchKeyGuard::~ExcludeDispatchKeyGuard() 2025-05-07T20:03:48.3922687Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:03:48.3922789Z U c10::impl::GPUTrace::haveState 2025-05-07T20:03:48.3923122Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:03:48.3923301Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:03:48.3923429Z U c10::impl::PyObjectSlot::PyObjectSlot() 2025-05-07T20:03:48.3923565Z U c10::impl::PyObjectSlot::~PyObjectSlot() 2025-05-07T20:03:48.3923712Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:03:48.3923875Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:03:48.3924001Z U c10::impl::device_guard_impl_registry 2025-05-07T20:03:48.3924124Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:03:48.3924271Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:03:48.3924643Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:03:48.3924785Z U c10::operator*(c10::SymInt const&, int) 2025-05-07T20:03:48.3924903Z U c10::operator+(c10::SymInt const&, int) 2025-05-07T20:03:48.3925058Z U c10::operator+(c10::SymInt const&, unsigned long) 2025-05-07T20:03:48.3925184Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:03:48.3925318Z U c10::operator-(c10::SymInt const&, unsigned long) 2025-05-07T20:03:48.3925444Z U c10::operator/(c10::SymInt const&, int) 2025-05-07T20:03:48.3925559Z U c10::operator<(c10::SymInt const&, int) 2025-05-07T20:03:48.3925693Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:03:48.3925819Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:03:48.3925985Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:03:48.3926117Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:03:48.3926237Z U c10::operator==(c10::SymInt const&, int) 2025-05-07T20:03:48.3926366Z U c10::operator>(c10::SymInt const&, int) 2025-05-07T20:03:48.3926484Z U c10::operator>=(c10::SymInt const&, int) 2025-05-07T20:03:48.3926595Z U c10::report_overflow(char const*) 2025-05-07T20:03:48.3926718Z U c10::throwNullDataPtrError() 2025-05-07T20:03:48.3926857Z U c10::typeKindToString(c10::TypeKind) 2025-05-07T20:03:48.3926959Z U c10::warn(c10::Warning const&) 2025-05-07T20:03:48.3927086Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:03:48.3927458Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:03:48.3927574Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:03:48.3927697Z U cublasGemmStridedBatchedEx 2025-05-07T20:03:48.3927797Z U cublasSetStream_v2 2025-05-07T20:03:48.3928066Z U cudaDeviceGetAttribute@libcudart.so.11.0 2025-05-07T20:03:48.3928197Z U cudaDeviceGetByPCIBusId@libcudart.so.11.0 2025-05-07T20:03:48.3928366Z U cudaDeviceSynchronize@libcudart.so.11.0 2025-05-07T20:03:48.3928524Z U cudaEventCreateWithFlags@libcudart.so.11.0 2025-05-07T20:03:48.3928636Z U cudaEventDestroy@libcudart.so.11.0 2025-05-07T20:03:48.3928773Z U cudaEventElapsedTime@libcudart.so.11.0 2025-05-07T20:03:48.3928883Z U cudaEventQuery@libcudart.so.11.0 2025-05-07T20:03:48.3928996Z U cudaEventRecord@libcudart.so.11.0 2025-05-07T20:03:48.3929129Z U cudaEventSynchronize@libcudart.so.11.0 2025-05-07T20:03:48.3929231Z U cudaFree@libcudart.so.11.0 2025-05-07T20:03:48.3929358Z U cudaFuncGetAttributes@libcudart.so.11.0 2025-05-07T20:03:48.3929476Z U cudaFuncSetAttribute@libcudart.so.11.0 2025-05-07T20:03:48.3929600Z U cudaGetDevice@libcudart.so.11.0 2025-05-07T20:03:48.3929718Z U cudaGetDeviceCount@libcudart.so.11.0 2025-05-07T20:03:48.3929857Z U cudaGetDeviceProperties@libcudart.so.11.0 2025-05-07T20:03:48.3930033Z U cudaGetErrorString@libcudart.so.11.0 2025-05-07T20:03:48.3930177Z U cudaGetLastError@libcudart.so.11.0 2025-05-07T20:03:48.3930359Z U cudaHostGetDevicePointer@libcudart.so.11.0 2025-05-07T20:03:48.3930712Z U cudaHostRegister@libcudart.so.11.0 2025-05-07T20:03:48.3930915Z U cudaHostUnregister@libcudart.so.11.0 2025-05-07T20:03:48.3931070Z U cudaLaunchKernel@libcudart.so.11.0 2025-05-07T20:03:48.3931236Z U cudaMallocManaged@libcudart.so.11.0 2025-05-07T20:03:48.3931477Z U cudaMemAdvise@libcudart.so.11.0 2025-05-07T20:03:48.3931741Z U cudaMemPrefetchAsync@libcudart.so.11.0 2025-05-07T20:03:48.3931969Z U cudaMemcpy2DAsync@libcudart.so.11.0 2025-05-07T20:03:48.3932170Z U cudaMemsetAsync@libcudart.so.11.0 2025-05-07T20:03:48.3932459Z U cudaOccupancyMaxActiveBlocksPerMultiprocessorWithFlags@libcudart.so.11.0 2025-05-07T20:03:48.3932585Z U cudaPeekAtLastError@libcudart.so.11.0 2025-05-07T20:03:48.3932713Z U cudaSetDevice@libcudart.so.11.0 2025-05-07T20:03:48.3932829Z U cudaStreamQuery@libcudart.so.11.0 2025-05-07T20:03:48.3932964Z U cudaStreamSynchronize@libcudart.so.11.0 2025-05-07T20:03:48.3933088Z U cudaStreamWaitEvent@libcudart.so.11.0 2025-05-07T20:03:48.3933246Z U double* at::TensorBase::data_ptr() const 2025-05-07T20:03:48.3933412Z U double* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:48.3933508Z U exit@GLIBC_2.2.5 2025-05-07T20:03:48.3933616Z U exp10@GLIBC_2.2.5 2025-05-07T20:03:48.3933707Z U exp@GLIBC_2.2.5 2025-05-07T20:03:48.3933796Z U expf@GLIBC_2.2.5 2025-05-07T20:03:48.3934007Z U fbgemm_gpu::asynchronous_complete_cumsum_cpu(at::Tensor const&) 2025-05-07T20:03:48.3934201Z U fbgemm_gpu::asynchronous_complete_cumsum_gpu(at::Tensor const&) 2025-05-07T20:03:48.3934403Z U fbgemm_gpu::asynchronous_exclusive_cumsum_cpu(at::Tensor const&) 2025-05-07T20:03:48.3934660Z U fbgemm_gpu::asynchronous_exclusive_cumsum_gpu(at::Tensor const&) 2025-05-07T20:03:48.3934854Z U fbgemm_gpu::asynchronous_inclusive_cumsum_gpu(at::Tensor const&) 2025-05-07T20:03:48.3934990Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:03:48.3935146Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:48.3935253Z U fminf@GLIBC_2.2.5 2025-05-07T20:03:48.3935342Z U fmod@GLIBC_2.2.5 2025-05-07T20:03:48.3935431Z U free@GLIBC_2.2.5 2025-05-07T20:03:48.3935564Z U get_info_B_num_bits_from_T(int, int) 2025-05-07T20:03:48.3935707Z U int at::Tensor::item() const 2025-05-07T20:03:48.3935900Z U int const* at::TensorBase::const_data_ptr() const 2025-05-07T20:03:48.3936041Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:03:48.3936184Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:48.3936279Z U lgamma@GLIBC_2.2.5 2025-05-07T20:03:48.3936383Z U llrint@GLIBC_2.2.5 2025-05-07T20:03:48.3936470Z U log10@GLIBC_2.2.5 2025-05-07T20:03:48.3936559Z U log2@GLIBC_2.2.5 2025-05-07T20:03:48.3936645Z U log@GLIBC_2.2.5 2025-05-07T20:03:48.3936769Z U long at::Tensor::item() const 2025-05-07T20:03:48.3936940Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:03:48.3937112Z U long const* at::TensorBase::const_data_ptr() const 2025-05-07T20:03:48.3937256Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:03:48.3937405Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:48.3937493Z U lrint@GLIBC_2.2.5 2025-05-07T20:03:48.3937593Z U madvise@GLIBC_2.2.5 2025-05-07T20:03:48.3937686Z U malloc@GLIBC_2.2.5 2025-05-07T20:03:48.3937777Z U memcmp@GLIBC_2.2.5 2025-05-07T20:03:48.3937867Z U memcpy@GLIBC_2.14 2025-05-07T20:03:48.3937972Z U memmove@GLIBC_2.2.5 2025-05-07T20:03:48.3938062Z U memset@GLIBC_2.2.5 2025-05-07T20:03:48.3938163Z U nvmlDeviceGetCount_v2 2025-05-07T20:03:48.3938285Z U nvmlDeviceGetHandleByIndex_v2 2025-05-07T20:03:48.3938414Z U nvmlDeviceGetNvLinkRemotePciInfo_v2 2025-05-07T20:03:48.3938518Z U nvmlDeviceGetNvLinkState 2025-05-07T20:03:48.3938649Z U nvmlDeviceGetPciInfo_v3 2025-05-07T20:03:48.3938757Z U nvmlInit_v2 2025-05-07T20:03:48.3938852Z U omp_get_num_threads 2025-05-07T20:03:48.3938943Z U omp_get_thread_num 2025-05-07T20:03:48.3939101Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:03:48.3939222Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:48.3939347Z U operator new[](unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:48.3939450Z U pow@GLIBC_2.2.5 2025-05-07T20:03:48.3939544Z U printf@GLIBC_2.2.5 2025-05-07T20:03:48.3939636Z U puts@GLIBC_2.2.5 2025-05-07T20:03:48.3939725Z U realloc@GLIBC_2.2.5 2025-05-07T20:03:48.3939897Z U short* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:48.3940094Z U signed char* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:48.3940190Z U sin@GLIBC_2.2.5 2025-05-07T20:03:48.3940409Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:03:48.3940583Z U std::_Rb_tree_decrement(std::_Rb_tree_node_base*)@GLIBCXX_3.4 2025-05-07T20:03:48.3940770Z U std::_Rb_tree_increment(std::_Rb_tree_node_base const*)@GLIBCXX_3.4 2025-05-07T20:03:48.3940985Z U std::_Rb_tree_increment(std::_Rb_tree_node_base*)@GLIBCXX_3.4 2025-05-07T20:03:48.3941380Z U std::_Rb_tree_insert_and_rebalance(bool, std::_Rb_tree_node_base*, std::_Rb_tree_node_base*, std::_Rb_tree_node_base&)@GLIBCXX_3.4 2025-05-07T20:03:48.3941538Z U std::__basic_file::~__basic_file()@GLIBCXX_3.4 2025-05-07T20:03:48.3941916Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:03:48.3942321Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:48.3942815Z U std::__cxx11::basic_string, std::allocator >::_M_append(char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:48.3943361Z U std::__cxx11::basic_string, std::allocator >::_M_assign(std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:48.3943752Z U std::__cxx11::basic_string, std::allocator >::_M_construct(unsigned long, char)@GLIBCXX_3.4.21 2025-05-07T20:03:48.3944316Z U std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:48.3944879Z U std::__cxx11::basic_string, std::allocator >::_M_mutate(unsigned long, unsigned long, char const*, unsigned long) 2025-05-07T20:03:48.3945358Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:48.3945706Z U std::__cxx11::basic_string, std::allocator >::append(char const*)@GLIBCXX_3.4.21 2025-05-07T20:03:48.3946226Z U std::__cxx11::basic_string, std::allocator >::swap(std::__cxx11::basic_string, std::allocator >&)@GLIBCXX_3.4.21 2025-05-07T20:03:48.3946555Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const@GLIBCXX_3.4.21 2025-05-07T20:03:48.3946912Z U std::__cxx11::basic_stringstream, std::allocator >::~basic_stringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:48.3947302Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:03:48.3947463Z U std::__exception_ptr::exception_ptr::_M_addref() 2025-05-07T20:03:48.3947601Z U std::__exception_ptr::exception_ptr::_M_release() 2025-05-07T20:03:48.3947716Z U std::__once_call@GLIBCXX_3.4.11 2025-05-07T20:03:48.3947854Z U std::__once_callable@GLIBCXX_3.4.11 2025-05-07T20:03:48.3947968Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:03:48.3948082Z U std::__throw_bad_array_new_length() 2025-05-07T20:03:48.3948212Z U std::__throw_bad_cast()@GLIBCXX_3.4 2025-05-07T20:03:48.3948341Z U std::__throw_bad_function_call()@GLIBCXX_3.4.14 2025-05-07T20:03:48.3948483Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:48.3948644Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:48.3948777Z U std::__throw_out_of_range(char const*)@GLIBCXX_3.4 2025-05-07T20:03:48.3948949Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:03:48.3949100Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:03:48.3949235Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:03:48.3949484Z U std::basic_filebuf >::basic_filebuf()@GLIBCXX_3.4 2025-05-07T20:03:48.3949702Z U std::basic_filebuf >::close()@GLIBCXX_3.4 2025-05-07T20:03:48.3949992Z U std::basic_filebuf >::open(char const*, std::_Ios_Openmode)@GLIBCXX_3.4 2025-05-07T20:03:48.3950215Z U std::basic_filebuf >::~basic_filebuf()@GLIBCXX_3.4 2025-05-07T20:03:48.3950470Z U std::basic_ifstream >::~basic_ifstream()@GLIBCXX_3.4 2025-05-07T20:03:48.3950719Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:03:48.3952692Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:03:48.3952980Z U std::basic_ofstream >::~basic_ofstream()@GLIBCXX_3.4 2025-05-07T20:03:48.3953565Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:03:48.3954084Z U std::basic_ostream >& std::operator<< >(std::basic_ostream >&, char const*)@GLIBCXX_3.4 2025-05-07T20:03:48.3954278Z U std::chrono::_V2::system_clock::now()@GLIBCXX_3.4.19 2025-05-07T20:03:48.3954392Z U std::cout@GLIBCXX_3.4 2025-05-07T20:03:48.3954548Z U std::ctype::_M_widen_init() const@GLIBCXX_3.4.11 2025-05-07T20:03:48.3954703Z U std::current_exception()@CXXABI_1.3.3 2025-05-07T20:03:48.3954843Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:03:48.3954965Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:03:48.3955112Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:03:48.3955237Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:03:48.3955355Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:03:48.3955493Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:03:48.3955615Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:03:48.3955819Z U std::ostream& std::ostream::_M_insert(double)@GLIBCXX_3.4.9 2025-05-07T20:03:48.3956056Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:03:48.3956303Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:03:48.3956428Z U std::ostream::flush()@GLIBCXX_3.4 2025-05-07T20:03:48.3956570Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:03:48.3956695Z U std::ostream::put(char)@GLIBCXX_3.4 2025-05-07T20:03:48.3956848Z U std::ostream::write(char const*, long)@GLIBCXX_3.4 2025-05-07T20:03:48.3957018Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:03:48.3957179Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:03:48.3957413Z U std::rethrow_exception(std::__exception_ptr::exception_ptr)@CXXABI_1.3.3 2025-05-07T20:03:48.3957882Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:03:48.3958026Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:03:48.3958144Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:48.3958251Z U strcmp@GLIBC_2.2.5 2025-05-07T20:03:48.3958363Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:48.3958465Z U sysconf@GLIBC_2.2.5 2025-05-07T20:03:48.3958627Z U torch::CppFunction::~CppFunction() 2025-05-07T20:03:48.3959237Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:03:48.3959704Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:03:48.3960235Z U torch::Library::_def(std::variant&&, torch::CppFunction&&, std::vector > const&) & 2025-05-07T20:03:48.3960548Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:03:48.3960679Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:03:48.3961006Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:03:48.3961197Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:03:48.3961402Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:03:48.3961619Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:03:48.3961976Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:03:48.3962133Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:03:48.3962346Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:03:48.3962538Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:03:48.3962667Z U torch::autograd::Node::assign_parent() 2025-05-07T20:03:48.3962808Z U torch::autograd::Node::metadata() 2025-05-07T20:03:48.3962954Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:03:48.3963204Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:03:48.3963665Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:03:48.3963817Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:03:48.3964035Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:03:48.3964430Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:03:48.3966936Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:03:48.3967115Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:03:48.3967265Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:03:48.3967422Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:03:48.3967586Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:03:48.3967993Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:03:48.3968338Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:03:48.3968730Z U torch::jit::parseSchemaOrName(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:03:48.3968933Z U torch::pickle_load(std::vector > const&) 2025-05-07T20:03:48.3969061Z U torch::pickle_save(c10::IValue const&) 2025-05-07T20:03:48.3969888Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:03:48.3970027Z U typeinfo for c10::Error 2025-05-07T20:03:48.3970215Z U typeinfo for c10::Type 2025-05-07T20:03:48.3970372Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:03:48.3970499Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:03:48.3970635Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:03:48.3970779Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:03:48.3970902Z U typeinfo for torch::autograd::Node 2025-05-07T20:03:48.3971110Z U unsigned char* at::TensorBase::data_ptr() const 2025-05-07T20:03:48.3971338Z U unsigned char* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:48.3971778Z U void fbgemm::FloatOrHalfToFused8BitRowwiseQuantizedSBFloat(float const*, unsigned long, int, unsigned char*) 2025-05-07T20:03:48.3972301Z U void fbgemm::FloatOrHalfToFused8BitRowwiseQuantizedSBFloat(unsigned short const*, unsigned long, int, unsigned char*) 2025-05-07T20:03:48.3972764Z U void fbgemm::FloatOrHalfToFusedNBitRowwiseQuantizedSBHalf(int, float const*, unsigned long, int, unsigned char*) 2025-05-07T20:03:48.3973281Z U void fbgemm::FloatOrHalfToFusedNBitRowwiseQuantizedSBHalf(int, unsigned short const*, unsigned long, int, unsigned char*) 2025-05-07T20:03:48.3973738Z U void fbgemm::Fused8BitRowwiseQuantizedSBFloatToFloatOrHalf(unsigned char const*, unsigned long, int, float*) 2025-05-07T20:03:48.3974259Z U void fbgemm::Fused8BitRowwiseQuantizedSBFloatToFloatOrHalf(unsigned char const*, unsigned long, int, unsigned short*) 2025-05-07T20:03:48.3974712Z U void fbgemm::FusedNBitRowwiseQuantizedSBHalfToFloatOrHalf(int, unsigned char const*, unsigned long, int, float*, bool) 2025-05-07T20:03:48.3975242Z U void fbgemm::FusedNBitRowwiseQuantizedSBHalfToFloatOrHalf(int, unsigned char const*, unsigned long, int, unsigned short*, bool) 2025-05-07T20:03:48.3975735Z U void fbgemm::FusedNBitRowwiseQuantizedSBHalfToFloatOrHalfRef(int, unsigned char const*, unsigned long, int, float*, bool) 2025-05-07T20:03:48.3976297Z U void fbgemm::FusedNBitRowwiseQuantizedSBHalfToFloatOrHalfRef(int, unsigned char const*, unsigned long, int, unsigned short*, bool) 2025-05-07T20:03:48.3976870Z U void fbgemm::FusedNBitRowwiseQuantizedSBHalfToFloatOrHalfRef(int, unsigned char const*, unsigned long, int, unsigned short*, bool) 2025-05-07T20:03:48.3977015Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:48.3977188Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:03:48.3977340Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:03:48.3977490Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:48.3977688Z U vtable for __cxxabiv1::__vmi_class_type_info@CXXABI_1.3 2025-05-07T20:03:48.3977799Z U vtable for at::TensorIterator 2025-05-07T20:03:48.3977915Z U vtable for at::TensorIteratorBase 2025-05-07T20:03:48.3978014Z U vtable for c10::Error 2025-05-07T20:03:48.3978137Z U vtable for c10::ListType 2025-05-07T20:03:48.3978477Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:48.3978803Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:48.3979197Z U vtable for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:48.3979329Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:03:48.3979542Z U vtable for std::basic_filebuf >@GLIBCXX_3.4 2025-05-07T20:03:48.3979780Z U vtable for std::basic_ifstream >@GLIBCXX_3.4 2025-05-07T20:03:48.3979971Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:03:48.3980191Z U vtable for std::basic_ofstream >@GLIBCXX_3.4 2025-05-07T20:03:48.3980409Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:03:48.3980536Z U vtable for torch::autograd::AutogradMeta 2025-05-07T20:03:48.3980646Z U vtable for torch::autograd::Node 2025-05-07T20:03:48.3980830Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:03:48.3980937Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:48.3981047Z w _ITM_registerTMCloneTable 2025-05-07T20:03:48.3981179Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:48.3981288Z w __cxa_pure_virtual@CXXABI_1.3 2025-05-07T20:03:48.3981385Z w __gmon_start__ 2025-05-07T20:03:48.3981510Z w __pthread_key_create 2025-05-07T20:03:48.3981621Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:03:48.3981733Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:03:48.3981828Z w pthread_once 2025-05-07T20:03:48.3981990Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:48.3982184Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:03:48.3982194Z 2025-05-07T20:03:48.3982352Z linux-vdso.so.1 (0x00007fffb97f6000) 2025-05-07T20:03:48.3982438Z libc10.so => not found 2025-05-07T20:03:48.3982532Z libc10_cuda.so => not found 2025-05-07T20:03:48.3982906Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so (0x00007f70c2a00000) 2025-05-07T20:03:48.3983003Z libnvidia-ml.so.1 => not found 2025-05-07T20:03:48.3983092Z libtorch.so => not found 2025-05-07T20:03:48.3983624Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007f70c5d26000) 2025-05-07T20:03:48.3984084Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007f70c2000000) 2025-05-07T20:03:48.3984176Z libtorch_cpu.so => not found 2025-05-07T20:03:48.3984271Z libtorch_cuda.so => not found 2025-05-07T20:03:48.3984386Z libcudart.so.11.0 => not found 2025-05-07T20:03:48.3984543Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f70c1d9c000) 2025-05-07T20:03:48.3984666Z libm.so.6 => /lib64/libm.so.6 (0x00007f70c2925000) 2025-05-07T20:03:48.3984818Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f70c5cf6000) 2025-05-07T20:03:48.3984930Z libc.so.6 => /lib64/libc.so.6 (0x00007f70c1b94000) 2025-05-07T20:03:48.3985082Z /lib64/ld-linux-x86-64.so.2 (0x00007f70c5e2f000) 2025-05-07T20:03:48.3985171Z libc10.so => not found 2025-05-07T20:03:48.3985526Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so (0x00007f70c5c7b000) 2025-05-07T20:03:48.3985617Z libtorch.so => not found 2025-05-07T20:03:48.3985708Z libtorch_cpu.so => not found 2025-05-07T20:03:48.3985808Z libtorch_cuda.so => not found 2025-05-07T20:03:48.3985889Z libc10.so => not found 2025-05-07T20:03:48.3985980Z libc10_cuda.so => not found 2025-05-07T20:03:48.3986092Z libtorch.so => not found 2025-05-07T20:03:48.3986184Z libtorch_cpu.so => not found 2025-05-07T20:03:48.3986277Z libtorch_cuda.so => not found 2025-05-07T20:03:48.3986395Z libcudart.so.11.0 => not found 2025-05-07T20:03:48.3986522Z libtorch.so => not found 2025-05-07T20:03:48.3986606Z libc10.so => not found 2025-05-07T20:03:48.3986695Z libc10_cuda.so => not found 2025-05-07T20:03:48.3986793Z libtorch_cpu.so => not found 2025-05-07T20:03:48.3986888Z libtorch_cuda.so => not found 2025-05-07T20:03:48.3986984Z libcudart.so.11.0 => not found 2025-05-07T20:03:48.3987072Z libtorch_cpu.so => not found 2025-05-07T20:03:48.3987169Z libtorch_cuda.so => not found 2025-05-07T20:03:48.3987258Z libtorch.so => not found 2025-05-07T20:03:48.3987264Z 2025-05-07T20:03:48.3987362Z [CHECK] Displaying ELF information: 2025-05-07T20:03:48.3987569Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:03:48.3987574Z 2025-05-07T20:03:48.3987578Z 2025-05-07T20:03:48.3987736Z Dynamic section at offset 0x27e16f8 contains 43 entries: 2025-05-07T20:03:48.3987853Z Tag Type Name/Value 2025-05-07T20:03:48.3988051Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:03:48.3988244Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:03:48.3988420Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm.so] 2025-05-07T20:03:48.3988618Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:03:48.3988821Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:48.3989055Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_sparse_async_cumsum.so] 2025-05-07T20:03:48.3989262Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_utils.so] 2025-05-07T20:03:48.3989468Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:48.3989660Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:48.3989880Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.11.0] 2025-05-07T20:03:48.3990085Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:48.3990265Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:03:48.3990645Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:48.3991055Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:48.3991566Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:03:48.3991771Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_py.so] 2025-05-07T20:03:48.3991954Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:03:48.3992084Z 0x000000000000000c (INIT) 0x18a000 2025-05-07T20:03:48.3992199Z 0x000000000000000d (FINI) 0x7f44ac 2025-05-07T20:03:48.3992321Z 0x0000000000000019 (INIT_ARRAY) 0x27db600 2025-05-07T20:03:48.3992472Z 0x000000000000001b (INIT_ARRAYSZ) 1160 (bytes) 2025-05-07T20:03:48.3992590Z 0x000000000000001a (FINI_ARRAY) 0x27dba88 2025-05-07T20:03:48.3992710Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:48.3992831Z 0x0000000000000004 (HASH) 0x238 2025-05-07T20:03:48.3992943Z 0x000000006ffffef5 (GNU_HASH) 0x8490 2025-05-07T20:03:48.3993138Z 0x0000000000000005 (STRTAB) 0x35f10 2025-05-07T20:03:48.3993247Z 0x0000000000000006 (SYMTAB) 0x10f68 2025-05-07T20:03:48.3993404Z 0x000000000000000a (STRSZ) 1196066 (bytes) 2025-05-07T20:03:48.3993524Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:48.3993641Z 0x0000000000000003 (PLTGOT) 0x27e29e8 2025-05-07T20:03:48.3993792Z 0x0000000000000002 (PLTRELSZ) 42096 (bytes) 2025-05-07T20:03:48.3993897Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:48.3994008Z 0x0000000000000017 (JMPREL) 0x17f708 2025-05-07T20:03:48.3994177Z 0x0000000000000007 (RELA) 0x15d220 2025-05-07T20:03:48.3994369Z 0x0000000000000008 (RELASZ) 140520 (bytes) 2025-05-07T20:03:48.3994485Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:48.3994583Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:48.3994724Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:48.3994843Z 0x000000006ffffffe (VERNEED) 0x15d080 2025-05-07T20:03:48.3994949Z 0x000000006fffffff (VERNEEDNUM) 6 2025-05-07T20:03:48.3995080Z 0x000000006ffffff0 (VERSYM) 0x159f32 2025-05-07T20:03:48.3995188Z 0x000000006ffffff9 (RELACOUNT) 514 2025-05-07T20:03:48.3995283Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:48.3995288Z 2025-05-07T20:03:48.3995419Z ################################################################################ 2025-05-07T20:03:48.3995424Z 2025-05-07T20:03:48.3995428Z 2025-05-07T20:03:48.3995633Z [CHECK] Verifying sample subset of symbols in the built libraries ... 2025-05-07T20:03:48.4088900Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:03:48.4114362Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:03:48.4333807Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:03:48.4365056Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:03:48.4418928Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_cache.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:03:48.4451062Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_optimizers.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:03:48.4484275Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:03:48.4513930Z [CHECK] Found symbol in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:03:48.4625436Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:03:48.4648295Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:03:48.4868139Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:03:48.4898284Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:03:48.4948603Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_cache.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:03:48.4982333Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_optimizers.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:03:48.5019304Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:03:48.5044784Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:03:48.5434851Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:03:48.5789660Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_inference.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:03:48.6699061Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_forward.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:03:48.6882147Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:03:48.6964293Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_index_select.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:03:48.6993882Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_embedding_inplace_ops.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:03:48.8757631Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:03:48.9006301Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:03:48.9553355Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:03:48.9674222Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:03:48.9975132Z [CHECK] Found symbol in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_py.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:03:48.9976924Z ################################################################################ 2025-05-07T20:03:48.9978409Z [BUILD] Wheel Audit: dist/fbgemm_gpu_nightly-2025.5.7-cp313-cp313-manylinux_2_28_x86_64.whl 2025-05-07T20:03:48.9979588Z 2025-05-07T20:03:48.9980563Z + conda run --no-capture-output -n build_binary auditwheel show dist/fbgemm_gpu_nightly-2025.5.7-cp313-cp313-manylinux_2_28_x86_64.whl 2025-05-07T20:03:48.9981195Z 2025-05-07T20:03:56.8551431Z 2025-05-07T20:03:56.8552447Z fbgemm_gpu_nightly-2025.5.7-cp313-cp313-manylinux_2_28_x86_64.whl is 2025-05-07T20:03:56.8553951Z consistent with the following platform tag: "linux_x86_64". 2025-05-07T20:03:56.8555139Z 2025-05-07T20:03:56.8555634Z The wheel references external versioned symbols in these 2025-05-07T20:03:56.8556957Z system-provided shared libraries: libgcc_s.so.1 with versions 2025-05-07T20:03:56.8557670Z {'GCC_3.0', 'GCC_3.4'}, libstdc++.so.6 with versions 2025-05-07T20:03:56.8558056Z {'GLIBCXX_3.4.14', 'GLIBCXX_3.4.11', 'CXXABI_1.3', 'CXXABI_1.3.5', 2025-05-07T20:03:56.8558483Z 'CXXABI_1.3.3', 'GLIBCXX_3.4.21', 'GLIBCXX_3.4.15', 'GLIBCXX_3.4.18', 2025-05-07T20:03:56.8558909Z 'CXXABI_1.3.9', 'CXXABI_1.3.8', 'GLIBCXX_3.4.20', 'GLIBCXX_3.4', 2025-05-07T20:03:56.8559327Z 'CXXABI_1.3.7', 'GLIBCXX_3.4.19', 'CXXABI_1.3.11', 'GLIBCXX_3.4.9'}, 2025-05-07T20:03:56.8559757Z libc.so.6 with versions {'GLIBC_2.14', 'GLIBC_2.2.5'}, libm.so.6 with 2025-05-07T20:03:56.8560184Z versions {'GLIBC_2.2.5'}, libcudart.so.11.0 with versions 2025-05-07T20:03:56.8560517Z {'libcudart.so.11.0'} 2025-05-07T20:03:56.8560665Z 2025-05-07T20:03:56.8560855Z This constrains the platform tag to "manylinux_2_27_x86_64". In order 2025-05-07T20:03:56.8561317Z to achieve a more compatible tag, you would need to recompile a new 2025-05-07T20:03:56.8561785Z wheel from source on a system with earlier versions of these 2025-05-07T20:03:56.8562174Z libraries, such as a recent manylinux image. 2025-05-07T20:03:56.9385803Z 2025-05-07T20:03:56.9385915Z 2025-05-07T20:03:56.9386806Z ################################################################################ 2025-05-07T20:03:56.9387900Z [BUILD] Enumerating the built wheels ... 2025-05-07T20:03:56.9389268Z + ls -lth dist/fbgemm_gpu_nightly-2025.5.7-cp313-cp313-manylinux_2_28_x86_64.whl 2025-05-07T20:03:56.9390297Z 2025-05-07T20:03:56.9409356Z -rw-r--r--. 1 root root 268M May 7 20:03 dist/fbgemm_gpu_nightly-2025.5.7-cp313-cp313-manylinux_2_28_x86_64.whl 2025-05-07T20:03:56.9409854Z 2025-05-07T20:03:56.9409974Z [BUILD] Enumerating the wheel SHAs ... 2025-05-07T20:03:56.9410425Z + sha1sum dist/fbgemm_gpu_nightly-2025.5.7-cp313-cp313-manylinux_2_28_x86_64.whl 2025-05-07T20:03:56.9410812Z 2025-05-07T20:03:57.4512249Z c543429527c825aa2eaeaef6b8ae33708d0237d2 dist/fbgemm_gpu_nightly-2025.5.7-cp313-cp313-manylinux_2_28_x86_64.whl 2025-05-07T20:03:57.4513112Z 2025-05-07T20:03:57.4513396Z + sha256sum dist/fbgemm_gpu_nightly-2025.5.7-cp313-cp313-manylinux_2_28_x86_64.whl 2025-05-07T20:03:57.4513797Z 2025-05-07T20:03:58.6253843Z 08a0a986c55b7c27400e6e31db454a2e92fc0fae12a31149d4bbadafcde176f4 dist/fbgemm_gpu_nightly-2025.5.7-cp313-cp313-manylinux_2_28_x86_64.whl 2025-05-07T20:03:58.6255802Z 2025-05-07T20:03:58.6256544Z + md5sum dist/fbgemm_gpu_nightly-2025.5.7-cp313-cp313-manylinux_2_28_x86_64.whl 2025-05-07T20:03:58.6257591Z 2025-05-07T20:03:59.0795221Z 90fefcba3fb43efbb0c8e19d6137e819 dist/fbgemm_gpu_nightly-2025.5.7-cp313-cp313-manylinux_2_28_x86_64.whl 2025-05-07T20:03:59.0796693Z 2025-05-07T20:03:59.0797086Z [BUILD] FBGEMM-GPU build + package completed 2025-05-07T20:03:59.0904995Z ##[group]Run actions/upload-artifact@v4 2025-05-07T20:03:59.0905304Z with: 2025-05-07T20:03:59.0905544Z name: fbgemm_default_x86_gcc_py3.13_cu11.8.0.whl 2025-05-07T20:03:59.0905865Z path: fbgemm_gpu/dist/*.whl 2025-05-07T20:03:59.0906155Z if-no-files-found: error 2025-05-07T20:03:59.0906391Z compression-level: 6 2025-05-07T20:03:59.0906621Z overwrite: false 2025-05-07T20:03:59.0906853Z include-hidden-files: false 2025-05-07T20:03:59.0907086Z env: 2025-05-07T20:03:59.0907301Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T20:03:59.0907576Z BUILD_ENV: build_binary 2025-05-07T20:03:59.0907814Z BUILD_TARGET: default 2025-05-07T20:03:59.0908026Z BUILD_VARIANT: cuda 2025-05-07T20:03:59.0908254Z BUILD_CUDA_VERSION: 11.8.0 2025-05-07T20:03:59.0908481Z ##[endgroup] 2025-05-07T20:03:59.0911905Z ##[command]/usr/bin/docker exec 3a46c886120491b2b40bc3a625109b99b6e6604702540dfc4f2e17198255ca9b sh -c "cat /etc/*release | grep ^ID" 2025-05-07T20:03:59.5098216Z With the provided path, there will be 1 file uploaded 2025-05-07T20:03:59.5100142Z Artifact name is valid! 2025-05-07T20:03:59.5100906Z Root directory input is valid! 2025-05-07T20:03:59.5929138Z Beginning upload of artifact content to blob storage 2025-05-07T20:04:00.1520845Z Uploaded bytes 8388608 2025-05-07T20:04:00.5202737Z Uploaded bytes 16777216 2025-05-07T20:04:00.8044091Z Uploaded bytes 25165824 2025-05-07T20:04:01.0669504Z Uploaded bytes 33554432 2025-05-07T20:04:01.4011216Z Uploaded bytes 41943040 2025-05-07T20:04:01.7733950Z Uploaded bytes 50331648 2025-05-07T20:04:02.0700329Z Uploaded bytes 58720256 2025-05-07T20:04:02.4037035Z Uploaded bytes 67108864 2025-05-07T20:04:02.6685337Z Uploaded bytes 75497472 2025-05-07T20:04:03.0299404Z Uploaded bytes 83886080 2025-05-07T20:04:03.2667000Z Uploaded bytes 92274688 2025-05-07T20:04:03.5803057Z Uploaded bytes 100663296 2025-05-07T20:04:03.8955188Z Uploaded bytes 109051904 2025-05-07T20:04:04.3386439Z Uploaded bytes 117440512 2025-05-07T20:04:04.5570529Z Uploaded bytes 125829120 2025-05-07T20:04:04.8573206Z Uploaded bytes 134217728 2025-05-07T20:04:05.1603651Z Uploaded bytes 142606336 2025-05-07T20:04:05.4306977Z Uploaded bytes 150994944 2025-05-07T20:04:05.7520528Z Uploaded bytes 159383552 2025-05-07T20:04:06.0234081Z Uploaded bytes 167772160 2025-05-07T20:04:06.3740645Z Uploaded bytes 176160768 2025-05-07T20:04:06.7494472Z Uploaded bytes 184549376 2025-05-07T20:04:07.0115145Z Uploaded bytes 192937984 2025-05-07T20:04:07.3082462Z Uploaded bytes 201326592 2025-05-07T20:04:07.6346646Z Uploaded bytes 209715200 2025-05-07T20:04:07.9625257Z Uploaded bytes 218103808 2025-05-07T20:04:08.2737521Z Uploaded bytes 226492416 2025-05-07T20:04:08.5954642Z Uploaded bytes 234881024 2025-05-07T20:04:08.8496698Z Uploaded bytes 243269632 2025-05-07T20:04:09.1810649Z Uploaded bytes 251658240 2025-05-07T20:04:09.5431047Z Uploaded bytes 260046848 2025-05-07T20:04:09.7359911Z Uploaded bytes 268435456 2025-05-07T20:04:09.9356490Z Uploaded bytes 274683091 2025-05-07T20:04:09.9514507Z Finished uploading artifact content to blob storage! 2025-05-07T20:04:09.9516450Z SHA256 digest of uploaded artifact zip is 1bd8a3c4862bc6ebc56dc3bdddceaed6fea2b8ae2945209e3bb83a088b169c32 2025-05-07T20:04:09.9518537Z Finalizing artifact upload 2025-05-07T20:04:10.0373197Z Artifact fbgemm_default_x86_gcc_py3.13_cu11.8.0.whl.zip successfully finalized. Artifact ID 3081408946 2025-05-07T20:04:10.0374216Z Artifact fbgemm_default_x86_gcc_py3.13_cu11.8.0.whl has been successfully uploaded! Final size is 274683091 bytes. Artifact ID is 3081408946 2025-05-07T20:04:10.0386701Z Artifact download URL: https://github.com/pytorch/FBGEMM/actions/runs/14891846252/artifacts/3081408946 2025-05-07T20:04:10.0623790Z Post job cleanup. 2025-05-07T20:04:10.0629420Z ##[command]/usr/bin/docker exec 3a46c886120491b2b40bc3a625109b99b6e6604702540dfc4f2e17198255ca9b sh -c "cat /etc/*release | grep ^ID" 2025-05-07T20:04:10.3492700Z [command]/usr/bin/git version 2025-05-07T20:04:10.3525914Z git version 2.47.1 2025-05-07T20:04:10.3556706Z Copying '/github/home/.gitconfig' to '/__w/_temp/0c49098b-8002-4b05-ac0a-7a1035a95fe5/.gitconfig' 2025-05-07T20:04:10.3564295Z Temporarily overriding HOME='/__w/_temp/0c49098b-8002-4b05-ac0a-7a1035a95fe5' before making global git config changes 2025-05-07T20:04:10.3565111Z Adding repository directory to the temporary git global config as a safe directory 2025-05-07T20:04:10.3583105Z [command]/usr/bin/git config --global --add safe.directory /__w/FBGEMM/FBGEMM 2025-05-07T20:04:10.3635809Z [command]/usr/bin/git config --local --name-only --get-regexp core\.sshCommand 2025-05-07T20:04:10.3665593Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'core\.sshCommand' && git config --local --unset-all 'core.sshCommand' || :" 2025-05-07T20:04:10.3940799Z Entering 'external/asmjit' 2025-05-07T20:04:10.3992220Z Entering 'external/composable_kernel' 2025-05-07T20:04:10.4059059Z Entering 'external/cpuinfo' 2025-05-07T20:04:10.4114520Z Entering 'external/cutlass' 2025-05-07T20:04:10.4188993Z Entering 'external/googletest' 2025-05-07T20:04:10.4250287Z Entering 'external/hipify_torch' 2025-05-07T20:04:10.4315306Z Entering 'external/json' 2025-05-07T20:04:10.4388326Z [command]/usr/bin/git config --local --name-only --get-regexp http\.https\:\/\/github\.com\/\.extraheader 2025-05-07T20:04:10.4409822Z http.https://github.com/.extraheader 2025-05-07T20:04:10.4415539Z [command]/usr/bin/git config --local --unset-all http.https://github.com/.extraheader 2025-05-07T20:04:10.4441031Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'http\.https\:\/\/github\.com\/\.extraheader' && git config --local --unset-all 'http.https://github.com/.extraheader' || :" 2025-05-07T20:04:10.4710093Z Entering 'external/asmjit' 2025-05-07T20:04:10.4750439Z http.https://github.com/.extraheader 2025-05-07T20:04:10.4787571Z Entering 'external/composable_kernel' 2025-05-07T20:04:10.4820666Z http.https://github.com/.extraheader 2025-05-07T20:04:10.4858189Z Entering 'external/cpuinfo' 2025-05-07T20:04:10.4890295Z http.https://github.com/.extraheader 2025-05-07T20:04:10.4930501Z Entering 'external/cutlass' 2025-05-07T20:04:10.4972195Z http.https://github.com/.extraheader 2025-05-07T20:04:10.5021467Z Entering 'external/googletest' 2025-05-07T20:04:10.5053496Z http.https://github.com/.extraheader 2025-05-07T20:04:10.5090196Z Entering 'external/hipify_torch' 2025-05-07T20:04:10.5124096Z http.https://github.com/.extraheader 2025-05-07T20:04:10.5163822Z Entering 'external/json' 2025-05-07T20:04:10.5210101Z http.https://github.com/.extraheader 2025-05-07T20:04:10.5410898Z Stop and remove container: 1f736eeadaf44e318f4e00a477aace86_amazonlinux2023_ac52de 2025-05-07T20:04:10.5415928Z ##[command]/usr/bin/docker rm --force 3a46c886120491b2b40bc3a625109b99b6e6604702540dfc4f2e17198255ca9b 2025-05-07T20:04:11.2473833Z 3a46c886120491b2b40bc3a625109b99b6e6604702540dfc4f2e17198255ca9b 2025-05-07T20:04:11.2506438Z Remove container network: github_network_0023449067dc45b89da1976825adb551 2025-05-07T20:04:11.2510789Z ##[command]/usr/bin/docker network rm github_network_0023449067dc45b89da1976825adb551 2025-05-07T20:04:12.0537154Z github_network_0023449067dc45b89da1976825adb551 2025-05-07T20:04:12.0571494Z A job completed hook has been configured by the self-hosted runner administrator 2025-05-07T20:04:12.0592056Z ##[group]Run '/home/ec2-user/runner-scripts/after_job.sh' 2025-05-07T20:04:12.0598525Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-05-07T20:04:12.0598987Z ##[endgroup] 2025-05-07T20:04:12.0709678Z [!ALERT!] Swap in detected! [!ALERT!] 2025-05-07T20:04:22.1802713Z [!ALERT!] Swap out detected [!ALERT!] 2025-05-07T20:04:38.2184954Z Cleaning up orphan processes