2025-05-07T19:42:35.1322328Z Current runner version: '2.323.0' 2025-05-07T19:42:35.1327908Z Runner name: 'i-0fa4abaec4f96f332' 2025-05-07T19:42:35.1328802Z Machine name: 'ip-10-0-41-178' 2025-05-07T19:42:35.1331505Z ##[group]GITHUB_TOKEN Permissions 2025-05-07T19:42:35.1333556Z Contents: read 2025-05-07T19:42:35.1334112Z Metadata: read 2025-05-07T19:42:35.1334655Z Packages: read 2025-05-07T19:42:35.1335127Z ##[endgroup] 2025-05-07T19:42:35.1337406Z Secret source: None 2025-05-07T19:42:35.1338349Z Prepare workflow directory 2025-05-07T19:42:35.1950845Z Prepare all required actions 2025-05-07T19:42:35.1987782Z Getting action download info 2025-05-07T19:42:35.4199472Z Download action repository 'actions/checkout@v4' (SHA:11bd71901bbe5b1630ceea73d27597364c9af683) 2025-05-07T19:42:35.6564098Z Download action repository 'actions/upload-artifact@v4' (SHA:ea165f8d65b6e75b540449e92b4886f43607fa02) 2025-05-07T19:42:36.1057371Z Complete job name: build_artifact (x86, linux.24xlarge, default, 3.9, 12.6.3, gcc) 2025-05-07T19:42:36.1890757Z A job started hook has been configured by the self-hosted runner administrator 2025-05-07T19:42:36.2008969Z ##[group]Run '/home/ec2-user/runner-scripts/before_job.sh' 2025-05-07T19:42:36.2019100Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:42:36.2020400Z ##[endgroup] 2025-05-07T19:42:37.2415864Z Runner Type: linux.24xlarge 2025-05-07T19:42:37.2416394Z Instance Type: c5.24xlarge 2025-05-07T19:42:37.2416760Z AMI Name: unknown 2025-05-07T19:42:37.2458957Z AMI ID: ami-071226ecf16aa7d96 2025-05-07T19:42:42.2635159Z ##[group]Checking docker version 2025-05-07T19:42:42.2648019Z ##[command]/usr/bin/docker version --format '{{.Server.APIVersion}}' 2025-05-07T19:42:42.2852993Z '1.44' 2025-05-07T19:42:42.2880960Z Docker daemon API version: '1.44' 2025-05-07T19:42:42.2881493Z ##[command]/usr/bin/docker version --format '{{.Client.APIVersion}}' 2025-05-07T19:42:42.3060760Z '1.44' 2025-05-07T19:42:42.3071823Z Docker client API 
version: '1.44' 2025-05-07T19:42:42.3076732Z ##[endgroup] 2025-05-07T19:42:42.3079273Z ##[group]Clean up resources from previous jobs 2025-05-07T19:42:42.3084275Z ##[command]/usr/bin/docker ps --all --quiet --no-trunc --filter "label=67a4de" 2025-05-07T19:42:42.3241511Z ##[command]/usr/bin/docker network prune --force --filter "label=67a4de" 2025-05-07T19:42:42.3375810Z ##[endgroup] 2025-05-07T19:42:42.3376152Z ##[group]Create local container network 2025-05-07T19:42:42.3385400Z ##[command]/usr/bin/docker network create --label 67a4de github_network_7a3f3acfbff84df5a5ba037f14b56b19 2025-05-07T19:42:42.6209150Z eb32f201063562d697b9b5bb829b362f485e0d51fd71a205b10fcef237e170a0 2025-05-07T19:42:42.6237879Z ##[endgroup] 2025-05-07T19:42:42.6260191Z ##[group]Starting job container 2025-05-07T19:42:42.6278523Z ##[command]/usr/bin/docker pull amazonlinux:2023 2025-05-07T19:42:42.8410784Z 2023: Pulling from library/amazonlinux 2025-05-07T19:42:42.8994385Z 1c3112c87ab2: Pulling fs layer 2025-05-07T19:42:43.4630991Z 1c3112c87ab2: Verifying Checksum 2025-05-07T19:42:43.4631383Z 1c3112c87ab2: Download complete 2025-05-07T19:42:45.2839315Z 1c3112c87ab2: Pull complete 2025-05-07T19:42:45.2919875Z Digest: sha256:cb5b4c509d62ae388f674c139ae5e8281fc160c217d474445e912043e1941988 2025-05-07T19:42:45.2940889Z Status: Downloaded newer image for amazonlinux:2023 2025-05-07T19:42:45.2956729Z docker.io/library/amazonlinux:2023 2025-05-07T19:42:45.3049310Z ##[command]/usr/bin/docker create --name 150a28556a5a407db4177bd68a8ce0e0_amazonlinux2023_748304 --label 67a4de --workdir /__w/FBGEMM/FBGEMM --network github_network_7a3f3acfbff84df5a5ba037f14b56b19 --user root -e "HOME=/github/home" -e GITHUB_ACTIONS=true -e CI=true -v "/var/run/docker.sock":"/var/run/docker.sock" -v "/home/ec2-user/actions-runner/_work":"/__w" -v "/home/ec2-user/actions-runner/externals":"/__e":ro -v "/home/ec2-user/actions-runner/_work/_temp":"/__w/_temp" -v 
"/home/ec2-user/actions-runner/_work/_actions":"/__w/_actions" -v "/home/ec2-user/actions-runner/_work/_tool":"/__w/_tool" -v "/home/ec2-user/actions-runner/_work/_temp/_github_home":"/github/home" -v "/home/ec2-user/actions-runner/_work/_temp/_github_workflow":"/github/workflow" --entrypoint "tail" amazonlinux:2023 "-f" "/dev/null" 2025-05-07T19:42:45.6144934Z 8b1652485f5b621f7f0b78973faa33c57195fcb1b1a66550db5cff7ec6c55546 2025-05-07T19:42:45.6168758Z ##[command]/usr/bin/docker start 8b1652485f5b621f7f0b78973faa33c57195fcb1b1a66550db5cff7ec6c55546 2025-05-07T19:42:46.1370807Z 8b1652485f5b621f7f0b78973faa33c57195fcb1b1a66550db5cff7ec6c55546 2025-05-07T19:42:46.1391897Z ##[command]/usr/bin/docker ps --all --filter id=8b1652485f5b621f7f0b78973faa33c57195fcb1b1a66550db5cff7ec6c55546 --filter status=running --no-trunc --format "{{.ID}} {{.Status}}" 2025-05-07T19:42:46.1556320Z 8b1652485f5b621f7f0b78973faa33c57195fcb1b1a66550db5cff7ec6c55546 Up Less than a second 2025-05-07T19:42:46.1579760Z ##[command]/usr/bin/docker inspect --format "{{range .Config.Env}}{{println .}}{{end}}" 8b1652485f5b621f7f0b78973faa33c57195fcb1b1a66550db5cff7ec6c55546 2025-05-07T19:42:46.1735778Z HOME=/github/home 2025-05-07T19:42:46.1736306Z GITHUB_ACTIONS=true 2025-05-07T19:42:46.1736698Z CI=true 2025-05-07T19:42:46.1737143Z PATH=/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin 2025-05-07T19:42:46.1753064Z ##[endgroup] 2025-05-07T19:42:46.1763426Z ##[group]Waiting for all services to be ready 2025-05-07T19:42:46.1765066Z ##[endgroup] 2025-05-07T19:42:46.1842842Z ##[group]Run yum update -y; yum install -y binutils findutils git pciutils sudo tar wget which 2025-05-07T19:42:46.1843631Z yum update -y; yum install -y binutils findutils git pciutils sudo tar wget which 2025-05-07T19:42:46.1844585Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:42:46.1844988Z env: 2025-05-07T19:42:46.1845293Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:42:46.1845618Z BUILD_ENV: 
build_binary 2025-05-07T19:42:46.1846013Z BUILD_TARGET: default 2025-05-07T19:42:46.1846332Z BUILD_VARIANT: cuda 2025-05-07T19:42:46.1846589Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:42:46.1846969Z ##[endgroup] 2025-05-07T19:42:47.0212991Z Amazon Linux 2023 repository 66 MB/s | 37 MB 00:00 2025-05-07T19:42:53.6820720Z Last metadata expiration check: 0:00:07 ago on Wed May 7 19:42:46 2025. 2025-05-07T19:42:54.2415829Z Dependencies resolved. 2025-05-07T19:42:54.2592219Z Nothing to do. 2025-05-07T19:42:54.2593282Z Complete! 2025-05-07T19:42:54.5063936Z Last metadata expiration check: 0:00:08 ago on Wed May 7 19:42:46 2025. 2025-05-07T19:42:54.5693660Z Dependencies resolved. 2025-05-07T19:42:54.5919840Z ======================================================================================== 2025-05-07T19:42:54.5920418Z Package Arch Version Repository Size 2025-05-07T19:42:54.5921119Z ======================================================================================== 2025-05-07T19:42:54.5921564Z Installing: 2025-05-07T19:42:54.5922036Z binutils x86_64 2.41-50.amzn2023.0.3 amazonlinux 5.3 M 2025-05-07T19:42:54.5922767Z findutils x86_64 1:4.8.0-2.amzn2023.0.2 amazonlinux 539 k 2025-05-07T19:42:54.5923328Z git x86_64 2.47.1-1.amzn2023.0.2 amazonlinux 54 k 2025-05-07T19:42:54.5923950Z pciutils x86_64 3.7.0-3.amzn2023.0.2 amazonlinux 93 k 2025-05-07T19:42:54.5924540Z sudo x86_64 1.9.15-1.p5.amzn2023.0.1 amazonlinux 1.3 M 2025-05-07T19:42:54.5925133Z tar x86_64 2:1.34-1.amzn2023.0.4 amazonlinux 879 k 2025-05-07T19:42:54.5925716Z wget x86_64 1.21.3-1.amzn2023.0.4 amazonlinux 779 k 2025-05-07T19:42:54.5926283Z which x86_64 2.21-26.amzn2023.0.2 amazonlinux 42 k 2025-05-07T19:42:54.5926795Z Installing dependencies: 2025-05-07T19:42:54.5927215Z cracklib x86_64 2.9.6-27.amzn2023.0.2 amazonlinux 82 k 2025-05-07T19:42:54.5927916Z cyrus-sasl-lib x86_64 2.1.27-18.amzn2023.0.3 amazonlinux 786 k 2025-05-07T19:42:54.5928826Z elfutils-debuginfod-client x86_64 0.188-3.amzn2023.0.2 
amazonlinux 41 k 2025-05-07T19:42:54.5929499Z git-core x86_64 2.47.1-1.amzn2023.0.2 amazonlinux 4.7 M 2025-05-07T19:42:54.5930251Z git-core-doc noarch 2.47.1-1.amzn2023.0.2 amazonlinux 2.8 M 2025-05-07T19:42:54.5930838Z gnutls x86_64 3.8.3-6.amzn2023.0.1 amazonlinux 1.1 M 2025-05-07T19:42:54.5931439Z groff-base x86_64 1.22.4-7.amzn2023.0.2 amazonlinux 1.0 M 2025-05-07T19:42:54.5932170Z gzip x86_64 1.12-1.amzn2023.0.1 amazonlinux 160 k 2025-05-07T19:42:54.5932725Z hwdata noarch 0.384-1.amzn2023.0.3 amazonlinux 1.6 M 2025-05-07T19:42:54.5933450Z jansson x86_64 2.14-0.amzn2023 amazonlinux 46 k 2025-05-07T19:42:54.6033727Z kmod-libs x86_64 29-2.amzn2023.0.5 amazonlinux 62 k 2025-05-07T19:42:54.6034249Z less x86_64 608-2.amzn2023.0.2 amazonlinux 168 k 2025-05-07T19:42:54.6035003Z libcbor x86_64 0.7.0-3.amzn2023.0.2 amazonlinux 57 k 2025-05-07T19:42:54.6035527Z libdb x86_64 5.3.28-49.amzn2023.0.2 amazonlinux 756 k 2025-05-07T19:42:54.6036048Z libeconf x86_64 0.4.0-1.amzn2023.0.3 amazonlinux 28 k 2025-05-07T19:42:54.6036636Z libedit x86_64 3.1-38.20210714cvs.amzn2023.0.2 amazonlinux 108 k 2025-05-07T19:42:54.6037194Z libfdisk x86_64 2.37.4-1.amzn2023.0.4 amazonlinux 153 k 2025-05-07T19:42:54.6037769Z libfido2 x86_64 1.10.0-2.amzn2023.0.2 amazonlinux 95 k 2025-05-07T19:42:54.6038435Z libmetalink x86_64 0.1.3-14.amzn2023.0.2 amazonlinux 31 k 2025-05-07T19:42:54.6038982Z libpwquality x86_64 1.4.4-6.amzn2023.0.2 amazonlinux 106 k 2025-05-07T19:42:54.6039570Z libsemanage x86_64 3.4-5.amzn2023.0.2 amazonlinux 121 k 2025-05-07T19:42:54.6040101Z libutempter x86_64 1.2.1-4.amzn2023.0.2 amazonlinux 26 k 2025-05-07T19:42:54.6040617Z nano x86_64 8.3-1.amzn2023 amazonlinux 706 k 2025-05-07T19:42:54.6041118Z ncurses x86_64 6.2-4.20200222.amzn2023.0.6 amazonlinux 394 k 2025-05-07T19:42:54.6041628Z nettle x86_64 3.10.1-1.amzn2023.0.1 amazonlinux 573 k 2025-05-07T19:42:54.6042145Z openldap x86_64 2.4.57-6.amzn2023.0.7 amazonlinux 256 k 2025-05-07T19:42:54.6042650Z openssh x86_64 
8.7p1-8.amzn2023.0.14 amazonlinux 454 k 2025-05-07T19:42:54.6043205Z openssh-clients x86_64 8.7p1-8.amzn2023.0.14 amazonlinux 708 k 2025-05-07T19:42:54.6043736Z pam x86_64 1.5.1-8.amzn2023.0.4 amazonlinux 542 k 2025-05-07T19:42:54.6044257Z pciutils-libs x86_64 3.7.0-3.amzn2023.0.2 amazonlinux 41 k 2025-05-07T19:42:54.6044838Z perl-AutoLoader noarch 5.74-477.amzn2023.0.6 amazonlinux 22 k 2025-05-07T19:42:54.6045369Z perl-B x86_64 1.80-477.amzn2023.0.6 amazonlinux 179 k 2025-05-07T19:42:54.6045911Z perl-Carp noarch 1.50-458.amzn2023.0.2 amazonlinux 29 k 2025-05-07T19:42:54.6046490Z perl-Class-Struct noarch 0.66-477.amzn2023.0.6 amazonlinux 22 k 2025-05-07T19:42:54.6047106Z perl-Data-Dumper x86_64 2.174-460.amzn2023.0.2 amazonlinux 55 k 2025-05-07T19:42:54.6047691Z perl-Digest noarch 1.20-1.amzn2023.0.2 amazonlinux 26 k 2025-05-07T19:42:54.6048402Z perl-Digest-MD5 x86_64 2.58-2.amzn2023.0.2 amazonlinux 36 k 2025-05-07T19:42:54.6048993Z perl-DynaLoader x86_64 1.47-477.amzn2023.0.6 amazonlinux 26 k 2025-05-07T19:42:54.6049562Z perl-Encode x86_64 4:3.15-462.amzn2023.0.2 amazonlinux 1.7 M 2025-05-07T19:42:54.6050113Z perl-Errno x86_64 1.30-477.amzn2023.0.6 amazonlinux 15 k 2025-05-07T19:42:54.6050659Z perl-Error noarch 1:0.17029-5.amzn2023.0.2 amazonlinux 41 k 2025-05-07T19:42:54.6051248Z perl-Exporter noarch 5.74-459.amzn2023.0.2 amazonlinux 31 k 2025-05-07T19:42:54.6051818Z perl-Fcntl x86_64 1.13-477.amzn2023.0.6 amazonlinux 21 k 2025-05-07T19:42:54.6052393Z perl-File-Basename noarch 2.85-477.amzn2023.0.6 amazonlinux 18 k 2025-05-07T19:42:54.6053054Z perl-File-Find noarch 1.37-477.amzn2023.0.6 amazonlinux 26 k 2025-05-07T19:42:54.6053730Z perl-File-Path noarch 2.18-2.amzn2023.0.2 amazonlinux 36 k 2025-05-07T19:42:54.6054364Z perl-File-Temp noarch 1:0.231.100-2.amzn2023.0.2 amazonlinux 60 k 2025-05-07T19:42:54.6056180Z perl-File-stat noarch 1.09-477.amzn2023.0.6 amazonlinux 17 k 2025-05-07T19:42:54.6056800Z perl-FileHandle noarch 2.03-477.amzn2023.0.6 amazonlinux 16 k 
2025-05-07T19:42:54.6057427Z perl-Getopt-Long noarch 1:2.52-2.amzn2023.0.2 amazonlinux 60 k 2025-05-07T19:42:54.6058033Z perl-Getopt-Std noarch 1.12-477.amzn2023.0.6 amazonlinux 16 k 2025-05-07T19:42:54.6058618Z perl-Git noarch 2.47.1-1.amzn2023.0.2 amazonlinux 42 k 2025-05-07T19:42:54.6059280Z perl-HTTP-Tiny noarch 0.078-1.amzn2023.0.3 amazonlinux 56 k 2025-05-07T19:42:54.6059849Z perl-IO x86_64 1.43-477.amzn2023.0.6 amazonlinux 87 k 2025-05-07T19:42:54.6060420Z perl-IPC-Open3 noarch 1.21-477.amzn2023.0.6 amazonlinux 23 k 2025-05-07T19:42:54.6061008Z perl-MIME-Base64 x86_64 3.16-2.amzn2023.0.2 amazonlinux 31 k 2025-05-07T19:42:54.6061602Z perl-Net-SSLeay x86_64 1.94-1.amzn2023.0.1 amazonlinux 392 k 2025-05-07T19:42:54.6062160Z perl-POSIX x86_64 1.94-477.amzn2023.0.6 amazonlinux 97 k 2025-05-07T19:42:54.6062737Z perl-PathTools x86_64 3.78-459.amzn2023.0.2 amazonlinux 85 k 2025-05-07T19:42:54.6063343Z perl-Pod-Escapes noarch 1:1.07-458.amzn2023.0.2 amazonlinux 20 k 2025-05-07T19:42:54.6063962Z perl-Pod-Perldoc noarch 3.28.01-459.amzn2023.0.3 amazonlinux 84 k 2025-05-07T19:42:54.6064580Z perl-Pod-Simple noarch 1:3.42-2.amzn2023.0.2 amazonlinux 215 k 2025-05-07T19:42:54.6065175Z perl-Pod-Usage noarch 4:2.01-2.amzn2023.0.2 amazonlinux 41 k 2025-05-07T19:42:54.6065800Z perl-Scalar-List-Utils x86_64 4:1.56-459.amzn2023.0.2 amazonlinux 71 k 2025-05-07T19:42:54.6066426Z perl-SelectSaver noarch 1.02-477.amzn2023.0.6 amazonlinux 12 k 2025-05-07T19:42:54.6067019Z perl-Socket x86_64 4:2.032-1.amzn2023.0.2 amazonlinux 55 k 2025-05-07T19:42:54.6067581Z perl-Storable x86_64 1:3.21-458.amzn2023.0.2 amazonlinux 96 k 2025-05-07T19:42:54.6068145Z perl-Symbol noarch 1.08-477.amzn2023.0.6 amazonlinux 15 k 2025-05-07T19:42:54.6068758Z perl-Term-ANSIColor noarch 5.01-459.amzn2023.0.2 amazonlinux 48 k 2025-05-07T19:42:54.6069363Z perl-Term-Cap noarch 1.17-458.amzn2023.0.2 amazonlinux 22 k 2025-05-07T19:42:54.6069963Z perl-TermReadKey x86_64 2.38-9.amzn2023.0.2 amazonlinux 36 k 
2025-05-07T19:42:54.6070684Z perl-Text-ParseWords noarch 3.30-458.amzn2023.0.2 amazonlinux 17 k 2025-05-07T19:42:54.6071326Z perl-Text-Tabs+Wrap noarch 2021.0726-1.amzn2023.0.1 amazonlinux 22 k 2025-05-07T19:42:54.6072059Z perl-Time-Local noarch 2:1.300-5.amzn2023.0.2 amazonlinux 34 k 2025-05-07T19:42:54.6072610Z perl-URI noarch 5.09-1.amzn2023.0.2 amazonlinux 108 k 2025-05-07T19:42:54.6073152Z perl-base noarch 2.27-477.amzn2023.0.6 amazonlinux 17 k 2025-05-07T19:42:54.6073704Z perl-constant noarch 1.33-459.amzn2023.0.2 amazonlinux 23 k 2025-05-07T19:42:54.6074263Z perl-if noarch 0.60.800-477.amzn2023.0.6 amazonlinux 14 k 2025-05-07T19:42:54.6074829Z perl-interpreter x86_64 4:5.32.1-477.amzn2023.0.6 amazonlinux 71 k 2025-05-07T19:42:54.6075367Z perl-lib x86_64 0.65-477.amzn2023.0.6 amazonlinux 15 k 2025-05-07T19:42:54.6075900Z perl-libnet noarch 3.13-2.amzn2023.0.2 amazonlinux 126 k 2025-05-07T19:42:54.6076420Z perl-libs x86_64 4:5.32.1-477.amzn2023.0.6 amazonlinux 2.0 M 2025-05-07T19:42:54.6076986Z perl-mro x86_64 1.23-477.amzn2023.0.6 amazonlinux 29 k 2025-05-07T19:42:54.6077520Z perl-overload noarch 1.31-477.amzn2023.0.6 amazonlinux 46 k 2025-05-07T19:42:54.6078092Z perl-overloading noarch 0.02-477.amzn2023.0.6 amazonlinux 13 k 2025-05-07T19:42:54.6078670Z perl-parent noarch 1:0.238-458.amzn2023.0.2 amazonlinux 14 k 2025-05-07T19:42:54.6079223Z perl-podlators noarch 1:4.14-458.amzn2023.0.2 amazonlinux 112 k 2025-05-07T19:42:54.6079778Z perl-subs noarch 1.03-477.amzn2023.0.6 amazonlinux 12 k 2025-05-07T19:42:54.6080320Z perl-vars noarch 1.05-477.amzn2023.0.6 amazonlinux 13 k 2025-05-07T19:42:54.6080844Z shadow-utils x86_64 2:4.9-12.amzn2023.0.4 amazonlinux 1.1 M 2025-05-07T19:42:54.6081391Z systemd-libs x86_64 252.23-3.amzn2023 amazonlinux 613 k 2025-05-07T19:42:54.6082047Z util-linux x86_64 2.37.4-1.amzn2023.0.4 amazonlinux 2.2 M 2025-05-07T19:42:54.6082781Z util-linux-core x86_64 2.37.4-1.amzn2023.0.4 amazonlinux 432 k 2025-05-07T19:42:54.6083243Z Installing 
weak dependencies: 2025-05-07T19:42:54.6083714Z nano-default-editor noarch 8.3-1.amzn2023 amazonlinux 10 k 2025-05-07T19:42:54.6084338Z perl-IO-Socket-IP noarch 0.41-3.amzn2023.0.2 amazonlinux 42 k 2025-05-07T19:42:54.6084941Z perl-IO-Socket-SSL noarch 2.075-1.amzn2023.0.2 amazonlinux 218 k 2025-05-07T19:42:54.6085558Z perl-Mozilla-CA noarch 20200520-4.amzn2023.0.2 amazonlinux 13 k 2025-05-07T19:42:54.6086133Z perl-NDBM_File x86_64 1.15-477.amzn2023.0.6 amazonlinux 23 k 2025-05-07T19:42:54.6086723Z sudo-python-plugin x86_64 1.9.15-1.p5.amzn2023.0.1 amazonlinux 56 k 2025-05-07T19:42:54.6087077Z 2025-05-07T19:42:54.6087191Z Transaction Summary 2025-05-07T19:42:54.6087467Z ======================================================================================== 2025-05-07T19:42:54.6087814Z Install 107 Packages 2025-05-07T19:42:54.6087965Z 2025-05-07T19:42:54.6088114Z Total download size: 38 M 2025-05-07T19:42:54.6088387Z Installed size: 151 M 2025-05-07T19:42:54.6088633Z Downloading Packages: 2025-05-07T19:42:54.8917714Z (1/107): cracklib-2.9.6-27.amzn2023.0.2.x86_64. 4.1 MB/s | 82 kB 00:00 2025-05-07T19:42:54.8994093Z (2/107): elfutils-debuginfod-client-0.188-3.amz 6.5 MB/s | 41 kB 00:00 2025-05-07T19:42:54.9085326Z (3/107): cyrus-sasl-lib-2.1.27-18.amzn2023.0.3. 21 MB/s | 786 kB 00:00 2025-05-07T19:42:54.9341373Z (4/107): binutils-2.41-50.amzn2023.0.3.x86_64.r 84 MB/s | 5.3 MB 00:00 2025-05-07T19:42:54.9388509Z (5/107): findutils-4.8.0-2.amzn2023.0.2.x86_64. 14 MB/s | 539 kB 00:00 2025-05-07T19:42:54.9417396Z (6/107): git-2.47.1-1.amzn2023.0.2.x86_64.rpm 1.7 MB/s | 54 kB 00:00 2025-05-07T19:42:54.9602911Z (7/107): gnutls-3.8.3-6.amzn2023.0.1.x86_64.rpm 59 MB/s | 1.1 MB 00:00 2025-05-07T19:42:54.9790215Z (8/107): git-core-doc-2.47.1-1.amzn2023.0.2.noa 70 MB/s | 2.8 MB 00:00 2025-05-07T19:42:55.0041296Z (9/107): git-core-2.47.1-1.amzn2023.0.2.x86_64. 
73 MB/s | 4.7 MB 00:00 2025-05-07T19:42:55.0098916Z (10/107): groff-base-1.22.4-7.amzn2023.0.2.x86_ 23 MB/s | 1.0 MB 00:00 2025-05-07T19:42:55.0132525Z (11/107): gzip-1.12-1.amzn2023.0.1.x86_64.rpm 5.0 MB/s | 160 kB 00:00 2025-05-07T19:42:55.0165273Z (12/107): jansson-2.14-0.amzn2023.x86_64.rpm 7.6 MB/s | 46 kB 00:00 2025-05-07T19:42:55.0199589Z (13/107): kmod-libs-29-2.amzn2023.0.5.x86_64.rp 9.6 MB/s | 62 kB 00:00 2025-05-07T19:42:55.0305607Z (14/107): hwdata-0.384-1.amzn2023.0.3.noarch.rp 80 MB/s | 1.6 MB 00:00 2025-05-07T19:42:55.0337963Z (15/107): less-608-2.amzn2023.0.2.x86_64.rpm 10 MB/s | 168 kB 00:00 2025-05-07T19:42:55.0346114Z (16/107): libcbor-0.7.0-3.amzn2023.0.2.x86_64.r 3.9 MB/s | 57 kB 00:00 2025-05-07T19:42:55.0441429Z (17/107): libdb-5.3.28-49.amzn2023.0.2.x86_64.r 59 MB/s | 756 kB 00:00 2025-05-07T19:42:55.0461216Z (18/107): libeconf-0.4.0-1.amzn2023.0.3.x86_64. 2.5 MB/s | 28 kB 00:00 2025-05-07T19:42:55.0484179Z (19/107): libedit-3.1-38.20210714cvs.amzn2023.0 8.1 MB/s | 108 kB 00:00 2025-05-07T19:42:55.0519894Z (20/107): libfdisk-2.37.4-1.amzn2023.0.4.x86_64 21 MB/s | 153 kB 00:00 2025-05-07T19:42:55.0561089Z (21/107): libfido2-1.10.0-2.amzn2023.0.2.x86_64 13 MB/s | 95 kB 00:00 2025-05-07T19:42:55.0573144Z (22/107): libmetalink-0.1.3-14.amzn2023.0.2.x86 3.6 MB/s | 31 kB 00:00 2025-05-07T19:42:55.0602586Z (23/107): libpwquality-1.4.4-6.amzn2023.0.2.x86 13 MB/s | 106 kB 00:00 2025-05-07T19:42:55.0650727Z (24/107): libutempter-1.2.1-4.amzn2023.0.2.x86_ 3.8 MB/s | 26 kB 00:00 2025-05-07T19:42:55.0709675Z (25/107): nano-8.3-1.amzn2023.x86_64.rpm 64 MB/s | 706 kB 00:00 2025-05-07T19:42:55.0740118Z (26/107): libsemanage-3.4-5.amzn2023.0.2.x86_64 7.5 MB/s | 121 kB 00:00 2025-05-07T19:42:55.0753314Z (27/107): nano-default-editor-8.3-1.amzn2023.no 1.1 MB/s | 10 kB 00:00 2025-05-07T19:42:55.0824686Z (28/107): ncurses-6.2-4.20200222.amzn2023.0.6.x 35 MB/s | 394 kB 00:00 2025-05-07T19:42:55.0880360Z (29/107): nettle-3.10.1-1.amzn2023.0.1.x86_64.r 41 MB/s | 
573 kB 00:00 2025-05-07T19:42:55.0908458Z (30/107): openldap-2.4.57-6.amzn2023.0.7.x86_64 16 MB/s | 256 kB 00:00 2025-05-07T19:42:55.0955427Z (31/107): openssh-8.7p1-8.amzn2023.0.14.x86_64. 39 MB/s | 454 kB 00:00 2025-05-07T19:42:55.1047168Z (32/107): openssh-clients-8.7p1-8.amzn2023.0.14 56 MB/s | 708 kB 00:00 2025-05-07T19:42:55.1092638Z (33/107): pam-1.5.1-8.amzn2023.0.4.x86_64.rpm 31 MB/s | 542 kB 00:00 2025-05-07T19:42:55.1108118Z (34/107): pciutils-3.7.0-3.amzn2023.0.2.x86_64. 5.9 MB/s | 93 kB 00:00 2025-05-07T19:42:55.1131726Z (35/107): pciutils-libs-3.7.0-3.amzn2023.0.2.x8 5.3 MB/s | 41 kB 00:00 2025-05-07T19:42:55.1181365Z (36/107): perl-Carp-1.50-458.amzn2023.0.2.noarc 5.8 MB/s | 29 kB 00:00 2025-05-07T19:42:55.1212423Z (37/107): perl-AutoLoader-5.74-477.amzn2023.0.6 2.3 MB/s | 22 kB 00:00 2025-05-07T19:42:55.1243464Z (38/107): perl-B-1.80-477.amzn2023.0.6.x86_64.r 14 MB/s | 179 kB 00:00 2025-05-07T19:42:55.1263316Z (39/107): perl-Class-Struct-0.66-477.amzn2023.0 3.0 MB/s | 22 kB 00:00 2025-05-07T19:42:55.1292903Z (40/107): perl-Data-Dumper-2.174-460.amzn2023.0 11 MB/s | 55 kB 00:00 2025-05-07T19:42:55.1318513Z (41/107): perl-Digest-1.20-1.amzn2023.0.2.noarc 3.7 MB/s | 26 kB 00:00 2025-05-07T19:42:55.1341261Z (42/107): perl-Digest-MD5-2.58-2.amzn2023.0.2.x 5.0 MB/s | 36 kB 00:00 2025-05-07T19:42:55.1366586Z (43/107): perl-DynaLoader-1.47-477.amzn2023.0.6 3.8 MB/s | 26 kB 00:00 2025-05-07T19:42:55.1403842Z (44/107): perl-Errno-1.30-477.amzn2023.0.6.x86_ 2.7 MB/s | 15 kB 00:00 2025-05-07T19:42:55.1517077Z (45/107): perl-Encode-3.15-462.amzn2023.0.2.x86 86 MB/s | 1.7 MB 00:00 2025-05-07T19:42:55.1538687Z (46/107): perl-Error-0.17029-5.amzn2023.0.2.noa 2.4 MB/s | 41 kB 00:00 2025-05-07T19:42:55.1550467Z (47/107): perl-Exporter-5.74-459.amzn2023.0.2.n 2.3 MB/s | 31 kB 00:00 2025-05-07T19:42:55.1576128Z (48/107): perl-Fcntl-1.13-477.amzn2023.0.6.x86_ 3.7 MB/s | 21 kB 00:00 2025-05-07T19:42:55.1611209Z (49/107): perl-File-Basename-2.85-477.amzn2023. 
3.5 MB/s | 18 kB 00:00 2025-05-07T19:42:55.1654787Z (50/107): perl-File-Path-2.18-2.amzn2023.0.2.no 4.6 MB/s | 36 kB 00:00 2025-05-07T19:42:55.1683772Z (51/107): perl-File-Temp-0.231.100-2.amzn2023.0 8.6 MB/s | 60 kB 00:00 2025-05-07T19:42:55.1719348Z (52/107): perl-File-stat-1.09-477.amzn2023.0.6. 2.9 MB/s | 17 kB 00:00 2025-05-07T19:42:55.1741907Z (53/107): perl-File-Find-1.37-477.amzn2023.0.6. 1.4 MB/s | 26 kB 00:00 2025-05-07T19:42:55.1765542Z (54/107): perl-FileHandle-2.03-477.amzn2023.0.6 2.0 MB/s | 16 kB 00:00 2025-05-07T19:42:55.1784863Z (55/107): perl-Getopt-Long-2.52-2.amzn2023.0.2. 9.5 MB/s | 60 kB 00:00 2025-05-07T19:42:55.1807536Z (56/107): perl-Getopt-Std-1.12-477.amzn2023.0.6 2.6 MB/s | 16 kB 00:00 2025-05-07T19:42:55.1823030Z (57/107): perl-Git-2.47.1-1.amzn2023.0.2.noarch 7.5 MB/s | 42 kB 00:00 2025-05-07T19:42:55.1854886Z (58/107): perl-HTTP-Tiny-0.078-1.amzn2023.0.3.n 8.7 MB/s | 56 kB 00:00 2025-05-07T19:42:55.1877741Z (59/107): perl-IO-1.43-477.amzn2023.0.6.x86_64. 13 MB/s | 87 kB 00:00 2025-05-07T19:42:55.1890441Z (60/107): perl-IO-Socket-IP-0.41-3.amzn2023.0.2 6.4 MB/s | 42 kB 00:00 2025-05-07T19:42:55.1928998Z (61/107): perl-IO-Socket-SSL-2.075-1.amzn2023.0 31 MB/s | 218 kB 00:00 2025-05-07T19:42:55.1950440Z (62/107): perl-IPC-Open3-1.21-477.amzn2023.0.6. 3.2 MB/s | 23 kB 00:00 2025-05-07T19:42:55.1972967Z (63/107): perl-MIME-Base64-3.16-2.amzn2023.0.2. 4.0 MB/s | 31 kB 00:00 2025-05-07T19:42:55.2001494Z (64/107): perl-Mozilla-CA-20200520-4.amzn2023.0 2.0 MB/s | 13 kB 00:00 2025-05-07T19:42:55.2029623Z (65/107): perl-NDBM_File-1.15-477.amzn2023.0.6. 4.1 MB/s | 23 kB 00:00 2025-05-07T19:42:55.2081304Z (66/107): perl-Net-SSLeay-1.94-1.amzn2023.0.1.x 38 MB/s | 392 kB 00:00 2025-05-07T19:42:55.2102589Z (67/107): perl-POSIX-1.94-477.amzn2023.0.6.x86_ 9.7 MB/s | 97 kB 00:00 2025-05-07T19:42:55.2131438Z (68/107): perl-PathTools-3.78-459.amzn2023.0.2. 9.8 MB/s | 85 kB 00:00 2025-05-07T19:42:55.2156476Z (69/107): perl-Pod-Escapes-1.07-458.amzn2023.0. 
4.4 MB/s | 20 kB 00:00 2025-05-07T19:42:55.2199156Z (70/107): perl-Pod-Perldoc-3.28.01-459.amzn2023 9.6 MB/s | 84 kB 00:00 2025-05-07T19:42:55.2222939Z (71/107): perl-Pod-Usage-2.01-2.amzn2023.0.2.no 6.3 MB/s | 41 kB 00:00 2025-05-07T19:42:55.2261630Z (72/107): perl-Pod-Simple-3.42-2.amzn2023.0.2.n 17 MB/s | 215 kB 00:00 2025-05-07T19:42:55.2284380Z (73/107): perl-Scalar-List-Utils-1.56-459.amzn2 8.9 MB/s | 71 kB 00:00 2025-05-07T19:42:55.2294510Z (74/107): perl-SelectSaver-1.02-477.amzn2023.0. 1.9 MB/s | 12 kB 00:00 2025-05-07T19:42:55.2325714Z (75/107): perl-Socket-2.032-1.amzn2023.0.2.x86_ 9.6 MB/s | 55 kB 00:00 2025-05-07T19:42:55.2357755Z (76/107): perl-Storable-3.21-458.amzn2023.0.2.x 19 MB/s | 96 kB 00:00 2025-05-07T19:42:55.2375017Z (77/107): perl-Symbol-1.08-477.amzn2023.0.6.noa 2.2 MB/s | 15 kB 00:00 2025-05-07T19:42:55.2400104Z (78/107): perl-Term-ANSIColor-5.01-459.amzn2023 6.8 MB/s | 48 kB 00:00 2025-05-07T19:42:55.2428708Z (79/107): perl-Term-Cap-1.17-458.amzn2023.0.2.n 3.2 MB/s | 22 kB 00:00 2025-05-07T19:42:55.2452879Z (80/107): perl-TermReadKey-2.38-9.amzn2023.0.2. 5.2 MB/s | 36 kB 00:00 2025-05-07T19:42:55.2463812Z (81/107): perl-Text-ParseWords-3.30-458.amzn202 2.6 MB/s | 17 kB 00:00 2025-05-07T19:42:55.2494045Z (82/107): perl-Text-Tabs+Wrap-2021.0726-1.amzn2 3.8 MB/s | 22 kB 00:00 2025-05-07T19:42:55.2534994Z (83/107): perl-Time-Local-1.300-5.amzn2023.0.2. 5.4 MB/s | 34 kB 00:00 2025-05-07T19:42:55.2559191Z (84/107): perl-URI-5.09-1.amzn2023.0.2.noarch.r 13 MB/s | 108 kB 00:00 2025-05-07T19:42:55.2581535Z (85/107): perl-base-2.27-477.amzn2023.0.6.noarc 2.0 MB/s | 17 kB 00:00 2025-05-07T19:42:55.2603048Z (86/107): perl-constant-1.33-459.amzn2023.0.2.n 3.7 MB/s | 23 kB 00:00 2025-05-07T19:42:55.2621747Z (87/107): perl-if-0.60.800-477.amzn2023.0.6.noa 2.5 MB/s | 14 kB 00:00 2025-05-07T19:42:55.2653568Z (88/107): perl-interpreter-5.32.1-477.amzn2023. 
11 MB/s | 71 kB 00:00 2025-05-07T19:42:55.2662980Z (89/107): perl-lib-0.65-477.amzn2023.0.6.x86_64 2.6 MB/s | 15 kB 00:00 2025-05-07T19:42:55.2692737Z (90/107): perl-libnet-3.13-2.amzn2023.0.2.noarc 18 MB/s | 126 kB 00:00 2025-05-07T19:42:55.2719287Z (91/107): perl-mro-1.23-477.amzn2023.0.6.x86_64 6.5 MB/s | 29 kB 00:00 2025-05-07T19:42:55.2859422Z (92/107): perl-libs-5.32.1-477.amzn2023.0.6.x86 111 MB/s | 2.0 MB 00:00 2025-05-07T19:42:55.2878770Z (93/107): perl-overload-1.31-477.amzn2023.0.6.n 2.5 MB/s | 46 kB 00:00 2025-05-07T19:42:55.2886702Z (94/107): perl-overloading-0.02-477.amzn2023.0. 936 kB/s | 13 kB 00:00 2025-05-07T19:42:55.2905548Z (95/107): perl-parent-0.238-458.amzn2023.0.2.no 3.2 MB/s | 14 kB 00:00 2025-05-07T19:42:55.2950737Z (96/107): perl-podlators-4.14-458.amzn2023.0.2. 19 MB/s | 112 kB 00:00 2025-05-07T19:42:55.2967870Z (97/107): perl-subs-1.03-477.amzn2023.0.6.noarc 1.6 MB/s | 12 kB 00:00 2025-05-07T19:42:55.2984345Z (98/107): perl-vars-1.05-477.amzn2023.0.6.noarc 1.8 MB/s | 13 kB 00:00 2025-05-07T19:42:55.3037276Z (99/107): sudo-python-plugin-1.9.15-1.p5.amzn20 12 MB/s | 56 kB 00:00 2025-05-07T19:42:55.3117056Z (100/107): sudo-1.9.15-1.p5.amzn2023.0.1.x86_64 88 MB/s | 1.3 MB 00:00 2025-05-07T19:42:55.3167195Z (101/107): systemd-libs-252.23-3.amzn2023.x86_6 52 MB/s | 613 kB 00:00 2025-05-07T19:42:55.3251161Z (102/107): tar-1.34-1.amzn2023.0.4.x86_64.rpm 70 MB/s | 879 kB 00:00 2025-05-07T19:42:55.3345344Z (103/107): shadow-utils-4.9-12.amzn2023.0.4.x86 30 MB/s | 1.1 MB 00:00 2025-05-07T19:42:55.3455764Z (104/107): util-linux-2.37.4-1.amzn2023.0.4.x86 81 MB/s | 2.2 MB 00:00 2025-05-07T19:42:55.3494316Z (105/107): util-linux-core-2.37.4-1.amzn2023.0. 
20 MB/s | 432 kB 00:00 2025-05-07T19:42:55.3549333Z (106/107): wget-1.21.3-1.amzn2023.0.4.x86_64.rp 40 MB/s | 779 kB 00:00 2025-05-07T19:42:55.3568855Z (107/107): which-2.21-26.amzn2023.0.2.x86_64.rp 7.1 MB/s | 42 kB 00:00 2025-05-07T19:42:55.3586016Z -------------------------------------------------------------------------------- 2025-05-07T19:42:55.3586948Z Total 50 MB/s | 38 MB 00:00 2025-05-07T19:42:56.4280246Z Running transaction check 2025-05-07T19:42:56.4759425Z Transaction check succeeded. 2025-05-07T19:42:56.4759770Z Running transaction test 2025-05-07T19:42:56.8495914Z Transaction test succeeded. 2025-05-07T19:42:56.8496822Z Running transaction 2025-05-07T19:42:57.5671867Z Preparing : 1/1 2025-05-07T19:42:57.5818829Z Installing : systemd-libs-252.23-3.amzn2023.x86_64 1/107 2025-05-07T19:42:57.6062704Z Installing : nettle-3.10.1-1.amzn2023.0.1.x86_64 2/107 2025-05-07T19:42:57.6254095Z Installing : gnutls-3.8.3-6.amzn2023.0.1.x86_64 3/107 2025-05-07T19:42:57.6302571Z Installing : util-linux-core-2.37.4-1.amzn2023.0.4.x86_64 4/107 2025-05-07T19:42:57.6376557Z Running scriptlet: util-linux-core-2.37.4-1.amzn2023.0.4.x86_64 4/107 2025-05-07T19:42:57.6462108Z Installing : pciutils-libs-3.7.0-3.amzn2023.0.2.x86_64 5/107 2025-05-07T19:42:57.6738165Z Installing : ncurses-6.2-4.20200222.amzn2023.0.6.x86_64 6/107 2025-05-07T19:42:57.6796895Z Installing : nano-8.3-1.amzn2023.x86_64 7/107 2025-05-07T19:42:57.6846555Z Installing : nano-default-editor-8.3-1.amzn2023.noarch 8/107 2025-05-07T19:42:57.7356797Z Installing : libsemanage-3.4-5.amzn2023.0.2.x86_64 9/107 2025-05-07T19:42:57.7417309Z Installing : shadow-utils-2:4.9-12.amzn2023.0.4.x86_64 10/107 2025-05-07T19:42:57.7699861Z Running scriptlet: libutempter-1.2.1-4.amzn2023.0.2.x86_64 11/107 2025-05-07T19:42:57.7745704Z Installing : libutempter-1.2.1-4.amzn2023.0.2.x86_64 11/107 2025-05-07T19:42:57.7799805Z Installing : libmetalink-0.1.3-14.amzn2023.0.2.x86_64 12/107 2025-05-07T19:42:57.7854273Z Installing : 
libfdisk-2.37.4-1.amzn2023.0.4.x86_64 13/107 2025-05-07T19:42:57.7901619Z Installing : libedit-3.1-38.20210714cvs.amzn2023.0.2.x86_64 14/107 2025-05-07T19:42:57.8028703Z Installing : libeconf-0.4.0-1.amzn2023.0.3.x86_64 15/107 2025-05-07T19:42:57.8078376Z Installing : libdb-5.3.28-49.amzn2023.0.2.x86_64 16/107 2025-05-07T19:42:57.8130243Z Installing : libcbor-0.7.0-3.amzn2023.0.2.x86_64 17/107 2025-05-07T19:42:57.8190658Z Installing : libfido2-1.10.0-2.amzn2023.0.2.x86_64 18/107 2025-05-07T19:42:57.8246037Z Installing : less-608-2.amzn2023.0.2.x86_64 19/107 2025-05-07T19:42:57.8286124Z Installing : kmod-libs-29-2.amzn2023.0.5.x86_64 20/107 2025-05-07T19:42:57.8708569Z Installing : jansson-2.14-0.amzn2023.x86_64 21/107 2025-05-07T19:42:57.8784222Z Installing : hwdata-0.384-1.amzn2023.0.3.noarch 22/107 2025-05-07T19:42:57.8934561Z Installing : gzip-1.12-1.amzn2023.0.1.x86_64 23/107 2025-05-07T19:42:57.9364279Z Installing : cracklib-2.9.6-27.amzn2023.0.2.x86_64 24/107 2025-05-07T19:42:57.9526411Z Installing : pam-1.5.1-8.amzn2023.0.4.x86_64 25/107 2025-05-07T19:42:58.0343076Z Installing : libpwquality-1.4.4-6.amzn2023.0.2.x86_64 26/107 2025-05-07T19:42:58.0344804Z Installing : util-linux-2.37.4-1.amzn2023.0.4.x86_64 27/107 2025-05-07T19:42:58.0346211Z warning: /etc/adjtime created as /etc/adjtime.rpmnew 2025-05-07T19:42:58.0346974Z 2025-05-07T19:42:58.0541358Z Running scriptlet: util-linux-2.37.4-1.amzn2023.0.4.x86_64 27/107 2025-05-07T19:42:58.0819716Z Running scriptlet: openssh-8.7p1-8.amzn2023.0.14.x86_64 28/107 2025-05-07T19:42:58.0998314Z Installing : openssh-8.7p1-8.amzn2023.0.14.x86_64 28/107 2025-05-07T19:42:58.1049615Z Installing : openssh-clients-8.7p1-8.amzn2023.0.14.x86_64 29/107 2025-05-07T19:42:58.2176545Z Running scriptlet: openssh-clients-8.7p1-8.amzn2023.0.14.x86_64 29/107 2025-05-07T19:42:58.3665252Z Installing : git-core-2.47.1-1.amzn2023.0.2.x86_64 30/107 2025-05-07T19:42:58.3774326Z Installing : git-core-doc-2.47.1-1.amzn2023.0.2.noarch 31/107 
2025-05-07T19:42:58.4200876Z Running scriptlet: groff-base-1.22.4-7.amzn2023.0.2.x86_64 32/107 2025-05-07T19:42:58.4258473Z Installing : groff-base-1.22.4-7.amzn2023.0.2.x86_64 32/107 2025-05-07T19:42:58.4336650Z Running scriptlet: groff-base-1.22.4-7.amzn2023.0.2.x86_64 32/107 2025-05-07T19:42:58.4384472Z Installing : perl-Digest-1.20-1.amzn2023.0.2.noarch 33/107 2025-05-07T19:42:58.4461843Z Installing : perl-Digest-MD5-2.58-2.amzn2023.0.2.x86_64 34/107 2025-05-07T19:42:58.4503084Z Installing : perl-B-1.80-477.amzn2023.0.6.x86_64 35/107 2025-05-07T19:42:58.4542940Z Installing : perl-FileHandle-2.03-477.amzn2023.0.6.noarch 36/107 2025-05-07T19:42:58.4583031Z Installing : perl-AutoLoader-5.74-477.amzn2023.0.6.noarch 37/107 2025-05-07T19:42:58.4659940Z Installing : perl-Data-Dumper-2.174-460.amzn2023.0.2.x86_64 38/107 2025-05-07T19:42:58.4707474Z Installing : perl-libnet-3.13-2.amzn2023.0.2.noarch 39/107 2025-05-07T19:42:58.4793379Z Installing : perl-base-2.27-477.amzn2023.0.6.noarch 40/107 2025-05-07T19:42:58.4983141Z Installing : perl-URI-5.09-1.amzn2023.0.2.noarch 41/107 2025-05-07T19:42:58.5045336Z Installing : perl-Net-SSLeay-1.94-1.amzn2023.0.1.x86_64 42/107 2025-05-07T19:42:58.5083679Z Installing : perl-Text-Tabs+Wrap-2021.0726-1.amzn2023.0.1.noa 43/107 2025-05-07T19:42:58.5122159Z Installing : perl-Mozilla-CA-20200520-4.amzn2023.0.2.noarch 44/107 2025-05-07T19:42:58.5170827Z Installing : perl-if-0.60.800-477.amzn2023.0.6.noarch 45/107 2025-05-07T19:42:58.5222258Z Installing : perl-IO-Socket-IP-0.41-3.amzn2023.0.2.noarch 46/107 2025-05-07T19:42:58.5280494Z Installing : perl-Time-Local-2:1.300-5.amzn2023.0.2.noarch 47/107 2025-05-07T19:42:58.5373788Z Installing : perl-File-Path-2.18-2.amzn2023.0.2.noarch 48/107 2025-05-07T19:42:58.5440013Z Installing : perl-IO-Socket-SSL-2.075-1.amzn2023.0.2.noarch 49/107 2025-05-07T19:42:58.5477872Z Installing : perl-Pod-Escapes-1:1.07-458.amzn2023.0.2.noarch 50/107 2025-05-07T19:42:58.5538170Z Installing : 
perl-Class-Struct-0.66-477.amzn2023.0.6.noarch 51/107 2025-05-07T19:42:58.5592067Z Installing : perl-POSIX-1.94-477.amzn2023.0.6.x86_64 52/107 2025-05-07T19:42:58.5640021Z Installing : perl-Term-ANSIColor-5.01-459.amzn2023.0.2.noarch 53/107 2025-05-07T19:42:58.5675444Z Installing : perl-IPC-Open3-1.21-477.amzn2023.0.6.noarch 54/107 2025-05-07T19:42:58.5727754Z Installing : perl-subs-1.03-477.amzn2023.0.6.noarch 55/107 2025-05-07T19:42:58.5788278Z Installing : perl-File-Temp-1:0.231.100-2.amzn2023.0.2.noarch 56/107 2025-05-07T19:42:58.5842051Z Installing : perl-HTTP-Tiny-0.078-1.amzn2023.0.3.noarch 57/107 2025-05-07T19:42:58.5942747Z Installing : perl-Term-Cap-1.17-458.amzn2023.0.2.noarch 58/107 2025-05-07T19:42:58.6025819Z Installing : perl-Pod-Simple-1:3.42-2.amzn2023.0.2.noarch 59/107 2025-05-07T19:42:58.6085121Z Installing : perl-Socket-4:2.032-1.amzn2023.0.2.x86_64 60/107 2025-05-07T19:42:58.6128665Z Installing : perl-SelectSaver-1.02-477.amzn2023.0.6.noarch 61/107 2025-05-07T19:42:58.6168030Z Installing : perl-Symbol-1.08-477.amzn2023.0.6.noarch 62/107 2025-05-07T19:42:58.6244985Z Installing : perl-File-stat-1.09-477.amzn2023.0.6.noarch 63/107 2025-05-07T19:42:58.6335211Z Installing : perl-podlators-1:4.14-458.amzn2023.0.2.noarch 64/107 2025-05-07T19:42:58.6409117Z Installing : perl-Pod-Perldoc-3.28.01-459.amzn2023.0.3.noarch 65/107 2025-05-07T19:42:58.6463153Z Installing : perl-Fcntl-1.13-477.amzn2023.0.6.x86_64 66/107 2025-05-07T19:42:58.6521651Z Installing : perl-Text-ParseWords-3.30-458.amzn2023.0.2.noarc 67/107 2025-05-07T19:42:58.6590030Z Installing : perl-mro-1.23-477.amzn2023.0.6.x86_64 68/107 2025-05-07T19:42:58.6651352Z Installing : perl-IO-1.43-477.amzn2023.0.6.x86_64 69/107 2025-05-07T19:42:58.6702446Z Installing : perl-overloading-0.02-477.amzn2023.0.6.noarch 70/107 2025-05-07T19:42:58.6775900Z Installing : perl-Pod-Usage-4:2.01-2.amzn2023.0.2.noarch 71/107 2025-05-07T19:42:58.6827224Z Installing : perl-Errno-1.30-477.amzn2023.0.6.x86_64 72/107 
2025-05-07T19:42:58.6884350Z Installing : perl-File-Basename-2.85-477.amzn2023.0.6.noarch 73/107 2025-05-07T19:42:58.6945894Z Installing : perl-Getopt-Std-1.12-477.amzn2023.0.6.noarch 74/107 2025-05-07T19:42:58.7022511Z Installing : perl-MIME-Base64-3.16-2.amzn2023.0.2.x86_64 75/107 2025-05-07T19:42:58.7108780Z Installing : perl-Scalar-List-Utils-4:1.56-459.amzn2023.0.2.x 76/107 2025-05-07T19:42:58.7179306Z Installing : perl-constant-1.33-459.amzn2023.0.2.noarch 77/107 2025-05-07T19:42:58.7247573Z Installing : perl-Storable-1:3.21-458.amzn2023.0.2.x86_64 78/107 2025-05-07T19:42:58.7296134Z Installing : perl-overload-1.31-477.amzn2023.0.6.noarch 79/107 2025-05-07T19:42:58.7345768Z Installing : perl-parent-1:0.238-458.amzn2023.0.2.noarch 80/107 2025-05-07T19:42:58.7411128Z Installing : perl-vars-1.05-477.amzn2023.0.6.noarch 81/107 2025-05-07T19:42:58.7463305Z Installing : perl-Getopt-Long-1:2.52-2.amzn2023.0.2.noarch 82/107 2025-05-07T19:42:58.7520110Z Installing : perl-DynaLoader-1.47-477.amzn2023.0.6.x86_64 83/107 2025-05-07T19:42:58.7575234Z Installing : perl-Carp-1.50-458.amzn2023.0.2.noarch 84/107 2025-05-07T19:42:58.7629353Z Installing : perl-Exporter-5.74-459.amzn2023.0.2.noarch 85/107 2025-05-07T19:42:58.7708817Z Installing : perl-NDBM_File-1.15-477.amzn2023.0.6.x86_64 86/107 2025-05-07T19:42:58.8242855Z Installing : perl-PathTools-3.78-459.amzn2023.0.2.x86_64 87/107 2025-05-07T19:42:58.9200263Z Installing : perl-Encode-4:3.15-462.amzn2023.0.2.x86_64 88/107 2025-05-07T19:42:58.9340961Z Installing : perl-libs-4:5.32.1-477.amzn2023.0.6.x86_64 89/107 2025-05-07T19:42:58.9421516Z Installing : perl-interpreter-4:5.32.1-477.amzn2023.0.6.x86_6 90/107 2025-05-07T19:42:58.9489721Z Installing : perl-Error-1:0.17029-5.amzn2023.0.2.noarch 91/107 2025-05-07T19:42:58.9557318Z Installing : perl-File-Find-1.37-477.amzn2023.0.6.noarch 92/107 2025-05-07T19:42:58.9624070Z Installing : perl-TermReadKey-2.38-9.amzn2023.0.2.x86_64 93/107 2025-05-07T19:42:58.9684210Z Installing : 
perl-lib-0.65-477.amzn2023.0.6.x86_64 94/107 2025-05-07T19:42:58.9746932Z Installing : perl-Git-2.47.1-1.amzn2023.0.2.noarch 95/107 2025-05-07T19:42:58.9820463Z Installing : git-2.47.1-1.amzn2023.0.2.x86_64 96/107 2025-05-07T19:42:59.0025842Z Installing : elfutils-debuginfod-client-0.188-3.amzn2023.0.2. 97/107 2025-05-07T19:42:59.0157922Z Installing : cyrus-sasl-lib-2.1.27-18.amzn2023.0.3.x86_64 98/107 2025-05-07T19:42:59.0237342Z Installing : openldap-2.4.57-6.amzn2023.0.7.x86_64 99/107 2025-05-07T19:42:59.0638038Z Installing : sudo-python-plugin-1.9.15-1.p5.amzn2023.0.1.x86_ 100/107 2025-05-07T19:42:59.1860519Z Installing : sudo-1.9.15-1.p5.amzn2023.0.1.x86_64 101/107 2025-05-07T19:42:59.1948279Z Installing : binutils-2.41-50.amzn2023.0.3.x86_64 102/107 2025-05-07T19:42:59.2070669Z Running scriptlet: binutils-2.41-50.amzn2023.0.3.x86_64 102/107 2025-05-07T19:42:59.2372265Z Installing : pciutils-3.7.0-3.amzn2023.0.2.x86_64 103/107 2025-05-07T19:42:59.2468900Z Installing : wget-1.21.3-1.amzn2023.0.4.x86_64 104/107 2025-05-07T19:42:59.2723384Z Installing : which-2.21-26.amzn2023.0.2.x86_64 105/107 2025-05-07T19:42:59.2939651Z Installing : tar-2:1.34-1.amzn2023.0.4.x86_64 106/107 2025-05-07T19:42:59.3026028Z Installing : findutils-1:4.8.0-2.amzn2023.0.2.x86_64 107/107 2025-05-07T19:42:59.3147986Z Running scriptlet: pam-1.5.1-8.amzn2023.0.4.x86_64 107/107 2025-05-07T19:43:00.0771870Z Running scriptlet: findutils-1:4.8.0-2.amzn2023.0.2.x86_64 107/107 2025-05-07T19:43:00.0773843Z Verifying : binutils-2.41-50.amzn2023.0.3.x86_64 1/107 2025-05-07T19:43:00.0774466Z Verifying : cracklib-2.9.6-27.amzn2023.0.2.x86_64 2/107 2025-05-07T19:43:00.0775174Z Verifying : cyrus-sasl-lib-2.1.27-18.amzn2023.0.3.x86_64 3/107 2025-05-07T19:43:00.0775809Z Verifying : elfutils-debuginfod-client-0.188-3.amzn2023.0.2. 
4/107 2025-05-07T19:43:00.0776436Z Verifying : findutils-1:4.8.0-2.amzn2023.0.2.x86_64 5/107 2025-05-07T19:43:00.0777062Z Verifying : git-2.47.1-1.amzn2023.0.2.x86_64 6/107 2025-05-07T19:43:00.0777714Z Verifying : git-core-2.47.1-1.amzn2023.0.2.x86_64 7/107 2025-05-07T19:43:00.0778309Z Verifying : git-core-doc-2.47.1-1.amzn2023.0.2.noarch 8/107 2025-05-07T19:43:00.0779439Z Verifying : gnutls-3.8.3-6.amzn2023.0.1.x86_64 9/107 2025-05-07T19:43:00.0780110Z Verifying : groff-base-1.22.4-7.amzn2023.0.2.x86_64 10/107 2025-05-07T19:43:00.0780740Z Verifying : gzip-1.12-1.amzn2023.0.1.x86_64 11/107 2025-05-07T19:43:00.0781435Z Verifying : hwdata-0.384-1.amzn2023.0.3.noarch 12/107 2025-05-07T19:43:00.0782233Z Verifying : jansson-2.14-0.amzn2023.x86_64 13/107 2025-05-07T19:43:00.0782849Z Verifying : kmod-libs-29-2.amzn2023.0.5.x86_64 14/107 2025-05-07T19:43:00.0783561Z Verifying : less-608-2.amzn2023.0.2.x86_64 15/107 2025-05-07T19:43:00.0784156Z Verifying : libcbor-0.7.0-3.amzn2023.0.2.x86_64 16/107 2025-05-07T19:43:00.0784756Z Verifying : libdb-5.3.28-49.amzn2023.0.2.x86_64 17/107 2025-05-07T19:43:00.0785404Z Verifying : libeconf-0.4.0-1.amzn2023.0.3.x86_64 18/107 2025-05-07T19:43:00.0786060Z Verifying : libedit-3.1-38.20210714cvs.amzn2023.0.2.x86_64 19/107 2025-05-07T19:43:00.0786682Z Verifying : libfdisk-2.37.4-1.amzn2023.0.4.x86_64 20/107 2025-05-07T19:43:00.0787329Z Verifying : libfido2-1.10.0-2.amzn2023.0.2.x86_64 21/107 2025-05-07T19:43:00.0787971Z Verifying : libmetalink-0.1.3-14.amzn2023.0.2.x86_64 22/107 2025-05-07T19:43:00.0788613Z Verifying : libpwquality-1.4.4-6.amzn2023.0.2.x86_64 23/107 2025-05-07T19:43:00.0789336Z Verifying : libsemanage-3.4-5.amzn2023.0.2.x86_64 24/107 2025-05-07T19:43:00.0790002Z Verifying : libutempter-1.2.1-4.amzn2023.0.2.x86_64 25/107 2025-05-07T19:43:00.0790604Z Verifying : nano-8.3-1.amzn2023.x86_64 26/107 2025-05-07T19:43:00.0791327Z Verifying : nano-default-editor-8.3-1.amzn2023.noarch 27/107 2025-05-07T19:43:00.0791944Z Verifying : 
ncurses-6.2-4.20200222.amzn2023.0.6.x86_64 28/107 2025-05-07T19:43:00.0792676Z Verifying : nettle-3.10.1-1.amzn2023.0.1.x86_64 29/107 2025-05-07T19:43:00.0793345Z Verifying : openldap-2.4.57-6.amzn2023.0.7.x86_64 30/107 2025-05-07T19:43:00.0793940Z Verifying : openssh-8.7p1-8.amzn2023.0.14.x86_64 31/107 2025-05-07T19:43:00.0794579Z Verifying : openssh-clients-8.7p1-8.amzn2023.0.14.x86_64 32/107 2025-05-07T19:43:00.0795149Z Verifying : pam-1.5.1-8.amzn2023.0.4.x86_64 33/107 2025-05-07T19:43:00.0795680Z Verifying : pciutils-3.7.0-3.amzn2023.0.2.x86_64 34/107 2025-05-07T19:43:00.0796415Z Verifying : pciutils-libs-3.7.0-3.amzn2023.0.2.x86_64 35/107 2025-05-07T19:43:00.0796999Z Verifying : perl-AutoLoader-5.74-477.amzn2023.0.6.noarch 36/107 2025-05-07T19:43:00.0797542Z Verifying : perl-B-1.80-477.amzn2023.0.6.x86_64 37/107 2025-05-07T19:43:00.0798090Z Verifying : perl-Carp-1.50-458.amzn2023.0.2.noarch 38/107 2025-05-07T19:43:00.0798642Z Verifying : perl-Class-Struct-0.66-477.amzn2023.0.6.noarch 39/107 2025-05-07T19:43:00.0799230Z Verifying : perl-Data-Dumper-2.174-460.amzn2023.0.2.x86_64 40/107 2025-05-07T19:43:00.0799806Z Verifying : perl-Digest-1.20-1.amzn2023.0.2.noarch 41/107 2025-05-07T19:43:00.0800353Z Verifying : perl-Digest-MD5-2.58-2.amzn2023.0.2.x86_64 42/107 2025-05-07T19:43:00.0800931Z Verifying : perl-DynaLoader-1.47-477.amzn2023.0.6.x86_64 43/107 2025-05-07T19:43:00.0801473Z Verifying : perl-Encode-4:3.15-462.amzn2023.0.2.x86_64 44/107 2025-05-07T19:43:00.0802028Z Verifying : perl-Errno-1.30-477.amzn2023.0.6.x86_64 45/107 2025-05-07T19:43:00.0802657Z Verifying : perl-Error-1:0.17029-5.amzn2023.0.2.noarch 46/107 2025-05-07T19:43:00.0803228Z Verifying : perl-Exporter-5.74-459.amzn2023.0.2.noarch 47/107 2025-05-07T19:43:00.0803792Z Verifying : perl-Fcntl-1.13-477.amzn2023.0.6.x86_64 48/107 2025-05-07T19:43:00.0804340Z Verifying : perl-File-Basename-2.85-477.amzn2023.0.6.noarch 49/107 2025-05-07T19:43:00.0804921Z Verifying : 
perl-File-Find-1.37-477.amzn2023.0.6.noarch 50/107 2025-05-07T19:43:00.0805476Z Verifying : perl-File-Path-2.18-2.amzn2023.0.2.noarch 51/107 2025-05-07T19:43:00.0806051Z Verifying : perl-File-Temp-1:0.231.100-2.amzn2023.0.2.noarch 52/107 2025-05-07T19:43:00.0806612Z Verifying : perl-File-stat-1.09-477.amzn2023.0.6.noarch 53/107 2025-05-07T19:43:00.0807172Z Verifying : perl-FileHandle-2.03-477.amzn2023.0.6.noarch 54/107 2025-05-07T19:43:00.0807752Z Verifying : perl-Getopt-Long-1:2.52-2.amzn2023.0.2.noarch 55/107 2025-05-07T19:43:00.0808307Z Verifying : perl-Getopt-Std-1.12-477.amzn2023.0.6.noarch 56/107 2025-05-07T19:43:00.0808861Z Verifying : perl-Git-2.47.1-1.amzn2023.0.2.noarch 57/107 2025-05-07T19:43:00.0809397Z Verifying : perl-HTTP-Tiny-0.078-1.amzn2023.0.3.noarch 58/107 2025-05-07T19:43:00.0809975Z Verifying : perl-IO-1.43-477.amzn2023.0.6.x86_64 59/107 2025-05-07T19:43:00.0810521Z Verifying : perl-IO-Socket-IP-0.41-3.amzn2023.0.2.noarch 60/107 2025-05-07T19:43:00.0811071Z Verifying : perl-IO-Socket-SSL-2.075-1.amzn2023.0.2.noarch 61/107 2025-05-07T19:43:00.0811633Z Verifying : perl-IPC-Open3-1.21-477.amzn2023.0.6.noarch 62/107 2025-05-07T19:43:00.0812196Z Verifying : perl-MIME-Base64-3.16-2.amzn2023.0.2.x86_64 63/107 2025-05-07T19:43:00.0812738Z Verifying : perl-Mozilla-CA-20200520-4.amzn2023.0.2.noarch 64/107 2025-05-07T19:43:00.0813297Z Verifying : perl-NDBM_File-1.15-477.amzn2023.0.6.x86_64 65/107 2025-05-07T19:43:00.0813942Z Verifying : perl-Net-SSLeay-1.94-1.amzn2023.0.1.x86_64 66/107 2025-05-07T19:43:00.0814587Z Verifying : perl-POSIX-1.94-477.amzn2023.0.6.x86_64 67/107 2025-05-07T19:43:00.0815085Z Verifying : perl-PathTools-3.78-459.amzn2023.0.2.x86_64 68/107 2025-05-07T19:43:00.0815609Z Verifying : perl-Pod-Escapes-1:1.07-458.amzn2023.0.2.noarch 69/107 2025-05-07T19:43:00.0816148Z Verifying : perl-Pod-Perldoc-3.28.01-459.amzn2023.0.3.noarch 70/107 2025-05-07T19:43:00.0816667Z Verifying : perl-Pod-Simple-1:3.42-2.amzn2023.0.2.noarch 71/107 
2025-05-07T19:43:00.0817361Z Verifying : perl-Pod-Usage-4:2.01-2.amzn2023.0.2.noarch 72/107 2025-05-07T19:43:00.0817868Z Verifying : perl-Scalar-List-Utils-4:1.56-459.amzn2023.0.2.x 73/107 2025-05-07T19:43:00.0818413Z Verifying : perl-SelectSaver-1.02-477.amzn2023.0.6.noarch 74/107 2025-05-07T19:43:00.0818923Z Verifying : perl-Socket-4:2.032-1.amzn2023.0.2.x86_64 75/107 2025-05-07T19:43:00.0819721Z Verifying : perl-Storable-1:3.21-458.amzn2023.0.2.x86_64 76/107 2025-05-07T19:43:00.0820273Z Verifying : perl-Symbol-1.08-477.amzn2023.0.6.noarch 77/107 2025-05-07T19:43:00.0820825Z Verifying : perl-Term-ANSIColor-5.01-459.amzn2023.0.2.noarch 78/107 2025-05-07T19:43:00.0821402Z Verifying : perl-Term-Cap-1.17-458.amzn2023.0.2.noarch 79/107 2025-05-07T19:43:00.0821954Z Verifying : perl-TermReadKey-2.38-9.amzn2023.0.2.x86_64 80/107 2025-05-07T19:43:00.0822552Z Verifying : perl-Text-ParseWords-3.30-458.amzn2023.0.2.noarc 81/107 2025-05-07T19:43:00.0823140Z Verifying : perl-Text-Tabs+Wrap-2021.0726-1.amzn2023.0.1.noa 82/107 2025-05-07T19:43:00.0823688Z Verifying : perl-Time-Local-2:1.300-5.amzn2023.0.2.noarch 83/107 2025-05-07T19:43:00.0824307Z Verifying : perl-URI-5.09-1.amzn2023.0.2.noarch 84/107 2025-05-07T19:43:00.0824837Z Verifying : perl-base-2.27-477.amzn2023.0.6.noarch 85/107 2025-05-07T19:43:00.0825398Z Verifying : perl-constant-1.33-459.amzn2023.0.2.noarch 86/107 2025-05-07T19:43:00.0826040Z Verifying : perl-if-0.60.800-477.amzn2023.0.6.noarch 87/107 2025-05-07T19:43:00.0826552Z Verifying : perl-interpreter-4:5.32.1-477.amzn2023.0.6.x86_6 88/107 2025-05-07T19:43:00.0827059Z Verifying : perl-lib-0.65-477.amzn2023.0.6.x86_64 89/107 2025-05-07T19:43:00.0827551Z Verifying : perl-libnet-3.13-2.amzn2023.0.2.noarch 90/107 2025-05-07T19:43:00.0828061Z Verifying : perl-libs-4:5.32.1-477.amzn2023.0.6.x86_64 91/107 2025-05-07T19:43:00.0828539Z Verifying : perl-mro-1.23-477.amzn2023.0.6.x86_64 92/107 2025-05-07T19:43:00.0829061Z Verifying : 
perl-overload-1.31-477.amzn2023.0.6.noarch 93/107 2025-05-07T19:43:00.0829577Z Verifying : perl-overloading-0.02-477.amzn2023.0.6.noarch 94/107 2025-05-07T19:43:00.0830103Z Verifying : perl-parent-1:0.238-458.amzn2023.0.2.noarch 95/107 2025-05-07T19:43:00.0830616Z Verifying : perl-podlators-1:4.14-458.amzn2023.0.2.noarch 96/107 2025-05-07T19:43:00.0831113Z Verifying : perl-subs-1.03-477.amzn2023.0.6.noarch 97/107 2025-05-07T19:43:00.0831621Z Verifying : perl-vars-1.05-477.amzn2023.0.6.noarch 98/107 2025-05-07T19:43:00.0832112Z Verifying : shadow-utils-2:4.9-12.amzn2023.0.4.x86_64 99/107 2025-05-07T19:43:00.0832612Z Verifying : sudo-1.9.15-1.p5.amzn2023.0.1.x86_64 100/107 2025-05-07T19:43:00.0833129Z Verifying : sudo-python-plugin-1.9.15-1.p5.amzn2023.0.1.x86_ 101/107 2025-05-07T19:43:00.0833644Z Verifying : systemd-libs-252.23-3.amzn2023.x86_64 102/107 2025-05-07T19:43:00.0834141Z Verifying : tar-2:1.34-1.amzn2023.0.4.x86_64 103/107 2025-05-07T19:43:00.0834620Z Verifying : util-linux-2.37.4-1.amzn2023.0.4.x86_64 104/107 2025-05-07T19:43:00.0835137Z Verifying : util-linux-core-2.37.4-1.amzn2023.0.4.x86_64 105/107 2025-05-07T19:43:00.0835621Z Verifying : wget-1.21.3-1.amzn2023.0.4.x86_64 106/107 2025-05-07T19:43:00.1866722Z Verifying : which-2.21-26.amzn2023.0.2.x86_64 107/107 2025-05-07T19:43:00.1867778Z 2025-05-07T19:43:00.1868016Z Installed: 2025-05-07T19:43:00.1868987Z binutils-2.41-50.amzn2023.0.3.x86_64 2025-05-07T19:43:00.1871039Z cracklib-2.9.6-27.amzn2023.0.2.x86_64 2025-05-07T19:43:00.1872698Z cyrus-sasl-lib-2.1.27-18.amzn2023.0.3.x86_64 2025-05-07T19:43:00.1873357Z elfutils-debuginfod-client-0.188-3.amzn2023.0.2.x86_64 2025-05-07T19:43:00.1873930Z findutils-1:4.8.0-2.amzn2023.0.2.x86_64 2025-05-07T19:43:00.1874450Z git-2.47.1-1.amzn2023.0.2.x86_64 2025-05-07T19:43:00.1874951Z git-core-2.47.1-1.amzn2023.0.2.x86_64 2025-05-07T19:43:00.1875502Z git-core-doc-2.47.1-1.amzn2023.0.2.noarch 2025-05-07T19:43:00.1876050Z gnutls-3.8.3-6.amzn2023.0.1.x86_64 
2025-05-07T19:43:00.1876576Z groff-base-1.22.4-7.amzn2023.0.2.x86_64 2025-05-07T19:43:00.1877138Z gzip-1.12-1.amzn2023.0.1.x86_64 2025-05-07T19:43:00.1877772Z hwdata-0.384-1.amzn2023.0.3.noarch 2025-05-07T19:43:00.1878576Z jansson-2.14-0.amzn2023.x86_64 2025-05-07T19:43:00.1879079Z kmod-libs-29-2.amzn2023.0.5.x86_64 2025-05-07T19:43:00.1879600Z less-608-2.amzn2023.0.2.x86_64 2025-05-07T19:43:00.1880116Z libcbor-0.7.0-3.amzn2023.0.2.x86_64 2025-05-07T19:43:00.1880604Z libdb-5.3.28-49.amzn2023.0.2.x86_64 2025-05-07T19:43:00.1881131Z libeconf-0.4.0-1.amzn2023.0.3.x86_64 2025-05-07T19:43:00.1881658Z libedit-3.1-38.20210714cvs.amzn2023.0.2.x86_64 2025-05-07T19:43:00.1882459Z libfdisk-2.37.4-1.amzn2023.0.4.x86_64 2025-05-07T19:43:00.1883193Z libfido2-1.10.0-2.amzn2023.0.2.x86_64 2025-05-07T19:43:00.1883753Z libmetalink-0.1.3-14.amzn2023.0.2.x86_64 2025-05-07T19:43:00.1884362Z libpwquality-1.4.4-6.amzn2023.0.2.x86_64 2025-05-07T19:43:00.1884936Z libsemanage-3.4-5.amzn2023.0.2.x86_64 2025-05-07T19:43:00.1885530Z libutempter-1.2.1-4.amzn2023.0.2.x86_64 2025-05-07T19:43:00.1886063Z nano-8.3-1.amzn2023.x86_64 2025-05-07T19:43:00.1886647Z nano-default-editor-8.3-1.amzn2023.noarch 2025-05-07T19:43:00.1887260Z ncurses-6.2-4.20200222.amzn2023.0.6.x86_64 2025-05-07T19:43:00.1887817Z nettle-3.10.1-1.amzn2023.0.1.x86_64 2025-05-07T19:43:00.1888393Z openldap-2.4.57-6.amzn2023.0.7.x86_64 2025-05-07T19:43:00.1889047Z openssh-8.7p1-8.amzn2023.0.14.x86_64 2025-05-07T19:43:00.1889585Z openssh-clients-8.7p1-8.amzn2023.0.14.x86_64 2025-05-07T19:43:00.1890079Z pam-1.5.1-8.amzn2023.0.4.x86_64 2025-05-07T19:43:00.1890579Z pciutils-3.7.0-3.amzn2023.0.2.x86_64 2025-05-07T19:43:00.1891110Z pciutils-libs-3.7.0-3.amzn2023.0.2.x86_64 2025-05-07T19:43:00.1891642Z perl-AutoLoader-5.74-477.amzn2023.0.6.noarch 2025-05-07T19:43:00.1892168Z perl-B-1.80-477.amzn2023.0.6.x86_64 2025-05-07T19:43:00.1892661Z perl-Carp-1.50-458.amzn2023.0.2.noarch 2025-05-07T19:43:00.1893352Z 
perl-Class-Struct-0.66-477.amzn2023.0.6.noarch 2025-05-07T19:43:00.1893909Z perl-Data-Dumper-2.174-460.amzn2023.0.2.x86_64 2025-05-07T19:43:00.1894441Z perl-Digest-1.20-1.amzn2023.0.2.noarch 2025-05-07T19:43:00.1894973Z perl-Digest-MD5-2.58-2.amzn2023.0.2.x86_64 2025-05-07T19:43:00.1895499Z perl-DynaLoader-1.47-477.amzn2023.0.6.x86_64 2025-05-07T19:43:00.1896032Z perl-Encode-4:3.15-462.amzn2023.0.2.x86_64 2025-05-07T19:43:00.1896540Z perl-Errno-1.30-477.amzn2023.0.6.x86_64 2025-05-07T19:43:00.1897069Z perl-Error-1:0.17029-5.amzn2023.0.2.noarch 2025-05-07T19:43:00.1897627Z perl-Exporter-5.74-459.amzn2023.0.2.noarch 2025-05-07T19:43:00.1898162Z perl-Fcntl-1.13-477.amzn2023.0.6.x86_64 2025-05-07T19:43:00.1898724Z perl-File-Basename-2.85-477.amzn2023.0.6.noarch 2025-05-07T19:43:00.1899491Z perl-File-Find-1.37-477.amzn2023.0.6.noarch 2025-05-07T19:43:00.1900283Z perl-File-Path-2.18-2.amzn2023.0.2.noarch 2025-05-07T19:43:00.1900885Z perl-File-Temp-1:0.231.100-2.amzn2023.0.2.noarch 2025-05-07T19:43:00.1901465Z perl-File-stat-1.09-477.amzn2023.0.6.noarch 2025-05-07T19:43:00.1902083Z perl-FileHandle-2.03-477.amzn2023.0.6.noarch 2025-05-07T19:43:00.1902677Z perl-Getopt-Long-1:2.52-2.amzn2023.0.2.noarch 2025-05-07T19:43:00.1903281Z perl-Getopt-Std-1.12-477.amzn2023.0.6.noarch 2025-05-07T19:43:00.1903851Z perl-Git-2.47.1-1.amzn2023.0.2.noarch 2025-05-07T19:43:00.1904453Z perl-HTTP-Tiny-0.078-1.amzn2023.0.3.noarch 2025-05-07T19:43:00.1905036Z perl-IO-1.43-477.amzn2023.0.6.x86_64 2025-05-07T19:43:00.1905588Z perl-IO-Socket-IP-0.41-3.amzn2023.0.2.noarch 2025-05-07T19:43:00.1906277Z perl-IO-Socket-SSL-2.075-1.amzn2023.0.2.noarch 2025-05-07T19:43:00.1906813Z perl-IPC-Open3-1.21-477.amzn2023.0.6.noarch 2025-05-07T19:43:00.1907355Z perl-MIME-Base64-3.16-2.amzn2023.0.2.x86_64 2025-05-07T19:43:00.1907893Z perl-Mozilla-CA-20200520-4.amzn2023.0.2.noarch 2025-05-07T19:43:00.1908438Z perl-NDBM_File-1.15-477.amzn2023.0.6.x86_64 2025-05-07T19:43:00.1908967Z 
perl-Net-SSLeay-1.94-1.amzn2023.0.1.x86_64 2025-05-07T19:43:00.1909481Z perl-POSIX-1.94-477.amzn2023.0.6.x86_64 2025-05-07T19:43:00.1910018Z perl-PathTools-3.78-459.amzn2023.0.2.x86_64 2025-05-07T19:43:00.1910556Z perl-Pod-Escapes-1:1.07-458.amzn2023.0.2.noarch 2025-05-07T19:43:00.1911106Z perl-Pod-Perldoc-3.28.01-459.amzn2023.0.3.noarch 2025-05-07T19:43:00.1911652Z perl-Pod-Simple-1:3.42-2.amzn2023.0.2.noarch 2025-05-07T19:43:00.1912163Z perl-Pod-Usage-4:2.01-2.amzn2023.0.2.noarch 2025-05-07T19:43:00.1912703Z perl-Scalar-List-Utils-4:1.56-459.amzn2023.0.2.x86_64 2025-05-07T19:43:00.1913241Z perl-SelectSaver-1.02-477.amzn2023.0.6.noarch 2025-05-07T19:43:00.1913782Z perl-Socket-4:2.032-1.amzn2023.0.2.x86_64 2025-05-07T19:43:00.1914375Z perl-Storable-1:3.21-458.amzn2023.0.2.x86_64 2025-05-07T19:43:00.1914909Z perl-Symbol-1.08-477.amzn2023.0.6.noarch 2025-05-07T19:43:00.1915472Z perl-Term-ANSIColor-5.01-459.amzn2023.0.2.noarch 2025-05-07T19:43:00.1916015Z perl-Term-Cap-1.17-458.amzn2023.0.2.noarch 2025-05-07T19:43:00.1916565Z perl-TermReadKey-2.38-9.amzn2023.0.2.x86_64 2025-05-07T19:43:00.1917112Z perl-Text-ParseWords-3.30-458.amzn2023.0.2.noarch 2025-05-07T19:43:00.1917691Z perl-Text-Tabs+Wrap-2021.0726-1.amzn2023.0.1.noarch 2025-05-07T19:43:00.1918243Z perl-Time-Local-2:1.300-5.amzn2023.0.2.noarch 2025-05-07T19:43:00.1918744Z perl-URI-5.09-1.amzn2023.0.2.noarch 2025-05-07T19:43:00.1919266Z perl-base-2.27-477.amzn2023.0.6.noarch 2025-05-07T19:43:00.1919785Z perl-constant-1.33-459.amzn2023.0.2.noarch 2025-05-07T19:43:00.1920316Z perl-if-0.60.800-477.amzn2023.0.6.noarch 2025-05-07T19:43:00.1920973Z perl-interpreter-4:5.32.1-477.amzn2023.0.6.x86_64 2025-05-07T19:43:00.1921499Z perl-lib-0.65-477.amzn2023.0.6.x86_64 2025-05-07T19:43:00.1922022Z perl-libnet-3.13-2.amzn2023.0.2.noarch 2025-05-07T19:43:00.1922522Z perl-libs-4:5.32.1-477.amzn2023.0.6.x86_64 2025-05-07T19:43:00.1923029Z perl-mro-1.23-477.amzn2023.0.6.x86_64 2025-05-07T19:43:00.1923539Z 
perl-overload-1.31-477.amzn2023.0.6.noarch 2025-05-07T19:43:00.1924100Z perl-overloading-0.02-477.amzn2023.0.6.noarch 2025-05-07T19:43:00.1924638Z perl-parent-1:0.238-458.amzn2023.0.2.noarch 2025-05-07T19:43:00.1925177Z perl-podlators-1:4.14-458.amzn2023.0.2.noarch 2025-05-07T19:43:00.1925726Z perl-subs-1.03-477.amzn2023.0.6.noarch 2025-05-07T19:43:00.1926236Z perl-vars-1.05-477.amzn2023.0.6.noarch 2025-05-07T19:43:00.1926760Z shadow-utils-2:4.9-12.amzn2023.0.4.x86_64 2025-05-07T19:43:00.1927243Z sudo-1.9.15-1.p5.amzn2023.0.1.x86_64 2025-05-07T19:43:00.1927756Z sudo-python-plugin-1.9.15-1.p5.amzn2023.0.1.x86_64 2025-05-07T19:43:00.1928300Z systemd-libs-252.23-3.amzn2023.x86_64 2025-05-07T19:43:00.1928777Z tar-2:1.34-1.amzn2023.0.4.x86_64 2025-05-07T19:43:00.1929268Z util-linux-2.37.4-1.amzn2023.0.4.x86_64 2025-05-07T19:43:00.1929773Z util-linux-core-2.37.4-1.amzn2023.0.4.x86_64 2025-05-07T19:43:00.1930341Z wget-1.21.3-1.amzn2023.0.4.x86_64 2025-05-07T19:43:00.1930806Z which-2.21-26.amzn2023.0.2.x86_64 2025-05-07T19:43:00.1931120Z 2025-05-07T19:43:00.1931208Z Complete! 
2025-05-07T19:43:00.2701509Z ##[group]Run actions/checkout@v4 2025-05-07T19:43:00.2701885Z with: 2025-05-07T19:43:00.2702102Z submodules: true 2025-05-07T19:43:00.2702366Z repository: pytorch/FBGEMM 2025-05-07T19:43:00.2702873Z token: *** 2025-05-07T19:43:00.2703096Z ssh-strict: true 2025-05-07T19:43:00.2703342Z ssh-user: git 2025-05-07T19:43:00.2703589Z persist-credentials: true 2025-05-07T19:43:00.2703884Z clean: true 2025-05-07T19:43:00.2704122Z sparse-checkout-cone-mode: true 2025-05-07T19:43:00.2704625Z fetch-depth: 1 2025-05-07T19:43:00.2704848Z fetch-tags: false 2025-05-07T19:43:00.2705101Z show-progress: true 2025-05-07T19:43:00.2705331Z lfs: false 2025-05-07T19:43:00.2705568Z set-safe-directory: true 2025-05-07T19:43:00.2705819Z env: 2025-05-07T19:43:00.2706057Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:43:00.2706406Z BUILD_ENV: build_binary 2025-05-07T19:43:00.2706659Z BUILD_TARGET: default 2025-05-07T19:43:00.2706969Z BUILD_VARIANT: cuda 2025-05-07T19:43:00.2707479Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:43:00.2707752Z ##[endgroup] 2025-05-07T19:43:00.2750301Z ##[command]/usr/bin/docker exec 8b1652485f5b621f7f0b78973faa33c57195fcb1b1a66550db5cff7ec6c55546 sh -c "cat /etc/*release | grep ^ID" 2025-05-07T19:43:00.6178937Z Syncing repository: pytorch/FBGEMM 2025-05-07T19:43:00.6180518Z ##[group]Getting Git version info 2025-05-07T19:43:00.6180877Z Working directory is '/__w/FBGEMM/FBGEMM' 2025-05-07T19:43:00.6181412Z [command]/usr/bin/git version 2025-05-07T19:43:00.6181714Z git version 2.47.1 2025-05-07T19:43:00.6186104Z ##[endgroup] 2025-05-07T19:43:00.6201324Z Temporarily overriding HOME='/__w/_temp/ede4869b-1878-4215-8e6a-fe19273313c6' before making global git config changes 2025-05-07T19:43:00.6202146Z Adding repository directory to the temporary git global config as a safe directory 2025-05-07T19:43:00.6213794Z [command]/usr/bin/git config --global --add safe.directory /__w/FBGEMM/FBGEMM 2025-05-07T19:43:00.6243927Z [command]/usr/bin/git 
config --local --get remote.origin.url 2025-05-07T19:43:00.6261143Z https://github.com/pytorch/FBGEMM 2025-05-07T19:43:00.6283425Z ##[group]Removing previously created refs, to avoid conflicts 2025-05-07T19:43:00.6286225Z [command]/usr/bin/git rev-parse --symbolic-full-name --verify --quiet HEAD 2025-05-07T19:43:00.6302629Z HEAD 2025-05-07T19:43:00.6336150Z ##[endgroup] 2025-05-07T19:43:00.6336845Z [command]/usr/bin/git submodule status 2025-05-07T19:43:00.6701477Z e5d7c0bd5d9aec44d68830187138149e6a8c4e32 external/asmjit (e5d7c0b) 2025-05-07T19:43:00.6771535Z 4a61bdd4bd4ed730e078aebc7c0fcf046ff29406 external/composable_kernel (remotes/origin/FBGEMM) 2025-05-07T19:43:00.6876590Z 6543fec09b2f04ac4a666882998b534afc9c1349 external/cpuinfo (6543fec) 2025-05-07T19:43:00.6935961Z 3ed8d2ec4ba35ef5d9d8353826209b6f868f63d3 external/cutlass (remotes/origin/FBGEMM) 2025-05-07T19:43:00.7165580Z f8d7d77c06936315286eb55f8de22cd23c188571 external/googletest (release-1.8.0-3335-gf8d7d77c) 2025-05-07T19:43:00.7245313Z 420084499c7c1e1c2d801922f40df202eac5f3a0 external/hipify_torch (remotes/origin/mmelesse-9-g4200844) 2025-05-07T19:43:00.7280650Z 9cca280a4d0ccf0c08f47a99aa71d1b0e52f8d03 external/json (v3.11.2-84-g9cca280a) 2025-05-07T19:43:00.7292465Z ##[group]Cleaning the repository 2025-05-07T19:43:00.7294555Z [command]/usr/bin/git clean -ffdx 2025-05-07T19:43:00.8088608Z Removing amdgpu-install_6.3.60300-1_all.deb 2025-05-07T19:43:00.8089687Z Removing collect_env.py 2025-05-07T19:43:00.8090407Z Removing fbgemm_gpu/_skbuild/ 2025-05-07T19:43:00.8090848Z Removing fbgemm_gpu/bench/verify_fp16_stochastic_benchmark.hip 2025-05-07T19:43:00.8091356Z Removing fbgemm_gpu/codegen/genscript/__pycache__/ 2025-05-07T19:43:00.8091968Z Removing fbgemm_gpu/codegen/inference/embedding_forward_quantized_cpu_template_hip.cpp 2025-05-07T19:43:00.8092702Z Removing fbgemm_gpu/codegen/inference/embedding_forward_quantized_host_cpu_hip.cpp 2025-05-07T19:43:00.8093429Z Removing 
fbgemm_gpu/codegen/inference/embedding_forward_quantized_host_hip.cpp 2025-05-07T19:43:00.8094490Z Removing fbgemm_gpu/codegen/inference/embedding_forward_quantized_split_lookup.hip 2025-05-07T19:43:00.8095268Z Removing fbgemm_gpu/codegen/inference/embedding_forward_quantized_split_nbit_host_template.hip 2025-05-07T19:43:00.8096085Z Removing fbgemm_gpu/codegen/inference/embedding_forward_quantized_split_nbit_kernel_template.hip 2025-05-07T19:43:00.8097064Z Removing fbgemm_gpu/codegen/training/backward/embedding_backward_dense_host_cpu_hip.cpp 2025-05-07T19:43:00.8097941Z Removing fbgemm_gpu/codegen/training/backward/embedding_backward_split_cpu_approx_template_hip.cpp 2025-05-07T19:43:00.8098684Z Removing fbgemm_gpu/codegen/training/backward/embedding_backward_split_cpu_template_hip.cpp 2025-05-07T19:43:00.8100031Z Removing fbgemm_gpu/codegen/training/backward/embedding_backward_split_device_kernel_template_hip.cuh 2025-05-07T19:43:00.8100849Z Removing fbgemm_gpu/codegen/training/backward/embedding_backward_split_grad_template.hip 2025-05-07T19:43:00.8101630Z Removing fbgemm_gpu/codegen/training/backward/embedding_backward_split_host_cpu_template_hip.cpp 2025-05-07T19:43:00.8102441Z Removing fbgemm_gpu/codegen/training/backward/embedding_backward_split_host_template_hip.cpp 2025-05-07T19:43:00.8103251Z Removing fbgemm_gpu/codegen/training/backward/embedding_backward_split_indice_weights_template.hip 2025-05-07T19:43:00.8104082Z Removing fbgemm_gpu/codegen/training/backward/embedding_backward_split_kernel_cta_template.hip 2025-05-07T19:43:00.8104880Z Removing fbgemm_gpu/codegen/training/backward/embedding_backward_split_kernel_warp_template.hip 2025-05-07T19:43:00.8105685Z Removing fbgemm_gpu/codegen/training/backward/embedding_backward_split_meta_template_hip.cpp 2025-05-07T19:43:00.8106563Z Removing fbgemm_gpu/codegen/training/backward/embedding_backward_split_template.hip 2025-05-07T19:43:00.8107319Z Removing 
fbgemm_gpu/codegen/training/forward/embedding_forward_split_cpu_hip.cpp 2025-05-07T19:43:00.8108034Z Removing fbgemm_gpu/codegen/training/forward/embedding_forward_split_kernel_nobag_small_template.hip 2025-05-07T19:43:00.8109147Z Removing fbgemm_gpu/codegen/training/forward/embedding_forward_split_kernel_template.hip 2025-05-07T19:43:00.8109974Z Removing fbgemm_gpu/codegen/training/forward/embedding_forward_split_kernel_v2_template.hip 2025-05-07T19:43:00.8110708Z Removing fbgemm_gpu/codegen/training/forward/embedding_forward_split_template.hip 2025-05-07T19:43:00.8111411Z Removing fbgemm_gpu/codegen/training/index_select/batch_index_select_dim0_cpu_host_hip.cpp 2025-05-07T19:43:00.8112177Z Removing fbgemm_gpu/codegen/training/index_select/batch_index_select_dim0_ops_hip.cpp 2025-05-07T19:43:00.8112965Z Removing fbgemm_gpu/codegen/training/optimizer/embedding_optimizer_split_device_kernel_template_hip.cuh 2025-05-07T19:43:00.8113824Z Removing fbgemm_gpu/codegen/training/optimizer/embedding_optimizer_split_host_template_hip.cpp 2025-05-07T19:43:00.8114631Z Removing fbgemm_gpu/codegen/training/optimizer/embedding_optimizer_split_kernel_template.hip 2025-05-07T19:43:00.8115401Z Removing fbgemm_gpu/codegen/training/optimizer/embedding_optimizer_split_template.hip 2025-05-07T19:43:00.8116249Z Removing fbgemm_gpu/codegen/training/pt2/embedding_split_host_pt2_autograd_template_hip.cpp 2025-05-07T19:43:00.8117040Z Removing fbgemm_gpu/codegen/training/pt2/embedding_split_host_pt2_cpu_wrapper_template_hip.cpp 2025-05-07T19:43:00.8117761Z Removing fbgemm_gpu/codegen/training/pt2/embedding_split_host_pt2_hip_wrapper_template.cpp 2025-05-07T19:43:00.8147799Z Removing fbgemm_gpu/codegen/utils/embedding_bounds_check_host_cpu_hip.cpp 2025-05-07T19:43:00.8148529Z Removing fbgemm_gpu/codegen/utils/embedding_bounds_check_host_hip.cpp 2025-05-07T19:43:00.8149087Z Removing fbgemm_gpu/codegen/utils/embedding_bounds_check_v1.hip 2025-05-07T19:43:00.8149629Z Removing 
fbgemm_gpu/codegen/utils/embedding_bounds_check_v2.hip 2025-05-07T19:43:00.8150049Z Removing fbgemm_gpu/dist/ 2025-05-07T19:43:00.8150469Z Removing fbgemm_gpu/experimental/example/src/cutlass_sgemm_nn.hip 2025-05-07T19:43:00.8151177Z Removing fbgemm_gpu/experimental/example/src/example_nccl_hip.cpp 2025-05-07T19:43:00.8151760Z Removing fbgemm_gpu/experimental/gen_ai/src/attention/gqa_attn_splitk.hip 2025-05-07T19:43:00.8152320Z Removing fbgemm_gpu/experimental/gen_ai/src/coalesce/coalesce.hip 2025-05-07T19:43:00.8152820Z Removing fbgemm_gpu/experimental/gen_ai/src/comm/car.hip 2025-05-07T19:43:00.8153294Z Removing fbgemm_gpu/experimental/gen_ai/src/comm/car_hip.cpp 2025-05-07T19:43:00.8153840Z Removing fbgemm_gpu/experimental/gen_ai/src/gather_scatter/gather_scatter.hip 2025-05-07T19:43:00.8154506Z Removing fbgemm_gpu/experimental/gen_ai/src/kv_cache/kv_cache.hip 2025-05-07T19:43:00.8155033Z Removing fbgemm_gpu/experimental/gen_ai/src/kv_cache/kv_cache_hip.cpp 2025-05-07T19:43:00.8155597Z Removing fbgemm_gpu/experimental/gen_ai/src/moe/index_shuffling.hip 2025-05-07T19:43:00.8156335Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/ck_extensions/bf16_grouped/kernels/bf16_grouped_common_hip.h 2025-05-07T19:43:00.8157286Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/ck_extensions/fp8_rowwise/kernels/fp8_rowwise_common_hip.h 2025-05-07T19:43:00.8158286Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/ck_extensions/fp8_rowwise_batched/kernels/fp8_rowwise_batched_common_hip.h 2025-05-07T19:43:00.8159340Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/ck_extensions/fp8_rowwise_grouped/kernels/fp8_rowwise_grouped_common_hip.h 2025-05-07T19:43:00.8160280Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/ck_extensions/fused_moe/fused_moe_op_hip.cpp 2025-05-07T19:43:00.8160988Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cublas_utils_hip.h 2025-05-07T19:43:00.8161672Z Removing 
fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/bf16bf16bf16_grouped.hip 2025-05-07T19:43:00.8162549Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/bf16i4bf16.hip 2025-05-07T19:43:00.8163299Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/bf16i4bf16_rowwise_batched.hip 2025-05-07T19:43:00.8164147Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/bf16i4bf16_shuffled_grouped.hip 2025-05-07T19:43:00.8164909Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16.hip 2025-05-07T19:43:00.8165673Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_128_128_4_1_1_f.hip 2025-05-07T19:43:00.8166551Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_128_128_4_1_1_t.hip 2025-05-07T19:43:00.8167419Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_128_192_2_2_1_f.hip 2025-05-07T19:43:00.8168287Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_128_192_2_2_1_t.hip 2025-05-07T19:43:00.8169153Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_128_256_2_1_1_f.hip 2025-05-07T19:43:00.8170010Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_128_256_2_1_1_t.hip 2025-05-07T19:43:00.8170890Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_128_2_2_1_f.hip 2025-05-07T19:43:00.8171747Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_128_2_2_1_t.hip 2025-05-07T19:43:00.8172622Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_128_2_4_1_f.hip 2025-05-07T19:43:00.8173494Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_128_2_4_1_t.hip 
2025-05-07T19:43:00.8174355Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_192_2_2_1_f.hip 2025-05-07T19:43:00.8175222Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_192_2_2_1_t.hip 2025-05-07T19:43:00.8176087Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_192_2_4_1_f.hip 2025-05-07T19:43:00.8177029Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_192_2_4_1_t.hip 2025-05-07T19:43:00.8177934Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_192_4_1_1_f.hip 2025-05-07T19:43:00.8178795Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_192_4_1_1_t.hip 2025-05-07T19:43:00.8179952Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_256_2_1_1_f.hip 2025-05-07T19:43:00.8180888Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_256_2_1_1_t.hip 2025-05-07T19:43:00.8181790Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_256_2_2_1_f.hip 2025-05-07T19:43:00.8182866Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_256_2_2_1_t.hip 2025-05-07T19:43:00.8183756Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_256_2_4_1_f.hip 2025-05-07T19:43:00.8184660Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_256_2_4_1_t.hip 2025-05-07T19:43:00.8185542Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_256_4_1_1_f.hip 2025-05-07T19:43:00.8186447Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_256_4_1_1_t.hip 2025-05-07T19:43:00.8187337Z Removing 
fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_common_hip.cuh 2025-05-07T19:43:00.8188189Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_manifest_hip.cuh 2025-05-07T19:43:00.8188967Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16.hip 2025-05-07T19:43:00.8189690Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_blockwise.hip 2025-05-07T19:43:00.8190466Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_cublas.hip 2025-05-07T19:43:00.8191211Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_lite.hip 2025-05-07T19:43:00.8191944Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise.hip 2025-05-07T19:43:00.8192873Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise/f8f8bf16_rowwise_128_128_128_2_1_1_t_f.hip 2025-05-07T19:43:00.8194080Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise/f8f8bf16_rowwise_128_256_128_2_1_1_f_t.hip 2025-05-07T19:43:00.8195227Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise/f8f8bf16_rowwise_128_256_128_4_4_1_f_t.hip 2025-05-07T19:43:00.8196241Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise/f8f8bf16_rowwise_64_128_128_1_1_1_f_f.hip 2025-05-07T19:43:00.8197237Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise/f8f8bf16_rowwise_64_16_128_1_1_1_f_f.hip 2025-05-07T19:43:00.8198242Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise/f8f8bf16_rowwise_64_256_128_1_1_1_f_f.hip 2025-05-07T19:43:00.8199252Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise/f8f8bf16_rowwise_64_256_128_2_1_1_f_f.hip 2025-05-07T19:43:00.8200240Z Removing 
fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise/f8f8bf16_rowwise_64_32_128_2_1_1_f_f.hip 2025-05-07T19:43:00.8201245Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise/f8f8bf16_rowwise_64_64_128_2_1_1_f_f.hip 2025-05-07T19:43:00.8202202Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise/f8f8bf16_rowwise_common_hip.cuh 2025-05-07T19:43:00.8203235Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise_batched/common_hip.cuh 2025-05-07T19:43:00.8204347Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise_batched/dispatch_fp8_rowwise_batched_kernel_on_cluster_size_and_transpose.hip 2025-05-07T19:43:00.8205596Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise_batched/dispatch_fp8_rowwise_batched_kernel_on_tile_size.hip 2025-05-07T19:43:00.8206694Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise_batched/f8f8bf16_rowwise_batched.hip 2025-05-07T19:43:00.8207815Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise_batched/f8f8bf16_rowwise_batched_impl.hip 2025-05-07T19:43:00.8208800Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise_batched/handle_transposition.hip 2025-05-07T19:43:00.8209677Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise_grouped.hip 2025-05-07T19:43:00.8210424Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_tensorwise.hip 2025-05-07T19:43:00.8211153Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8i4bf16_rowwise.hip 2025-05-07T19:43:00.8211870Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8i4bf16_shuffled.hip 2025-05-07T19:43:00.8212615Z Removing 
fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8i4bf16_shuffled_grouped.hip 2025-05-07T19:43:00.8213334Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/i8i8bf16.hip 2025-05-07T19:43:00.8214001Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/i8i8bf16_dynamic.hip 2025-05-07T19:43:00.8214809Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/include/fp8_blockwise_cutlass_helpers_hip.h 2025-05-07T19:43:00.8215622Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/mixed_dtype_utils.hip 2025-05-07T19:43:00.8216280Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/fast_gemv/bf16_fast_gemv.hip 2025-05-07T19:43:00.8216913Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/fast_gemv/bf16fp8bf16_fast_gemv.hip 2025-05-07T19:43:00.8217563Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/fast_gemv/fp8fp8bf16_fast_gemv.hip 2025-05-07T19:43:00.8218211Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/fast_gemv/include/fast_gemv.hip 2025-05-07T19:43:00.8218864Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/fast_gemv/include/fast_gemv_hip.cuh 2025-05-07T19:43:00.8219763Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/fast_gemv/include/utility_hip.cuh 2025-05-07T19:43:00.8220389Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/quantize.hip 2025-05-07T19:43:00.8220925Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/quantize_hip.cpp 2025-05-07T19:43:00.8221412Z Removing fbgemm_gpu/fbgemm_gpu/docs/version.py 2025-05-07T19:43:00.8221799Z Removing fbgemm_gpu/fbgemm_gpu_nightly.egg-info/ 2025-05-07T19:43:00.8222241Z Removing fbgemm_gpu/include/fbgemm_gpu/cumem_utils_hip.h 2025-05-07T19:43:00.8222813Z Removing fbgemm_gpu/include/fbgemm_gpu/embedding_backward_template_helpers_hip.cuh 2025-05-07T19:43:00.8223442Z Removing fbgemm_gpu/include/fbgemm_gpu/embedding_forward_split_cpu_hip.h 2025-05-07T19:43:00.8224077Z Removing 
fbgemm_gpu/include/fbgemm_gpu/embedding_forward_template_helpers_hip.cuh 2025-05-07T19:43:00.8224674Z Removing fbgemm_gpu/include/fbgemm_gpu/layout_transform_ops_hip.cuh 2025-05-07T19:43:00.8225266Z Removing fbgemm_gpu/include/fbgemm_gpu/permute_multi_embedding_function_hip.h 2025-05-07T19:43:00.8225818Z Removing fbgemm_gpu/include/fbgemm_gpu/quantize_ops_hip.cuh 2025-05-07T19:43:00.8226292Z Removing fbgemm_gpu/include/fbgemm_gpu/sparse_ops_hip.cuh 2025-05-07T19:43:00.8226800Z Removing fbgemm_gpu/include/fbgemm_gpu/split_embeddings_utils_hip.cuh 2025-05-07T19:43:00.8227535Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/barrier_isolation_hip.cuh 2025-05-07T19:43:00.8228179Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/bench_utils_hip.cuh 2025-05-07T19:43:00.8228687Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/bitonic_sort_hip.cuh 2025-05-07T19:43:00.8229243Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/cub_namespace_postfix_hip.cuh 2025-05-07T19:43:00.8229834Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/cub_namespace_prefix_hip.cuh 2025-05-07T19:43:00.8230436Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/device_cache_flusher_hip.cuh 2025-05-07T19:43:00.8231022Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/device_properties_hip.cuh 2025-05-07T19:43:00.8231631Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/dispatch_macros_hip.h 2025-05-07T19:43:00.8232312Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/embedding_bounds_check_common_hip.cuh 2025-05-07T19:43:00.8232859Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/find_qparams_hip.cuh 2025-05-07T19:43:00.8233328Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/float_hip.cuh 2025-05-07T19:43:00.8233775Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/hip_prelude.cuh 2025-05-07T19:43:00.8234283Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/host_device_buffer_pair_hip.cuh 2025-05-07T19:43:00.8234842Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/inclusive_sum_scan_hip.cuh 2025-05-07T19:43:00.8235353Z Removing 
fbgemm_gpu/include/fbgemm_gpu/utils/kernel_launcher_hip.cuh 2025-05-07T19:43:00.8235896Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/rocm/stochastic_rounding_hip.h 2025-05-07T19:43:00.8236394Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/rocm/vec2_hip.h 2025-05-07T19:43:00.8236874Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/rocm/weight_row_hip.h 2025-05-07T19:43:00.8237372Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/shared_memory_hip.cuh 2025-05-07T19:43:00.8237879Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding_hip.cuh 2025-05-07T19:43:00.8238437Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/tensor_accessor_builder_hip.h 2025-05-07T19:43:00.8238953Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/tensor_accessor_hip.h 2025-05-07T19:43:00.8239423Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/vec4_hip.cuh 2025-05-07T19:43:00.8239846Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/vec4acc_hip.cuh 2025-05-07T19:43:00.8240303Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/vec_quant_hip.cuh 2025-05-07T19:43:00.8240744Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/vecn_hip.cuh 2025-05-07T19:43:00.8241175Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/weight_row_hip.cuh 2025-05-07T19:43:00.8241696Z Removing fbgemm_gpu/src/dram_kv_embedding_cache/dram_kv_embedding_cache_hip.h 2025-05-07T19:43:00.8242284Z Removing fbgemm_gpu/src/dram_kv_embedding_cache/dram_kv_embedding_cache_wrapper_hip.h 2025-05-07T19:43:00.8242879Z Removing fbgemm_gpu/src/embedding_inplace_ops/embedding_inplace_update.hip 2025-05-07T19:43:00.8243463Z Removing fbgemm_gpu/src/embedding_inplace_ops/embedding_inplace_update_gpu_hip.cpp 2025-05-07T19:43:00.8244013Z Removing fbgemm_gpu/src/histogram_binning_calibration_ops.hip 2025-05-07T19:43:00.8244465Z Removing fbgemm_gpu/src/input_combine_ops/input_combine.hip 2025-05-07T19:43:00.8244928Z Removing fbgemm_gpu/src/input_combine_ops/input_combine_cpu_hip.cpp 2025-05-07T19:43:00.8245521Z Removing 
fbgemm_gpu/src/intraining_embedding_pruning_ops/intraining_embedding_pruning.hip 2025-05-07T19:43:00.8246221Z Removing fbgemm_gpu/src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu_hip.cpp 2025-05-07T19:43:00.8246912Z Removing fbgemm_gpu/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.hip 2025-05-07T19:43:00.8247548Z Removing fbgemm_gpu/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.hip 2025-05-07T19:43:00.8248072Z Removing fbgemm_gpu/src/jagged_tensor_ops/common_hip.cuh 2025-05-07T19:43:00.8248540Z Removing fbgemm_gpu/src/jagged_tensor_ops/dense_to_jagged_forward.hip 2025-05-07T19:43:00.8249055Z Removing fbgemm_gpu/src/jagged_tensor_ops/jagged_dense_bmm_forward.hip 2025-05-07T19:43:00.8249705Z Removing fbgemm_gpu/src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.hip 2025-05-07T19:43:00.8250468Z Removing fbgemm_gpu/src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.hip 2025-05-07T19:43:00.8251089Z Removing fbgemm_gpu/src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.hip 2025-05-07T19:43:00.8251664Z Removing fbgemm_gpu/src/jagged_tensor_ops/jagged_index_add_2d_forward.hip 2025-05-07T19:43:00.8252202Z Removing fbgemm_gpu/src/jagged_tensor_ops/jagged_index_select_2d_forward.hip 2025-05-07T19:43:00.8252739Z Removing fbgemm_gpu/src/jagged_tensor_ops/jagged_jagged_bmm_forward.hip 2025-05-07T19:43:00.8253297Z Removing fbgemm_gpu/src/jagged_tensor_ops/jagged_softmax_backward.hip 2025-05-07T19:43:00.8253842Z Removing fbgemm_gpu/src/jagged_tensor_ops/jagged_softmax_forward.hip 2025-05-07T19:43:00.8254332Z Removing fbgemm_gpu/src/jagged_tensor_ops/jagged_tensor_ops.hip 2025-05-07T19:43:00.8254816Z Removing fbgemm_gpu/src/jagged_tensor_ops/jagged_tensor_ops_cpu_hip.cpp 2025-05-07T19:43:00.8255376Z Removing fbgemm_gpu/src/jagged_tensor_ops/jagged_to_padded_dense_backward.hip 2025-05-07T19:43:00.8255961Z Removing fbgemm_gpu/src/jagged_tensor_ops/jagged_to_padded_dense_forward.hip 
2025-05-07T19:43:00.8256485Z Removing fbgemm_gpu/src/jagged_tensor_ops/jagged_unique_indices.hip 2025-05-07T19:43:00.8257042Z Removing fbgemm_gpu/src/jagged_tensor_ops/keyed_jagged_index_select_dim1.hip 2025-05-07T19:43:00.8257571Z Removing fbgemm_gpu/src/layout_transform_ops/layout_transform_ops.hip 2025-05-07T19:43:00.8258116Z Removing fbgemm_gpu/src/layout_transform_ops/layout_transform_ops_cpu_hip.cpp 2025-05-07T19:43:00.8258593Z Removing fbgemm_gpu/src/memory_utils/common_hip.cuh 2025-05-07T19:43:00.8258990Z Removing fbgemm_gpu/src/memory_utils/memory_utils.hip 2025-05-07T19:43:00.8259485Z Removing fbgemm_gpu/src/memory_utils/memory_utils_hip.cpp 2025-05-07T19:43:00.8260098Z Removing fbgemm_gpu/src/memory_utils/memory_utils_ops.hip 2025-05-07T19:43:00.8260650Z Removing fbgemm_gpu/src/memory_utils/memory_utils_ops_hip.cpp 2025-05-07T19:43:00.8261241Z Removing fbgemm_gpu/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu_hip.cpp 2025-05-07T19:43:00.8261964Z Removing fbgemm_gpu/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu_hip.cpp 2025-05-07T19:43:00.8262519Z Removing fbgemm_gpu/src/metric_ops/metric_ops.hip 2025-05-07T19:43:00.8263098Z Removing fbgemm_gpu/src/permute_multi_embedding_ops/permute_multi_embedding_function_hip.cpp 2025-05-07T19:43:00.8263815Z Removing fbgemm_gpu/src/permute_multi_embedding_ops/permute_multi_embedding_ops.hip 2025-05-07T19:43:00.8264506Z Removing fbgemm_gpu/src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu_hip.cpp 2025-05-07T19:43:00.8265236Z Removing fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.hip 2025-05-07T19:43:00.8265944Z Removing fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu_hip.cpp 2025-05-07T19:43:00.8266699Z Removing fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.hip 2025-05-07T19:43:00.8267430Z Removing fbgemm_gpu/src/ps_split_embeddings_cache/ps_split_table_batched_embeddings_hip.cpp 
2025-05-07T19:43:00.8268136Z Removing fbgemm_gpu/src/ps_split_embeddings_cache/ps_table_batched_embeddings_hip.h 2025-05-07T19:43:00.8268678Z Removing fbgemm_gpu/src/quantize_ops/common_hip.cuh 2025-05-07T19:43:00.8269088Z Removing fbgemm_gpu/src/quantize_ops/mx/common_hip.cuh 2025-05-07T19:43:00.8269516Z Removing fbgemm_gpu/src/quantize_ops/mx_common_hip.cuh 2025-05-07T19:43:00.8269952Z Removing fbgemm_gpu/src/quantize_ops/quantize_bfloat16.hip 2025-05-07T19:43:00.8270471Z Removing fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.hip 2025-05-07T19:43:00.8270980Z Removing fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.hip 2025-05-07T19:43:00.8271530Z Removing fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.hip 2025-05-07T19:43:00.8272117Z Removing fbgemm_gpu/src/quantize_ops/quantize_hfp8.hip 2025-05-07T19:43:00.8272505Z Removing fbgemm_gpu/src/quantize_ops/quantize_msfp.hip 2025-05-07T19:43:00.8272901Z Removing fbgemm_gpu/src/quantize_ops/quantize_mx.hip 2025-05-07T19:43:00.8273374Z Removing fbgemm_gpu/src/quantize_ops/quantize_mx_hip.cuh 2025-05-07T19:43:00.8273813Z Removing fbgemm_gpu/src/quantize_ops/quantize_ops_cpu_hip.cpp 2025-05-07T19:43:00.8274280Z Removing fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.hip 2025-05-07T19:43:00.8274725Z Removing fbgemm_gpu/src/sparse_ops/common_hip.cuh 2025-05-07T19:43:00.8275160Z Removing fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.hip 2025-05-07T19:43:00.8275656Z Removing fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum_hip.cpp 2025-05-07T19:43:00.8276197Z Removing fbgemm_gpu/src/sparse_ops/sparse_async_cumsum.hip 2025-05-07T19:43:00.8276629Z Removing fbgemm_gpu/src/sparse_ops/sparse_async_cumsum_hip.cpp 2025-05-07T19:43:00.8277126Z Removing fbgemm_gpu/src/sparse_ops/sparse_batched_unary_embeddings.hip 2025-05-07T19:43:00.8277639Z Removing fbgemm_gpu/src/sparse_ops/sparse_block_bucketize_features.hip 2025-05-07T19:43:00.8278162Z Removing 
fbgemm_gpu/src/sparse_ops/sparse_bucketize_features.hip 2025-05-07T19:43:00.8278770Z Removing fbgemm_gpu/src/sparse_ops/sparse_compute_frequency_sequence.hip 2025-05-07T19:43:00.8279303Z Removing fbgemm_gpu/src/sparse_ops/sparse_expand_into_jagged_permute.hip 2025-05-07T19:43:00.8279796Z Removing fbgemm_gpu/src/sparse_ops/sparse_group_index.hip 2025-05-07T19:43:00.8280204Z Removing fbgemm_gpu/src/sparse_ops/sparse_index_add.hip 2025-05-07T19:43:00.8280621Z Removing fbgemm_gpu/src/sparse_ops/sparse_index_select.hip 2025-05-07T19:43:00.8281050Z Removing fbgemm_gpu/src/sparse_ops/sparse_invert_permute.hip 2025-05-07T19:43:00.8281499Z Removing fbgemm_gpu/src/sparse_ops/sparse_ops_cpu_hip.cpp 2025-05-07T19:43:00.8282115Z Removing fbgemm_gpu/src/sparse_ops/sparse_pack_segments_backward.hip 2025-05-07T19:43:00.8282945Z Removing fbgemm_gpu/src/sparse_ops/sparse_pack_segments_forward.hip 2025-05-07T19:43:00.8283461Z Removing fbgemm_gpu/src/sparse_ops/sparse_permute102.hip 2025-05-07T19:43:00.8283907Z Removing fbgemm_gpu/src/sparse_ops/sparse_permute_1d.hip 2025-05-07T19:43:00.8284367Z Removing fbgemm_gpu/src/sparse_ops/sparse_permute_2d.hip 2025-05-07T19:43:00.8284840Z Removing fbgemm_gpu/src/sparse_ops/sparse_permute_embeddings.hip 2025-05-07T19:43:00.8285306Z Removing fbgemm_gpu/src/sparse_ops/sparse_range.hip 2025-05-07T19:43:00.8285765Z Removing fbgemm_gpu/src/sparse_ops/sparse_reorder_batched_ad.hip 2025-05-07T19:43:00.8286250Z Removing fbgemm_gpu/src/sparse_ops/sparse_segment_sum_csr.hip 2025-05-07T19:43:00.8286703Z Removing fbgemm_gpu/src/sparse_ops/sparse_zipf.hip 2025-05-07T19:43:00.8287175Z Removing fbgemm_gpu/src/split_embeddings_cache/cachelib_cache_hip.cpp 2025-05-07T19:43:00.8287703Z Removing fbgemm_gpu/src/split_embeddings_cache/common_hip.cuh 2025-05-07T19:43:00.8288171Z Removing fbgemm_gpu/src/split_embeddings_cache/common_hip.h 2025-05-07T19:43:00.8288668Z Removing fbgemm_gpu/src/split_embeddings_cache/lfu_cache_find.hip 2025-05-07T19:43:00.8289230Z 
Removing fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate.hip 2025-05-07T19:43:00.8289794Z Removing fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.hip 2025-05-07T19:43:00.8290431Z Removing fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte_hip.cpp 2025-05-07T19:43:00.8291035Z Removing fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.hip 2025-05-07T19:43:00.8291650Z Removing fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices_hip.cpp 2025-05-07T19:43:00.8292231Z Removing fbgemm_gpu/src/split_embeddings_cache/lru_cache_find.hip 2025-05-07T19:43:00.8292757Z Removing fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate.hip 2025-05-07T19:43:00.8293338Z Removing fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.hip 2025-05-07T19:43:00.8293929Z Removing fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte_hip.cpp 2025-05-07T19:43:00.8294486Z Removing fbgemm_gpu/src/split_embeddings_cache/lxu_cache.hip 2025-05-07T19:43:00.8295188Z Removing fbgemm_gpu/src/split_embeddings_cache/lxu_cache_hip.cpp 2025-05-07T19:43:00.8295837Z Removing fbgemm_gpu/src/split_embeddings_cache/reset_weight_momentum.hip 2025-05-07T19:43:00.8296404Z Removing fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.hip 2025-05-07T19:43:00.8296984Z Removing fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops_hip.cpp 2025-05-07T19:43:00.8297562Z Removing fbgemm_gpu/src/split_embeddings_utils/generate_vbe_metadata.hip 2025-05-07T19:43:00.8298076Z Removing fbgemm_gpu/src/split_embeddings_utils/get_infos_metadata.hip 2025-05-07T19:43:00.8298586Z Removing fbgemm_gpu/src/split_embeddings_utils/radix_sort_pairs.hip 2025-05-07T19:43:00.8299255Z Removing fbgemm_gpu/src/split_embeddings_utils/split_embeddings_utils_hip.cpp 2025-05-07T19:43:00.8300029Z Removing fbgemm_gpu/src/split_embeddings_utils/transpose_embedding_input.hip 2025-05-07T19:43:00.8300743Z Removing 
fbgemm_gpu/src/ssd_split_embeddings_cache/embedding_rocksdb_wrapper_hip.h 2025-05-07T19:43:00.8301341Z Removing fbgemm_gpu/src/ssd_split_embeddings_cache/kv_db_hip_utils.cpp 2025-05-07T19:43:00.8301904Z Removing fbgemm_gpu/src/ssd_split_embeddings_cache/kv_db_hip_utils.h 2025-05-07T19:43:00.8302516Z Removing fbgemm_gpu/src/ssd_split_embeddings_cache/kv_db_table_batched_embeddings_hip.cpp 2025-05-07T19:43:00.8303226Z Removing fbgemm_gpu/src/ssd_split_embeddings_cache/kv_db_table_batched_embeddings_hip.h 2025-05-07T19:43:00.8303901Z Removing fbgemm_gpu/src/ssd_split_embeddings_cache/kv_tensor_wrapper_cpu_hip.cpp 2025-05-07T19:43:00.8304558Z Removing fbgemm_gpu/src/ssd_split_embeddings_cache/ssd_scratch_pad_indices_queue_hip.cpp 2025-05-07T19:43:00.8305256Z Removing fbgemm_gpu/src/ssd_split_embeddings_cache/ssd_split_embeddings_cache_hip.hip 2025-05-07T19:43:00.8305956Z Removing fbgemm_gpu/src/ssd_split_embeddings_cache/ssd_split_table_batched_embeddings_hip.cpp 2025-05-07T19:43:00.8306668Z Removing fbgemm_gpu/src/ssd_split_embeddings_cache/ssd_table_batched_embeddings_hip.h 2025-05-07T19:43:00.8307194Z Removing fbgemm_gpu/src/topology_utils_hip.cpp 2025-05-07T19:43:00.8307625Z Removing fbgemm_gpu/test/tbe/utils/cpu_kernel_test_hip.cpp 2025-05-07T19:43:00.8308080Z Removing fbgemm_gpu/test/utils/kernel_launcher_test.hip 2025-05-07T19:43:00.8308526Z Removing fbgemm_gpu/test/utils/stochastic_rounding_test.hip 2025-05-07T19:43:00.8308990Z Removing fbgemm_gpu/test/utils/tensor_accessor2_test.hip 2025-05-07T19:43:00.8309475Z Removing fbgemm_gpu/test/utils/tensor_accessor_builder_test.hip 2025-05-07T19:43:00.8310048Z Removing fbgemm_gpu/test/utils/tensor_accessor_builder_with_memcheck_test.hip 2025-05-07T19:43:00.8310577Z Removing fbgemm_gpu/test/utils/tensor_accessor_test.hip 2025-05-07T19:43:00.8311082Z Removing fbgemm_gpu/test/utils/tensor_accessor_with_memcheck_test.hip 2025-05-07T19:43:00.8311572Z Removing fbgemm_gpu/test/utils/weight_row_test.hip 
2025-05-07T19:43:00.8313691Z [command]/usr/bin/git reset --hard HEAD 2025-05-07T19:43:00.9206940Z HEAD is now at 1c9ad64 Merge f6528e7b1e8f5602e7dba30cd73b48ae6630981c into fd4df5f456e0cca514bacd98a39efb72990fd9f4 2025-05-07T19:43:00.9210762Z ##[endgroup] 2025-05-07T19:43:00.9212492Z ##[group]Disabling automatic garbage collection 2025-05-07T19:43:00.9217179Z [command]/usr/bin/git config --local gc.auto 0 2025-05-07T19:43:00.9245440Z ##[endgroup] 2025-05-07T19:43:00.9246522Z ##[group]Setting up auth 2025-05-07T19:43:00.9250192Z [command]/usr/bin/git config --local --name-only --get-regexp core\.sshCommand 2025-05-07T19:43:00.9277160Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'core\.sshCommand' && git config --local --unset-all 'core.sshCommand' || :" 2025-05-07T19:43:00.9552360Z Entering 'external/asmjit' 2025-05-07T19:43:00.9605137Z Entering 'external/composable_kernel' 2025-05-07T19:43:00.9662558Z Entering 'external/cpuinfo' 2025-05-07T19:43:00.9730103Z Entering 'external/cutlass' 2025-05-07T19:43:00.9802809Z Entering 'external/googletest' 2025-05-07T19:43:00.9851090Z Entering 'external/hipify_torch' 2025-05-07T19:43:00.9898879Z Entering 'external/json' 2025-05-07T19:43:00.9987024Z [command]/usr/bin/git config --local --name-only --get-regexp http\.https\:\/\/github\.com\/\.extraheader 2025-05-07T19:43:01.0011149Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'http\.https\:\/\/github\.com\/\.extraheader' && git config --local --unset-all 'http.https://github.com/.extraheader' || :" 2025-05-07T19:43:01.0280038Z Entering 'external/asmjit' 2025-05-07T19:43:01.0347776Z Entering 'external/composable_kernel' 2025-05-07T19:43:01.0404162Z Entering 'external/cpuinfo' 2025-05-07T19:43:01.0455710Z Entering 'external/cutlass' 2025-05-07T19:43:01.0508848Z Entering 'external/googletest' 2025-05-07T19:43:01.0558829Z Entering 'external/hipify_torch' 
2025-05-07T19:43:01.0607680Z Entering 'external/json' 2025-05-07T19:43:01.0667966Z [command]/usr/bin/git config --local http.https://github.com/.extraheader AUTHORIZATION: basic *** 2025-05-07T19:43:01.0700401Z ##[endgroup] 2025-05-07T19:43:01.0700870Z ##[group]Fetching the repository 2025-05-07T19:43:01.0711283Z [command]/usr/bin/git -c protocol.version=2 fetch --no-tags --prune --no-recurse-submodules --depth=1 origin +a2f4c52051596e74bc8c16e3d2867a4ecdd271e0:refs/remotes/pull/4066/merge 2025-05-07T19:43:01.2156713Z From https://github.com/pytorch/FBGEMM 2025-05-07T19:43:01.2157883Z + 1c9ad64...a2f4c52 a2f4c52051596e74bc8c16e3d2867a4ecdd271e0 -> pull/4066/merge (forced update) 2025-05-07T19:43:01.2174933Z ##[endgroup] 2025-05-07T19:43:01.2175371Z ##[group]Determining the checkout info 2025-05-07T19:43:01.2175832Z ##[endgroup] 2025-05-07T19:43:01.2178374Z [command]/usr/bin/git sparse-checkout disable 2025-05-07T19:43:01.2679970Z [command]/usr/bin/git config --local --unset-all extensions.worktreeConfig 2025-05-07T19:43:01.2703615Z ##[group]Checking out the ref 2025-05-07T19:43:01.2704276Z [command]/usr/bin/git checkout --progress --force refs/remotes/pull/4066/merge 2025-05-07T19:43:01.2775906Z Warning: you are leaving 1 commit behind, not connected to 2025-05-07T19:43:01.2777066Z any of your branches: 2025-05-07T19:43:01.2777532Z 2025-05-07T19:43:01.2778660Z 1c9ad64 Merge f6528e7b1e8f5602e7dba30cd73b48ae6630981c into fd4df5f456e0cca514bacd98a39efb72990fd9f4 2025-05-07T19:43:01.2779593Z 2025-05-07T19:43:01.2779993Z If you want to keep it by creating a new branch, this may be a good time 2025-05-07T19:43:01.2780561Z to do so with: 2025-05-07T19:43:01.2780694Z 2025-05-07T19:43:01.2780837Z git branch 1c9ad64 2025-05-07T19:43:01.2781045Z 2025-05-07T19:43:01.2781449Z HEAD is now at a2f4c52 Merge 6060cd4b5f971680caecdcc657faccb5720d1c3e into fd4df5f456e0cca514bacd98a39efb72990fd9f4 2025-05-07T19:43:01.2782891Z ##[endgroup] 2025-05-07T19:43:01.2783371Z ##[group]Setting 
up auth for fetching submodules 2025-05-07T19:43:01.2783990Z [command]/usr/bin/git config --global http.https://github.com/.extraheader AUTHORIZATION: basic *** 2025-05-07T19:43:01.2821859Z [command]/usr/bin/git config --global --unset-all url.https://github.com/.insteadOf 2025-05-07T19:43:01.2841584Z [command]/usr/bin/git config --global --add url.https://github.com/.insteadOf git@github.com: 2025-05-07T19:43:01.2863157Z [command]/usr/bin/git config --global --add url.https://github.com/.insteadOf org-21003710@github.com: 2025-05-07T19:43:01.2884557Z ##[endgroup] 2025-05-07T19:43:01.2884956Z ##[group]Fetching submodules 2025-05-07T19:43:01.2885532Z [command]/usr/bin/git submodule sync 2025-05-07T19:43:01.3182200Z Synchronizing submodule url for 'external/asmjit' 2025-05-07T19:43:01.3182704Z Synchronizing submodule url for 'external/composable_kernel' 2025-05-07T19:43:01.3183189Z Synchronizing submodule url for 'external/cpuinfo' 2025-05-07T19:43:01.3183589Z Synchronizing submodule url for 'external/cutlass' 2025-05-07T19:43:01.3184013Z Synchronizing submodule url for 'external/googletest' 2025-05-07T19:43:01.3184447Z Synchronizing submodule url for 'external/hipify_torch' 2025-05-07T19:43:01.3184868Z Synchronizing submodule url for 'external/json' 2025-05-07T19:43:01.3188656Z [command]/usr/bin/git -c protocol.version=2 submodule update --init --force --depth=1 2025-05-07T19:43:01.3961376Z Submodule path 'external/asmjit': checked out 'e5d7c0bd5d9aec44d68830187138149e6a8c4e32' 2025-05-07T19:43:01.6642561Z Submodule path 'external/composable_kernel': checked out '4a61bdd4bd4ed730e078aebc7c0fcf046ff29406' 2025-05-07T19:43:01.7651767Z Submodule path 'external/cpuinfo': checked out '6543fec09b2f04ac4a666882998b534afc9c1349' 2025-05-07T19:43:02.4308626Z Submodule path 'external/cutlass': checked out '3ed8d2ec4ba35ef5d9d8353826209b6f868f63d3' 2025-05-07T19:43:02.4738161Z Submodule path 'external/googletest': checked out 'f8d7d77c06936315286eb55f8de22cd23c188571' 
2025-05-07T19:43:02.4822924Z Submodule path 'external/hipify_torch': checked out '420084499c7c1e1c2d801922f40df202eac5f3a0' 2025-05-07T19:43:02.5959043Z Submodule path 'external/json': checked out '9cca280a4d0ccf0c08f47a99aa71d1b0e52f8d03' 2025-05-07T19:43:02.5969612Z [command]/usr/bin/git submodule foreach git config --local gc.auto 0 2025-05-07T19:43:02.6311770Z Entering 'external/asmjit' 2025-05-07T19:43:02.6339882Z Entering 'external/composable_kernel' 2025-05-07T19:43:02.6373613Z Entering 'external/cpuinfo' 2025-05-07T19:43:02.6403474Z Entering 'external/cutlass' 2025-05-07T19:43:02.6432244Z Entering 'external/googletest' 2025-05-07T19:43:02.6462132Z Entering 'external/hipify_torch' 2025-05-07T19:43:02.6491827Z Entering 'external/json' 2025-05-07T19:43:02.6535673Z ##[endgroup] 2025-05-07T19:43:02.6536093Z ##[group]Persisting credentials for submodules 2025-05-07T19:43:02.6541485Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'url\.https\:\/\/github\.com\/\.insteadOf' && git config --local --unset-all 'url.https://github.com/.insteadOf' || :" 2025-05-07T19:43:02.6844854Z Entering 'external/asmjit' 2025-05-07T19:43:02.6894384Z Entering 'external/composable_kernel' 2025-05-07T19:43:02.6957331Z Entering 'external/cpuinfo' 2025-05-07T19:43:02.7021524Z Entering 'external/cutlass' 2025-05-07T19:43:02.7087918Z Entering 'external/googletest' 2025-05-07T19:43:02.7144528Z Entering 'external/hipify_torch' 2025-05-07T19:43:02.7199273Z Entering 'external/json' 2025-05-07T19:43:02.7273787Z [command]/usr/bin/git submodule foreach sh -c "git config --local 'http.https://github.com/.extraheader' 'AUTHORIZATION: basic ***' && git config --local --show-origin --name-only --get-regexp remote.origin.url" 2025-05-07T19:43:02.7558847Z Entering 'external/asmjit' 2025-05-07T19:43:02.7604077Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/asmjit/config remote.origin.url 2025-05-07T19:43:02.7604667Z Entering 
'external/composable_kernel' 2025-05-07T19:43:02.7654671Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/composable_kernel/config remote.origin.url 2025-05-07T19:43:02.7655276Z Entering 'external/cpuinfo' 2025-05-07T19:43:02.7708911Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/cpuinfo/config remote.origin.url 2025-05-07T19:43:02.7711019Z Entering 'external/cutlass' 2025-05-07T19:43:02.7760938Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/cutlass/config remote.origin.url 2025-05-07T19:43:02.7761489Z Entering 'external/googletest' 2025-05-07T19:43:02.7809101Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/googletest/config remote.origin.url 2025-05-07T19:43:02.7809679Z Entering 'external/hipify_torch' 2025-05-07T19:43:02.7862323Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/hipify_torch/config remote.origin.url 2025-05-07T19:43:02.7865162Z Entering 'external/json' 2025-05-07T19:43:02.7918278Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/json/config remote.origin.url 2025-05-07T19:43:02.8012105Z [command]/usr/bin/git submodule foreach git config --local --add 'url.https://github.com/.insteadOf' 'git@github.com:' 2025-05-07T19:43:02.8305538Z Entering 'external/asmjit' 2025-05-07T19:43:02.8340874Z Entering 'external/composable_kernel' 2025-05-07T19:43:02.8369823Z Entering 'external/cpuinfo' 2025-05-07T19:43:02.8393828Z Entering 'external/cutlass' 2025-05-07T19:43:02.8422077Z Entering 'external/googletest' 2025-05-07T19:43:02.8449401Z Entering 'external/hipify_torch' 2025-05-07T19:43:02.8471278Z Entering 'external/json' 2025-05-07T19:43:02.8509891Z [command]/usr/bin/git submodule foreach git config --local --add 'url.https://github.com/.insteadOf' 'org-21003710@github.com:' 2025-05-07T19:43:02.8791553Z Entering 'external/asmjit' 2025-05-07T19:43:02.8823301Z Entering 'external/composable_kernel' 2025-05-07T19:43:02.8851437Z Entering 'external/cpuinfo' 2025-05-07T19:43:02.8877304Z Entering 'external/cutlass' 2025-05-07T19:43:02.8904646Z Entering 'external/googletest' 
2025-05-07T19:43:02.8932502Z Entering 'external/hipify_torch' 2025-05-07T19:43:02.8970560Z Entering 'external/json' 2025-05-07T19:43:02.9007017Z ##[endgroup] 2025-05-07T19:43:02.9034669Z [command]/usr/bin/git log -1 --format=%H 2025-05-07T19:43:02.9056817Z a2f4c52051596e74bc8c16e3d2867a4ecdd271e0 2025-05-07T19:43:02.9211309Z ##[group]Run . $PRELUDE; print_system_info 2025-05-07T19:43:02.9211713Z . $PRELUDE; print_system_info 2025-05-07T19:43:02.9212227Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:43:02.9212581Z env: 2025-05-07T19:43:02.9212843Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:43:02.9213155Z BUILD_ENV: build_binary 2025-05-07T19:43:02.9213432Z BUILD_TARGET: default 2025-05-07T19:43:02.9213670Z BUILD_VARIANT: cuda 2025-05-07T19:43:02.9213942Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:43:02.9214196Z ##[endgroup] 2025-05-07T19:43:03.3686814Z ################################################################################ 2025-05-07T19:43:03.3687268Z # Print System Info 2025-05-07T19:43:03.3687556Z # 2025-05-07T19:43:03.3709483Z # [2025-05-07T19:43:03.370Z] + print_system_info 2025-05-07T19:43:03.3710537Z ################################################################################ 2025-05-07T19:43:03.3711251Z 2025-05-07T19:43:03.3711720Z ################################################################################ 2025-05-07T19:43:03.3712680Z [INFO] Printing environment variables ... 
2025-05-07T19:43:03.3713038Z + printenv 2025-05-07T19:43:03.3713171Z 2025-05-07T19:43:03.3728088Z GITHUB_WORKSPACE=/__w/FBGEMM/FBGEMM 2025-05-07T19:43:03.3729162Z BUILD_VARIANT=cuda 2025-05-07T19:43:03.3729874Z HOSTNAME=8b1652485f5b 2025-05-07T19:43:03.3731133Z GITHUB_PATH=/__w/_temp/_runner_file_commands/add_path_76520ffd-fbca-4028-99eb-d9b31341b8f7 2025-05-07T19:43:03.3732599Z GITHUB_ACTION=__run_2 2025-05-07T19:43:03.3733320Z GITHUB_RUN_NUMBER=10601 2025-05-07T19:43:03.3734046Z RUNNER_NAME=i-0fa4abaec4f96f332 2025-05-07T19:43:03.3734900Z GITHUB_REPOSITORY_OWNER_ID=21003710 2025-05-07T19:43:03.3735816Z PLATFORM_NAME_LC=linux-x86_64 2025-05-07T19:43:03.3736707Z MACHINE_NAME_LC=x86_64 2025-05-07T19:43:03.3736976Z GITHUB_TRIGGERING_ACTOR=q10 2025-05-07T19:43:03.3737301Z PRELUDE=.github/scripts/setup_env.bash 2025-05-07T19:43:03.3737623Z GITHUB_REF_TYPE=branch 2025-05-07T19:43:03.3738188Z *** 2025-05-07T19:43:03.3738448Z GITHUB_REPOSITORY_ID=150154628 2025-05-07T19:43:03.3738735Z GITHUB_ACTIONS=true 2025-05-07T19:43:03.3739063Z GITHUB_SHA=a2f4c52051596e74bc8c16e3d2867a4ecdd271e0 2025-05-07T19:43:03.3739768Z GITHUB_WORKFLOW_REF=pytorch/FBGEMM/.github/workflows/fbgemm_gpu_ci_cuda.yml@refs/pull/4066/merge 2025-05-07T19:43:03.3740431Z RUNNER_ENVIRONMENT=self-hosted 2025-05-07T19:43:03.3740737Z GITHUB_REF=refs/pull/4066/merge 2025-05-07T19:43:03.3741060Z RUNNER_OS=Linux 2025-05-07T19:43:03.3741307Z GITHUB_REF_PROTECTED=false 2025-05-07T19:43:03.3741607Z HOME=/github/home 2025-05-07T19:43:03.3741881Z GITHUB_API_URL=https://api.github.com 2025-05-07T19:43:03.3742232Z RUNNER_ARCH=X64 2025-05-07T19:43:03.3742497Z RUNNER_TEMP=/__w/_temp 2025-05-07T19:43:03.3742763Z BUILD_TARGET=default 2025-05-07T19:43:03.3743242Z GITHUB_STATE=/__w/_temp/_runner_file_commands/save_state_76520ffd-fbca-4028-99eb-d9b31341b8f7 2025-05-07T19:43:03.3743962Z GITHUB_ENV=/__w/_temp/_runner_file_commands/set_env_76520ffd-fbca-4028-99eb-d9b31341b8f7 2025-05-07T19:43:03.3744521Z 
GITHUB_EVENT_PATH=/github/workflow/event.json 2025-05-07T19:43:03.3744885Z GITHUB_EVENT_NAME=pull_request 2025-05-07T19:43:03.3745219Z GITHUB_RUN_ID=14891846252 2025-05-07T19:43:03.3745731Z GITHUB_STEP_SUMMARY=/__w/_temp/_runner_file_commands/step_summary_76520ffd-fbca-4028-99eb-d9b31341b8f7 2025-05-07T19:43:03.3746316Z BUILD_ENV=build_binary 2025-05-07T19:43:03.3746578Z GITHUB_ACTOR=q10 2025-05-07T19:43:03.3746864Z GITHUB_RUN_ATTEMPT=1 2025-05-07T19:43:03.3747123Z KERN_NAME_LC=linux 2025-05-07T19:43:03.3747411Z BUILD_CUDA_VERSION=12.6.3 2025-05-07T19:43:03.3747782Z GITHUB_GRAPHQL_URL=https://api.github.com/graphql 2025-05-07T19:43:03.3748170Z PLATFORM_NAME=Linux-x86_64 2025-05-07T19:43:03.3752519Z GITHUB_SERVER_URL=https://github.com 2025-05-07T19:43:03.3752837Z SHLVL=1 2025-05-07T19:43:03.3753100Z GITHUB_ACTOR_ID=255046 2025-05-07T19:43:03.3753375Z RUNNER_TOOL_CACHE=/__w/_tool 2025-05-07T19:43:03.3753937Z GITHUB_WORKFLOW_SHA=6060cd4b5f971680caecdcc657faccb5720d1c3e 2025-05-07T19:43:03.3754354Z GITHUB_REF_NAME=4066/merge 2025-05-07T19:43:03.3754654Z KERN_NAME=Linux 2025-05-07T19:43:03.3754909Z GITHUB_JOB=build_artifact 2025-05-07T19:43:03.3755235Z GITHUB_REPOSITORY=pytorch/FBGEMM 2025-05-07T19:43:03.3755538Z GITHUB_RETENTION_DAYS=90 2025-05-07T19:43:03.3755843Z RUNNER_WORKSPACE=/__w/FBGEMM 2025-05-07T19:43:03.3756165Z GITHUB_ACTION_REPOSITORY= 2025-05-07T19:43:03.3756540Z PATH=/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin 2025-05-07T19:43:03.3757086Z GITHUB_BASE_REF=main 2025-05-07T19:43:03.3757321Z CI=true 2025-05-07T19:43:03.3757572Z GITHUB_REPOSITORY_OWNER=pytorch 2025-05-07T19:43:03.3757871Z GITHUB_HEAD_REF=bm/genai-rocm-oss-6 2025-05-07T19:43:03.3758196Z GITHUB_ACTION_REF= 2025-05-07T19:43:03.3758464Z GITHUB_WORKFLOW=FBGEMM GPU/GenAI CUDA CI 2025-05-07T19:43:03.3759006Z GITHUB_OUTPUT=/__w/_temp/_runner_file_commands/set_output_76520ffd-fbca-4028-99eb-d9b31341b8f7 2025-05-07T19:43:03.3759539Z MACHINE_NAME=x86_64 2025-05-07T19:43:03.3759786Z 
_=/usr/bin/printenv 2025-05-07T19:43:03.3759934Z 2025-05-07T19:43:03.3760087Z ################################################################################ 2025-05-07T19:43:03.3760434Z [INFO] Print ldd version ... 2025-05-07T19:43:03.3760742Z + ldd --version 2025-05-07T19:43:03.3760880Z 2025-05-07T19:43:03.3760999Z ldd (GNU libc) 2.34 2025-05-07T19:43:03.3761373Z Copyright (C) 2021 Free Software Foundation, Inc. 2025-05-07T19:43:03.3762051Z This is free software; see the source for copying conditions. There is NO 2025-05-07T19:43:03.3762669Z warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. 2025-05-07T19:43:03.3763207Z Written by Roland McGrath and Ulrich Drepper. 2025-05-07T19:43:03.3763454Z 2025-05-07T19:43:03.3763586Z ################################################################################ 2025-05-07T19:43:03.3763959Z [INFO] Print CPU info ... 2025-05-07T19:43:03.3764218Z + nproc 2025-05-07T19:43:03.3764370Z 2025-05-07T19:43:03.3764472Z 96 2025-05-07T19:43:03.3764590Z 2025-05-07T19:43:03.3764708Z + lscpu 2025-05-07T19:43:03.3764828Z 2025-05-07T19:43:03.4033100Z Architecture: x86_64 2025-05-07T19:43:03.4034197Z CPU op-mode(s): 32-bit, 64-bit 2025-05-07T19:43:03.4034641Z Address sizes: 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4035106Z Byte Order: Little Endian 2025-05-07T19:43:03.4035487Z CPU(s): 96 2025-05-07T19:43:03.4035814Z On-line CPU(s) list: 0-95 2025-05-07T19:43:03.4036189Z Vendor ID: GenuineIntel 2025-05-07T19:43:03.4036713Z Model name: Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4037255Z CPU family: 6 2025-05-07T19:43:03.4037544Z Model: 85 2025-05-07T19:43:03.4037860Z Thread(s) per core: 2 2025-05-07T19:43:03.4038160Z Core(s) per socket: 24 2025-05-07T19:43:03.4038482Z Socket(s): 2 2025-05-07T19:43:03.4038793Z Stepping: 7 2025-05-07T19:43:03.4039091Z BogoMIPS: 5999.99 2025-05-07T19:43:03.4041417Z Flags: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush 
mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.4043958Z Hypervisor vendor: KVM 2025-05-07T19:43:03.4044377Z Virtualization type: full 2025-05-07T19:43:03.4044758Z L1d cache: 1.5 MiB (48 instances) 2025-05-07T19:43:03.4045161Z L1i cache: 1.5 MiB (48 instances) 2025-05-07T19:43:03.4045532Z L2 cache: 48 MiB (48 instances) 2025-05-07T19:43:03.4045973Z L3 cache: 71.5 MiB (2 instances) 2025-05-07T19:43:03.4046465Z NUMA node(s): 2 2025-05-07T19:43:03.4046861Z NUMA node0 CPU(s): 0-23,48-71 2025-05-07T19:43:03.4047214Z NUMA node1 CPU(s): 24-47,72-95 2025-05-07T19:43:03.4047718Z Vulnerability Gather data sampling: Unknown: Dependent on hypervisor status 2025-05-07T19:43:03.4048350Z Vulnerability Itlb multihit: KVM: Mitigation: VMX unsupported 2025-05-07T19:43:03.4048867Z Vulnerability L1tf: Mitigation; PTE Inversion 2025-05-07T19:43:03.4049698Z Vulnerability Mds: Vulnerable: Clear CPU buffers attempted, no microcode; SMT Host state unknown 2025-05-07T19:43:03.4050340Z Vulnerability Meltdown: Mitigation; PTI 2025-05-07T19:43:03.4051074Z Vulnerability Mmio stale data: Vulnerable: Clear CPU buffers attempted, no microcode; SMT Host state unknown 2025-05-07T19:43:03.4051744Z Vulnerability Reg file data sampling: Not affected 2025-05-07T19:43:03.4052187Z Vulnerability Retbleed: Vulnerable 2025-05-07T19:43:03.4052610Z Vulnerability Spec rstack overflow: Not affected 2025-05-07T19:43:03.4053010Z Vulnerability Spec store bypass: Vulnerable 2025-05-07T19:43:03.4053643Z Vulnerability Spectre v1: 
Mitigation; usercopy/swapgs barriers and __user pointer sanitization 2025-05-07T19:43:03.4054556Z Vulnerability Spectre v2: Mitigation; Retpolines; STIBP disabled; RSB filling; PBRSB-eIBRS Not affected; BHI Retpoline 2025-05-07T19:43:03.4055303Z Vulnerability Srbds: Not affected 2025-05-07T19:43:03.4055714Z Vulnerability Tsx async abort: Not affected 2025-05-07T19:43:03.4055993Z 2025-05-07T19:43:03.4056083Z + cat /proc/cpuinfo 2025-05-07T19:43:03.4056223Z 2025-05-07T19:43:03.4056521Z processor : 0 2025-05-07T19:43:03.4056748Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4057006Z cpu family : 6 2025-05-07T19:43:03.4057218Z model : 85 2025-05-07T19:43:03.4057526Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4057893Z stepping : 7 2025-05-07T19:43:03.4058119Z microcode : 0x5003901 2025-05-07T19:43:03.4058368Z cpu MHz : 2999.998 2025-05-07T19:43:03.4058588Z cache size : 36608 KB 2025-05-07T19:43:03.4058831Z physical id : 0 2025-05-07T19:43:03.4059054Z siblings : 48 2025-05-07T19:43:03.4059273Z core id : 0 2025-05-07T19:43:03.4059576Z cpu cores : 24 2025-05-07T19:43:03.4059816Z apicid : 0 2025-05-07T19:43:03.4060017Z initial apicid : 0 2025-05-07T19:43:03.4060250Z fpu : yes 2025-05-07T19:43:03.4060485Z fpu_exception : yes 2025-05-07T19:43:03.4060837Z cpuid level : 13 2025-05-07T19:43:03.4061073Z wp : yes 2025-05-07T19:43:03.4063485Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4066311Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4067051Z bogomips : 5999.99 2025-05-07T19:43:03.4067297Z clflush size : 64 2025-05-07T19:43:03.4067571Z cache_alignment : 64 2025-05-07T19:43:03.4067873Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4068310Z power management: 2025-05-07T19:43:03.4068460Z 2025-05-07T19:43:03.4068544Z processor : 1 2025-05-07T19:43:03.4068787Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4069033Z cpu family : 6 2025-05-07T19:43:03.4069256Z model : 85 2025-05-07T19:43:03.4069541Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4069906Z stepping : 7 2025-05-07T19:43:03.4070140Z microcode : 0x5003901 2025-05-07T19:43:03.4070369Z cpu MHz : 2999.998 2025-05-07T19:43:03.4070601Z cache size : 36608 KB 2025-05-07T19:43:03.4070835Z physical id : 0 2025-05-07T19:43:03.4071056Z siblings : 48 2025-05-07T19:43:03.4071286Z core id : 1 2025-05-07T19:43:03.4071532Z cpu cores : 24 2025-05-07T19:43:03.4071764Z apicid : 2 2025-05-07T19:43:03.4072018Z initial apicid : 2 2025-05-07T19:43:03.4072253Z fpu : yes 2025-05-07T19:43:03.4072512Z fpu_exception : yes 2025-05-07T19:43:03.4072754Z cpuid level : 13 2025-05-07T19:43:03.4073023Z wp : yes 2025-05-07T19:43:03.4075434Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4078206Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4078863Z bogomips : 5999.99 2025-05-07T19:43:03.4079140Z clflush size : 64 2025-05-07T19:43:03.4079390Z cache_alignment : 64 2025-05-07T19:43:03.4079713Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4080074Z power management: 2025-05-07T19:43:03.4080263Z 2025-05-07T19:43:03.4080366Z processor : 2 2025-05-07T19:43:03.4080605Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4080903Z cpu family : 6 2025-05-07T19:43:03.4081126Z model : 85 2025-05-07T19:43:03.4081457Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4081870Z stepping : 7 2025-05-07T19:43:03.4082256Z microcode : 0x5003901 2025-05-07T19:43:03.4082548Z cpu MHz : 3236.624 2025-05-07T19:43:03.4082804Z cache size : 36608 KB 2025-05-07T19:43:03.4083199Z physical id : 0 2025-05-07T19:43:03.4083427Z siblings : 48 2025-05-07T19:43:03.4083670Z core id : 2 2025-05-07T19:43:03.4083888Z cpu cores : 24 2025-05-07T19:43:03.4084132Z apicid : 4 2025-05-07T19:43:03.4084352Z initial apicid : 4 2025-05-07T19:43:03.4084611Z fpu : yes 2025-05-07T19:43:03.4084848Z fpu_exception : yes 2025-05-07T19:43:03.4085116Z cpuid level : 13 2025-05-07T19:43:03.4085344Z wp : yes 2025-05-07T19:43:03.4087747Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4090528Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4091192Z bogomips : 5999.99 2025-05-07T19:43:03.4091437Z clflush size : 64 2025-05-07T19:43:03.4091815Z cache_alignment : 64 2025-05-07T19:43:03.4092110Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4092478Z power management: 2025-05-07T19:43:03.4092624Z 2025-05-07T19:43:03.4092722Z processor : 3 2025-05-07T19:43:03.4093063Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4093332Z cpu family : 6 2025-05-07T19:43:03.4093592Z model : 85 2025-05-07T19:43:03.4093898Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4094321Z stepping : 7 2025-05-07T19:43:03.4094586Z microcode : 0x5003901 2025-05-07T19:43:03.4094840Z cpu MHz : 3263.596 2025-05-07T19:43:03.4095110Z cache size : 36608 KB 2025-05-07T19:43:03.4095365Z physical id : 0 2025-05-07T19:43:03.4095633Z siblings : 48 2025-05-07T19:43:03.4095861Z core id : 3 2025-05-07T19:43:03.4096117Z cpu cores : 24 2025-05-07T19:43:03.4096352Z apicid : 6 2025-05-07T19:43:03.4096610Z initial apicid : 6 2025-05-07T19:43:03.4096854Z fpu : yes 2025-05-07T19:43:03.4097119Z fpu_exception : yes 2025-05-07T19:43:03.4097372Z cpuid level : 13 2025-05-07T19:43:03.4097637Z wp : yes 2025-05-07T19:43:03.4100112Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4102896Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4103526Z bogomips : 5999.99 2025-05-07T19:43:03.4103796Z clflush size : 64 2025-05-07T19:43:03.4104039Z cache_alignment : 64 2025-05-07T19:43:03.4104359Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4104714Z power management: 2025-05-07T19:43:03.4104885Z 2025-05-07T19:43:03.4104979Z processor : 4 2025-05-07T19:43:03.4105216Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4105616Z cpu family : 6 2025-05-07T19:43:03.4105834Z model : 85 2025-05-07T19:43:03.4106142Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4106533Z stepping : 7 2025-05-07T19:43:03.4106751Z microcode : 0x5003901 2025-05-07T19:43:03.4107014Z cpu MHz : 3240.289 2025-05-07T19:43:03.4107241Z cache size : 36608 KB 2025-05-07T19:43:03.4107504Z physical id : 0 2025-05-07T19:43:03.4107727Z siblings : 48 2025-05-07T19:43:03.4107969Z core id : 4 2025-05-07T19:43:03.4108182Z cpu cores : 24 2025-05-07T19:43:03.4108423Z apicid : 8 2025-05-07T19:43:03.4108636Z initial apicid : 8 2025-05-07T19:43:03.4108885Z fpu : yes 2025-05-07T19:43:03.4109099Z fpu_exception : yes 2025-05-07T19:43:03.4109367Z cpuid level : 13 2025-05-07T19:43:03.4109591Z wp : yes 2025-05-07T19:43:03.4111921Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4114626Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4115250Z bogomips : 5999.99 2025-05-07T19:43:03.4115485Z clflush size : 64 2025-05-07T19:43:03.4115744Z cache_alignment : 64 2025-05-07T19:43:03.4116033Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4116565Z power management: 2025-05-07T19:43:03.4116707Z 2025-05-07T19:43:03.4116799Z processor : 5 2025-05-07T19:43:03.4117060Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4117322Z cpu family : 6 2025-05-07T19:43:03.4117625Z model : 85 2025-05-07T19:43:03.4117943Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4118312Z stepping : 7 2025-05-07T19:43:03.4118529Z microcode : 0x5003901 2025-05-07T19:43:03.4118748Z cpu MHz : 2999.998 2025-05-07T19:43:03.4118972Z cache size : 36608 KB 2025-05-07T19:43:03.4119188Z physical id : 0 2025-05-07T19:43:03.4119406Z siblings : 48 2025-05-07T19:43:03.4119615Z core id : 5 2025-05-07T19:43:03.4119844Z cpu cores : 24 2025-05-07T19:43:03.4120058Z apicid : 10 2025-05-07T19:43:03.4120298Z initial apicid : 10 2025-05-07T19:43:03.4120527Z fpu : yes 2025-05-07T19:43:03.4120761Z fpu_exception : yes 2025-05-07T19:43:03.4120993Z cpuid level : 13 2025-05-07T19:43:03.4121238Z wp : yes 2025-05-07T19:43:03.4123569Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4126273Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4126877Z bogomips : 5999.99 2025-05-07T19:43:03.4127137Z clflush size : 64 2025-05-07T19:43:03.4127364Z cache_alignment : 64 2025-05-07T19:43:03.4127672Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4128012Z power management: 2025-05-07T19:43:03.4128178Z 2025-05-07T19:43:03.4128271Z processor : 6 2025-05-07T19:43:03.4128498Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4128772Z cpu family : 6 2025-05-07T19:43:03.4128988Z model : 85 2025-05-07T19:43:03.4129304Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4129692Z stepping : 7 2025-05-07T19:43:03.4129911Z microcode : 0x5003901 2025-05-07T19:43:03.4130172Z cpu MHz : 2999.998 2025-05-07T19:43:03.4130397Z cache size : 36608 KB 2025-05-07T19:43:03.4130657Z physical id : 0 2025-05-07T19:43:03.4130879Z siblings : 48 2025-05-07T19:43:03.4131122Z core id : 6 2025-05-07T19:43:03.4131331Z cpu cores : 24 2025-05-07T19:43:03.4131573Z apicid : 12 2025-05-07T19:43:03.4131790Z initial apicid : 12 2025-05-07T19:43:03.4132040Z fpu : yes 2025-05-07T19:43:03.4132256Z fpu_exception : yes 2025-05-07T19:43:03.4132519Z cpuid level : 13 2025-05-07T19:43:03.4132748Z wp : yes 2025-05-07T19:43:03.4135087Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4137792Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4138416Z bogomips : 5999.99 2025-05-07T19:43:03.4138651Z clflush size : 64 2025-05-07T19:43:03.4138906Z cache_alignment : 64 2025-05-07T19:43:03.4139191Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4139639Z power management: 2025-05-07T19:43:03.4139947Z 2025-05-07T19:43:03.4140117Z processor : 7 2025-05-07T19:43:03.4140387Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4140735Z cpu family : 6 2025-05-07T19:43:03.4140994Z model : 85 2025-05-07T19:43:03.4141317Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4141746Z stepping : 7 2025-05-07T19:43:03.4142001Z microcode : 0x5003901 2025-05-07T19:43:03.4142253Z cpu MHz : 3311.151 2025-05-07T19:43:03.4142521Z cache size : 36608 KB 2025-05-07T19:43:03.4142777Z physical id : 0 2025-05-07T19:43:03.4143045Z siblings : 48 2025-05-07T19:43:03.4143270Z core id : 7 2025-05-07T19:43:03.4143527Z cpu cores : 24 2025-05-07T19:43:03.4143756Z apicid : 14 2025-05-07T19:43:03.4144021Z initial apicid : 14 2025-05-07T19:43:03.4144267Z fpu : yes 2025-05-07T19:43:03.4165415Z fpu_exception : yes 2025-05-07T19:43:03.4165673Z cpuid level : 13 2025-05-07T19:43:03.4165937Z wp : yes 2025-05-07T19:43:03.4168244Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4170827Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4171433Z bogomips : 5999.99 2025-05-07T19:43:03.4171664Z clflush size : 64 2025-05-07T19:43:03.4171927Z cache_alignment : 64 2025-05-07T19:43:03.4172236Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4172572Z power management: 2025-05-07T19:43:03.4172711Z 2025-05-07T19:43:03.4172832Z processor : 8 2025-05-07T19:43:03.4173061Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4173335Z cpu family : 6 2025-05-07T19:43:03.4173550Z model : 85 2025-05-07T19:43:03.4173863Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4174220Z stepping : 7 2025-05-07T19:43:03.4174466Z microcode : 0x5003901 2025-05-07T19:43:03.4174703Z cpu MHz : 3185.174 2025-05-07T19:43:03.4174953Z cache size : 36608 KB 2025-05-07T19:43:03.4175207Z physical id : 0 2025-05-07T19:43:03.4175423Z siblings : 48 2025-05-07T19:43:03.4175655Z core id : 8 2025-05-07T19:43:03.4175863Z cpu cores : 24 2025-05-07T19:43:03.4176103Z apicid : 16 2025-05-07T19:43:03.4176316Z initial apicid : 16 2025-05-07T19:43:03.4176559Z fpu : yes 2025-05-07T19:43:03.4176765Z fpu_exception : yes 2025-05-07T19:43:03.4177015Z cpuid level : 13 2025-05-07T19:43:03.4177226Z wp : yes 2025-05-07T19:43:03.4179863Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4183037Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4183661Z bogomips : 5999.99 2025-05-07T19:43:03.4183930Z clflush size : 64 2025-05-07T19:43:03.4184200Z cache_alignment : 64 2025-05-07T19:43:03.4184495Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4184877Z power management: 2025-05-07T19:43:03.4185022Z 2025-05-07T19:43:03.4185120Z processor : 9 2025-05-07T19:43:03.4185387Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4185829Z cpu family : 6 2025-05-07T19:43:03.4186086Z model : 85 2025-05-07T19:43:03.4186388Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4186799Z stepping : 7 2025-05-07T19:43:03.4187030Z microcode : 0x5003901 2025-05-07T19:43:03.4187394Z cpu MHz : 2999.998 2025-05-07T19:43:03.4187667Z cache size : 36608 KB 2025-05-07T19:43:03.4187919Z physical id : 0 2025-05-07T19:43:03.4188182Z siblings : 48 2025-05-07T19:43:03.4188409Z core id : 9 2025-05-07T19:43:03.4188663Z cpu cores : 24 2025-05-07T19:43:03.4188896Z apicid : 18 2025-05-07T19:43:03.4189155Z initial apicid : 18 2025-05-07T19:43:03.4189390Z fpu : yes 2025-05-07T19:43:03.4189651Z fpu_exception : yes 2025-05-07T19:43:03.4189900Z cpuid level : 13 2025-05-07T19:43:03.4190156Z wp : yes 2025-05-07T19:43:03.4192564Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4195416Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4196044Z bogomips : 5999.99 2025-05-07T19:43:03.4196302Z clflush size : 64 2025-05-07T19:43:03.4196539Z cache_alignment : 64 2025-05-07T19:43:03.4196852Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4197194Z power management: 2025-05-07T19:43:03.4197335Z 2025-05-07T19:43:03.4197447Z processor : 10 2025-05-07T19:43:03.4197676Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4197945Z cpu family : 6 2025-05-07T19:43:03.4198163Z model : 85 2025-05-07T19:43:03.4198484Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4198858Z stepping : 7 2025-05-07T19:43:03.4199109Z microcode : 0x5003901 2025-05-07T19:43:03.4199334Z cpu MHz : 2999.998 2025-05-07T19:43:03.4199599Z cache size : 36608 KB 2025-05-07T19:43:03.4199868Z physical id : 0 2025-05-07T19:43:03.4200100Z siblings : 48 2025-05-07T19:43:03.4200345Z core id : 10 2025-05-07T19:43:03.4200560Z cpu cores : 24 2025-05-07T19:43:03.4200801Z apicid : 20 2025-05-07T19:43:03.4201016Z initial apicid : 20 2025-05-07T19:43:03.4201270Z fpu : yes 2025-05-07T19:43:03.4201477Z fpu_exception : yes 2025-05-07T19:43:03.4201733Z cpuid level : 13 2025-05-07T19:43:03.4201950Z wp : yes 2025-05-07T19:43:03.4204361Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4207196Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4207801Z bogomips : 5999.99 2025-05-07T19:43:03.4208061Z clflush size : 64 2025-05-07T19:43:03.4208323Z cache_alignment : 64 2025-05-07T19:43:03.4208618Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4208990Z power management: 2025-05-07T19:43:03.4209126Z 2025-05-07T19:43:03.4209223Z processor : 11 2025-05-07T19:43:03.4209482Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4209737Z cpu family : 6 2025-05-07T19:43:03.4209985Z model : 85 2025-05-07T19:43:03.4210275Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4210742Z stepping : 7 2025-05-07T19:43:03.4210971Z microcode : 0x5003901 2025-05-07T19:43:03.4211242Z cpu MHz : 3268.793 2025-05-07T19:43:03.4211503Z cache size : 36608 KB 2025-05-07T19:43:03.4211806Z physical id : 0 2025-05-07T19:43:03.4212057Z siblings : 48 2025-05-07T19:43:03.4212278Z core id : 11 2025-05-07T19:43:03.4212570Z cpu cores : 24 2025-05-07T19:43:03.4212788Z apicid : 22 2025-05-07T19:43:03.4213045Z initial apicid : 22 2025-05-07T19:43:03.4213281Z fpu : yes 2025-05-07T19:43:03.4213670Z fpu_exception : yes 2025-05-07T19:43:03.4213904Z cpuid level : 13 2025-05-07T19:43:03.4214149Z wp : yes 2025-05-07T19:43:03.4216486Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4219169Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4220033Z bogomips : 5999.99 2025-05-07T19:43:03.4220296Z clflush size : 64 2025-05-07T19:43:03.4220612Z cache_alignment : 64 2025-05-07T19:43:03.4220937Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4221278Z power management: 2025-05-07T19:43:03.4221424Z 2025-05-07T19:43:03.4221545Z processor : 12 2025-05-07T19:43:03.4221780Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4222069Z cpu family : 6 2025-05-07T19:43:03.4222289Z model : 85 2025-05-07T19:43:03.4222617Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4222996Z stepping : 7 2025-05-07T19:43:03.4223247Z microcode : 0x5003901 2025-05-07T19:43:03.4223496Z cpu MHz : 2999.998 2025-05-07T19:43:03.4223754Z cache size : 36608 KB 2025-05-07T19:43:03.4224024Z physical id : 0 2025-05-07T19:43:03.4224247Z siblings : 48 2025-05-07T19:43:03.4224479Z core id : 12 2025-05-07T19:43:03.4224696Z cpu cores : 24 2025-05-07T19:43:03.4224937Z apicid : 24 2025-05-07T19:43:03.4225155Z initial apicid : 24 2025-05-07T19:43:03.4225412Z fpu : yes 2025-05-07T19:43:03.4225630Z fpu_exception : yes 2025-05-07T19:43:03.4225891Z cpuid level : 13 2025-05-07T19:43:03.4226118Z wp : yes 2025-05-07T19:43:03.4228509Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4231288Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4231906Z bogomips : 5999.99 2025-05-07T19:43:03.4232332Z clflush size : 64 2025-05-07T19:43:03.4232609Z cache_alignment : 64 2025-05-07T19:43:03.4232895Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4233240Z power management: 2025-05-07T19:43:03.4233387Z 2025-05-07T19:43:03.4233478Z processor : 13 2025-05-07T19:43:03.4233752Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4234002Z cpu family : 6 2025-05-07T19:43:03.4234240Z model : 85 2025-05-07T19:43:03.4234531Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4234927Z stepping : 7 2025-05-07T19:43:03.4235225Z microcode : 0x5003901 2025-05-07T19:43:03.4235501Z cpu MHz : 2999.998 2025-05-07T19:43:03.4235760Z cache size : 36608 KB 2025-05-07T19:43:03.4236003Z physical id : 0 2025-05-07T19:43:03.4236244Z siblings : 48 2025-05-07T19:43:03.4236509Z core id : 13 2025-05-07T19:43:03.4236713Z cpu cores : 24 2025-05-07T19:43:03.4236939Z apicid : 26 2025-05-07T19:43:03.4237156Z initial apicid : 26 2025-05-07T19:43:03.4237415Z fpu : yes 2025-05-07T19:43:03.4237655Z fpu_exception : yes 2025-05-07T19:43:03.4237882Z cpuid level : 13 2025-05-07T19:43:03.4238127Z wp : yes 2025-05-07T19:43:03.4240433Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4243159Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4243785Z bogomips : 5999.99 2025-05-07T19:43:03.4244019Z clflush size : 64 2025-05-07T19:43:03.4244276Z cache_alignment : 64 2025-05-07T19:43:03.4244562Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4244924Z power management: 2025-05-07T19:43:03.4245065Z 2025-05-07T19:43:03.4245156Z processor : 14 2025-05-07T19:43:03.4245508Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4245774Z cpu family : 6 2025-05-07T19:43:03.4245980Z model : 85 2025-05-07T19:43:03.4246442Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4246860Z stepping : 7 2025-05-07T19:43:03.4247102Z microcode : 0x5003901 2025-05-07T19:43:03.4247341Z cpu MHz : 2999.998 2025-05-07T19:43:03.4247595Z cache size : 36608 KB 2025-05-07T19:43:03.4247834Z physical id : 0 2025-05-07T19:43:03.4248079Z siblings : 48 2025-05-07T19:43:03.4248291Z core id : 14 2025-05-07T19:43:03.4248525Z cpu cores : 24 2025-05-07T19:43:03.4248740Z apicid : 28 2025-05-07T19:43:03.4248979Z initial apicid : 28 2025-05-07T19:43:03.4249229Z fpu : yes 2025-05-07T19:43:03.4249442Z fpu_exception : yes 2025-05-07T19:43:03.4249695Z cpuid level : 13 2025-05-07T19:43:03.4249906Z wp : yes 2025-05-07T19:43:03.4252420Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4255200Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4255818Z bogomips : 5999.99 2025-05-07T19:43:03.4256078Z clflush size : 64 2025-05-07T19:43:03.4256312Z cache_alignment : 64 2025-05-07T19:43:03.4256628Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4256975Z power management: 2025-05-07T19:43:03.4257222Z 2025-05-07T19:43:03.4257316Z processor : 15 2025-05-07T19:43:03.4257577Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4257835Z cpu family : 6 2025-05-07T19:43:03.4258079Z model : 85 2025-05-07T19:43:03.4258373Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4258769Z stepping : 7 2025-05-07T19:43:03.4258996Z microcode : 0x5003901 2025-05-07T19:43:03.4259273Z cpu MHz : 2999.998 2025-05-07T19:43:03.4259652Z cache size : 36608 KB 2025-05-07T19:43:03.4259931Z physical id : 0 2025-05-07T19:43:03.4260164Z siblings : 48 2025-05-07T19:43:03.4260413Z core id : 15 2025-05-07T19:43:03.4260681Z cpu cores : 24 2025-05-07T19:43:03.4260930Z apicid : 30 2025-05-07T19:43:03.4261244Z initial apicid : 30 2025-05-07T19:43:03.4261480Z fpu : yes 2025-05-07T19:43:03.4261732Z fpu_exception : yes 2025-05-07T19:43:03.4261973Z cpuid level : 13 2025-05-07T19:43:03.4262230Z wp : yes 2025-05-07T19:43:03.4264608Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4267394Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4268044Z bogomips : 5999.99 2025-05-07T19:43:03.4268283Z clflush size : 64 2025-05-07T19:43:03.4268539Z cache_alignment : 64 2025-05-07T19:43:03.4268833Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4269206Z power management: 2025-05-07T19:43:03.4269354Z 2025-05-07T19:43:03.4269447Z processor : 16 2025-05-07T19:43:03.4269715Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4269999Z cpu family : 6 2025-05-07T19:43:03.4270222Z model : 85 2025-05-07T19:43:03.4270544Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4270915Z stepping : 7 2025-05-07T19:43:03.4271165Z microcode : 0x5003901 2025-05-07T19:43:03.4271408Z cpu MHz : 2999.998 2025-05-07T19:43:03.4271667Z cache size : 36608 KB 2025-05-07T19:43:03.4271921Z physical id : 0 2025-05-07T19:43:03.4272170Z siblings : 48 2025-05-07T19:43:03.4272385Z core id : 16 2025-05-07T19:43:03.4272627Z cpu cores : 24 2025-05-07T19:43:03.4272846Z apicid : 32 2025-05-07T19:43:03.4273088Z initial apicid : 32 2025-05-07T19:43:03.4273350Z fpu : yes 2025-05-07T19:43:03.4273566Z fpu_exception : yes 2025-05-07T19:43:03.4273822Z cpuid level : 13 2025-05-07T19:43:03.4274045Z wp : yes 2025-05-07T19:43:03.4276444Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4279213Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4280075Z bogomips : 5999.99 2025-05-07T19:43:03.4280354Z clflush size : 64 2025-05-07T19:43:03.4280612Z cache_alignment : 64 2025-05-07T19:43:03.4280925Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4281273Z power management: 2025-05-07T19:43:03.4281444Z 2025-05-07T19:43:03.4281539Z processor : 17 2025-05-07T19:43:03.4281826Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4282512Z cpu family : 6 2025-05-07T19:43:03.4282899Z model : 85 2025-05-07T19:43:03.4283252Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4283658Z stepping : 7 2025-05-07T19:43:03.4283888Z microcode : 0x5003901 2025-05-07T19:43:03.4284157Z cpu MHz : 2999.998 2025-05-07T19:43:03.4284395Z cache size : 36608 KB 2025-05-07T19:43:03.4284660Z physical id : 0 2025-05-07T19:43:03.4285004Z siblings : 48 2025-05-07T19:43:03.4285251Z core id : 17 2025-05-07T19:43:03.4285475Z cpu cores : 24 2025-05-07T19:43:03.4285732Z apicid : 34 2025-05-07T19:43:03.4285993Z initial apicid : 34 2025-05-07T19:43:03.4286234Z fpu : yes 2025-05-07T19:43:03.4286558Z fpu_exception : yes 2025-05-07T19:43:03.4286801Z cpuid level : 13 2025-05-07T19:43:03.4287055Z wp : yes 2025-05-07T19:43:03.4289425Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4292220Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4292869Z bogomips : 5999.99 2025-05-07T19:43:03.4293104Z clflush size : 64 2025-05-07T19:43:03.4293370Z cache_alignment : 64 2025-05-07T19:43:03.4293659Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4294028Z power management: 2025-05-07T19:43:03.4294171Z 2025-05-07T19:43:03.4294291Z processor : 18 2025-05-07T19:43:03.4294523Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4294803Z cpu family : 6 2025-05-07T19:43:03.4295025Z model : 85 2025-05-07T19:43:03.4295455Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4295819Z stepping : 7 2025-05-07T19:43:03.4296069Z microcode : 0x5003901 2025-05-07T19:43:03.4296305Z cpu MHz : 2999.998 2025-05-07T19:43:03.4296557Z cache size : 36608 KB 2025-05-07T19:43:03.4296789Z physical id : 0 2025-05-07T19:43:03.4297033Z siblings : 48 2025-05-07T19:43:03.4297243Z core id : 18 2025-05-07T19:43:03.4297477Z cpu cores : 24 2025-05-07T19:43:03.4297691Z apicid : 36 2025-05-07T19:43:03.4297924Z initial apicid : 36 2025-05-07T19:43:03.4298171Z fpu : yes 2025-05-07T19:43:03.4298382Z fpu_exception : yes 2025-05-07T19:43:03.4298634Z cpuid level : 13 2025-05-07T19:43:03.4298853Z wp : yes 2025-05-07T19:43:03.4301463Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4304240Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4304866Z bogomips : 5999.99 2025-05-07T19:43:03.4305129Z clflush size : 64 2025-05-07T19:43:03.4305366Z cache_alignment : 64 2025-05-07T19:43:03.4305688Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4306041Z power management: 2025-05-07T19:43:03.4306213Z 2025-05-07T19:43:03.4306309Z processor : 19 2025-05-07T19:43:03.4306577Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4306839Z cpu family : 6 2025-05-07T19:43:03.4307087Z model : 85 2025-05-07T19:43:03.4307384Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4307778Z stepping : 7 2025-05-07T19:43:03.4308000Z microcode : 0x5003901 2025-05-07T19:43:03.4308270Z cpu MHz : 2999.998 2025-05-07T19:43:03.4308503Z cache size : 36608 KB 2025-05-07T19:43:03.4308770Z physical id : 0 2025-05-07T19:43:03.4308992Z siblings : 48 2025-05-07T19:43:03.4309223Z core id : 19 2025-05-07T19:43:03.4309437Z cpu cores : 24 2025-05-07T19:43:03.4309735Z apicid : 38 2025-05-07T19:43:03.4309968Z initial apicid : 38 2025-05-07T19:43:03.4310192Z fpu : yes 2025-05-07T19:43:03.4310435Z fpu_exception : yes 2025-05-07T19:43:03.4310659Z cpuid level : 13 2025-05-07T19:43:03.4310873Z wp : yes 2025-05-07T19:43:03.4313432Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4315995Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4316777Z bogomips : 5999.99 2025-05-07T19:43:03.4317091Z clflush size : 64 2025-05-07T19:43:03.4317306Z cache_alignment : 64 2025-05-07T19:43:03.4317606Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4317969Z power management: 2025-05-07T19:43:03.4318125Z 2025-05-07T19:43:03.4318210Z processor : 20 2025-05-07T19:43:03.4318426Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4318741Z cpu family : 6 2025-05-07T19:43:03.4318954Z model : 85 2025-05-07T19:43:03.4319295Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4319721Z stepping : 7 2025-05-07T19:43:03.4320018Z microcode : 0x5003901 2025-05-07T19:43:03.4320226Z cpu MHz : 2999.998 2025-05-07T19:43:03.4320435Z cache size : 36608 KB 2025-05-07T19:43:03.4320642Z physical id : 0 2025-05-07T19:43:03.4320858Z siblings : 48 2025-05-07T19:43:03.4321054Z core id : 20 2025-05-07T19:43:03.4321244Z cpu cores : 24 2025-05-07T19:43:03.4321526Z apicid : 40 2025-05-07T19:43:03.4321750Z initial apicid : 40 2025-05-07T19:43:03.4321998Z fpu : yes 2025-05-07T19:43:03.4322321Z fpu_exception : yes 2025-05-07T19:43:03.4322528Z cpuid level : 13 2025-05-07T19:43:03.4322733Z wp : yes 2025-05-07T19:43:03.4324945Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4327481Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4328048Z bogomips : 5999.99 2025-05-07T19:43:03.4328270Z clflush size : 64 2025-05-07T19:43:03.4328468Z cache_alignment : 64 2025-05-07T19:43:03.4328737Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4329063Z power management: 2025-05-07T19:43:03.4329191Z 2025-05-07T19:43:03.4329270Z processor : 21 2025-05-07T19:43:03.4329488Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4329703Z cpu family : 6 2025-05-07T19:43:03.4329907Z model : 85 2025-05-07T19:43:03.4330160Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4330499Z stepping : 7 2025-05-07T19:43:03.4330696Z microcode : 0x5003901 2025-05-07T19:43:03.4330914Z cpu MHz : 3278.413 2025-05-07T19:43:03.4331119Z cache size : 36608 KB 2025-05-07T19:43:03.4331335Z physical id : 0 2025-05-07T19:43:03.4331531Z siblings : 48 2025-05-07T19:43:03.4331722Z core id : 21 2025-05-07T19:43:03.4331915Z cpu cores : 24 2025-05-07T19:43:03.4332094Z apicid : 42 2025-05-07T19:43:03.4332290Z initial apicid : 42 2025-05-07T19:43:03.4332538Z fpu : yes 2025-05-07T19:43:03.4332728Z fpu_exception : yes 2025-05-07T19:43:03.4332922Z cpuid level : 13 2025-05-07T19:43:03.4333138Z wp : yes 2025-05-07T19:43:03.4336156Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4338699Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4339273Z bogomips : 5999.99 2025-05-07T19:43:03.4339545Z clflush size : 64 2025-05-07T19:43:03.4339931Z cache_alignment : 64 2025-05-07T19:43:03.4340768Z lspci: Unable to load libkmod resources: error -2 2025-05-07T19:43:03.4341139Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4341488Z power management: 2025-05-07T19:43:03.4341622Z 2025-05-07T19:43:03.4341711Z processor : 22 2025-05-07T19:43:03.4341943Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4342211Z cpu family : 6 2025-05-07T19:43:03.4342420Z model : 85 2025-05-07T19:43:03.4342687Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4343061Z stepping : 7 2025-05-07T19:43:03.4343259Z microcode : 0x5003901 2025-05-07T19:43:03.4343491Z cpu MHz : 2999.998 2025-05-07T19:43:03.4343719Z cache size : 36608 KB 2025-05-07T19:43:03.4343934Z physical id : 0 2025-05-07T19:43:03.4344144Z siblings : 48 2025-05-07T19:43:03.4344349Z core id : 22 2025-05-07T19:43:03.4344557Z cpu cores : 24 2025-05-07T19:43:03.4344758Z apicid : 44 2025-05-07T19:43:03.4344969Z initial apicid : 44 2025-05-07T19:43:03.4345184Z fpu : yes 2025-05-07T19:43:03.4345386Z fpu_exception : yes 2025-05-07T19:43:03.4345589Z cpuid level : 13 2025-05-07T19:43:03.4345796Z wp : yes 2025-05-07T19:43:03.4348160Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw 
avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.4350889Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4351496Z bogomips : 5999.99 2025-05-07T19:43:03.4351714Z clflush size : 64 2025-05-07T19:43:03.4351930Z cache_alignment : 64 2025-05-07T19:43:03.4352296Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4352588Z power management: 2025-05-07T19:43:03.4352709Z 2025-05-07T19:43:03.4352794Z processor : 23 2025-05-07T19:43:03.4352988Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4353217Z cpu family : 6 2025-05-07T19:43:03.4353400Z model : 85 2025-05-07T19:43:03.4353673Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4353998Z stepping : 7 2025-05-07T19:43:03.4354209Z microcode : 0x5003901 2025-05-07T19:43:03.4354415Z cpu MHz : 2999.998 2025-05-07T19:43:03.4354625Z cache size : 36608 KB 2025-05-07T19:43:03.4354843Z physical id : 0 2025-05-07T19:43:03.4355034Z siblings : 48 2025-05-07T19:43:03.4355232Z core id : 23 2025-05-07T19:43:03.4355420Z cpu cores : 24 2025-05-07T19:43:03.4355618Z apicid : 46 2025-05-07T19:43:03.4355806Z initial apicid : 46 2025-05-07T19:43:03.4356005Z fpu : yes 2025-05-07T19:43:03.4356255Z fpu_exception : yes 2025-05-07T19:43:03.4356468Z cpuid level : 13 2025-05-07T19:43:03.4356661Z wp : yes 2025-05-07T19:43:03.4358898Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl 
xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.4361439Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4361988Z bogomips : 5999.99 2025-05-07T19:43:03.4362206Z clflush size : 64 2025-05-07T19:43:03.4362425Z cache_alignment : 64 2025-05-07T19:43:03.4362672Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4362996Z power management: 2025-05-07T19:43:03.4363120Z 2025-05-07T19:43:03.4363200Z processor : 24 2025-05-07T19:43:03.4363416Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4363628Z cpu family : 6 2025-05-07T19:43:03.4363828Z model : 85 2025-05-07T19:43:03.4364076Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4364420Z stepping : 7 2025-05-07T19:43:03.4364599Z microcode : 0x5003901 2025-05-07T19:43:03.4364826Z cpu MHz : 2999.998 2025-05-07T19:43:03.4365036Z cache size : 36608 KB 2025-05-07T19:43:03.4365240Z physical id : 1 2025-05-07T19:43:03.4365440Z siblings : 48 2025-05-07T19:43:03.4365629Z core id : 0 2025-05-07T19:43:03.4365818Z cpu cores : 24 2025-05-07T19:43:03.4366008Z apicid : 64 2025-05-07T19:43:03.4366206Z initial apicid : 64 2025-05-07T19:43:03.4366394Z fpu : yes 2025-05-07T19:43:03.4366581Z fpu_exception : yes 2025-05-07T19:43:03.4366774Z cpuid level : 13 2025-05-07T19:43:03.4366960Z wp : yes 2025-05-07T19:43:03.4369129Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt 
xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.4371638Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4372197Z bogomips : 5999.99 2025-05-07T19:43:03.4372412Z clflush size : 64 2025-05-07T19:43:03.4372606Z cache_alignment : 64 2025-05-07T19:43:03.4372870Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4373171Z power management: 2025-05-07T19:43:03.4373290Z 2025-05-07T19:43:03.4373380Z processor : 25 2025-05-07T19:43:03.4373574Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4373799Z cpu family : 6 2025-05-07T19:43:03.4373976Z model : 85 2025-05-07T19:43:03.4374233Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4374556Z stepping : 7 2025-05-07T19:43:03.4374769Z microcode : 0x5003901 2025-05-07T19:43:03.4374999Z cpu MHz : 2999.998 2025-05-07T19:43:03.4375241Z cache size : 36608 KB 2025-05-07T19:43:03.4375500Z physical id : 1 2025-05-07T19:43:03.4375712Z siblings : 48 2025-05-07T19:43:03.4375940Z core id : 1 2025-05-07T19:43:03.4376144Z cpu cores : 24 2025-05-07T19:43:03.4376370Z apicid : 66 2025-05-07T19:43:03.4376574Z initial apicid : 66 2025-05-07T19:43:03.4376813Z fpu : yes 2025-05-07T19:43:03.4377013Z fpu_exception : yes 2025-05-07T19:43:03.4377246Z cpuid level : 13 2025-05-07T19:43:03.4377497Z wp : yes 2025-05-07T19:43:03.4380007Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec 
xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.4382939Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4383553Z bogomips : 5999.99 2025-05-07T19:43:03.4383811Z clflush size : 64 2025-05-07T19:43:03.4384080Z cache_alignment : 64 2025-05-07T19:43:03.4384371Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4384743Z power management: 2025-05-07T19:43:03.4384888Z 2025-05-07T19:43:03.4384982Z processor : 26 2025-05-07T19:43:03.4385238Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4385497Z cpu family : 6 2025-05-07T19:43:03.4385744Z model : 85 2025-05-07T19:43:03.4386038Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4386442Z stepping : 7 2025-05-07T19:43:03.4386669Z microcode : 0x5003901 2025-05-07T19:43:03.4386942Z cpu MHz : 1213.257 2025-05-07T19:43:03.4387201Z cache size : 36608 KB 2025-05-07T19:43:03.4387445Z physical id : 1 2025-05-07T19:43:03.4387688Z siblings : 48 2025-05-07T19:43:03.4387906Z core id : 2 2025-05-07T19:43:03.4388143Z cpu cores : 24 2025-05-07T19:43:03.4388365Z apicid : 68 2025-05-07T19:43:03.4388602Z initial apicid : 68 2025-05-07T19:43:03.4388834Z fpu : yes 2025-05-07T19:43:03.4389076Z fpu_exception : yes 2025-05-07T19:43:03.4389309Z cpuid level : 13 2025-05-07T19:43:03.4389555Z wp : yes 2025-05-07T19:43:03.4391953Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 
xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.4394752Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4395356Z bogomips : 5999.99 2025-05-07T19:43:03.4395603Z clflush size : 64 2025-05-07T19:43:03.4395822Z cache_alignment : 64 2025-05-07T19:43:03.4396114Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4396436Z power management: 2025-05-07T19:43:03.4396571Z 2025-05-07T19:43:03.4396686Z processor : 27 2025-05-07T19:43:03.4396905Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4397167Z cpu family : 6 2025-05-07T19:43:03.4397370Z model : 85 2025-05-07T19:43:03.4397667Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4397999Z stepping : 7 2025-05-07T19:43:03.4398232Z microcode : 0x5003901 2025-05-07T19:43:03.4398463Z cpu MHz : 1192.970 2025-05-07T19:43:03.4398708Z cache size : 36608 KB 2025-05-07T19:43:03.4398960Z physical id : 1 2025-05-07T19:43:03.4399170Z siblings : 48 2025-05-07T19:43:03.4399400Z core id : 3 2025-05-07T19:43:03.4399606Z cpu cores : 24 2025-05-07T19:43:03.4399835Z apicid : 70 2025-05-07T19:43:03.4400039Z initial apicid : 70 2025-05-07T19:43:03.4400275Z fpu : yes 2025-05-07T19:43:03.4400472Z fpu_exception : yes 2025-05-07T19:43:03.4400709Z cpuid level : 13 2025-05-07T19:43:03.4400921Z wp : yes 2025-05-07T19:43:03.4403230Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida 
arat pku ospke avx512_vnni 2025-05-07T19:43:03.4405866Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4406465Z bogomips : 5999.99 2025-05-07T19:43:03.4406712Z clflush size : 64 2025-05-07T19:43:03.4406958Z cache_alignment : 64 2025-05-07T19:43:03.4407232Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4407583Z power management: 2025-05-07T19:43:03.4407722Z 2025-05-07T19:43:03.4407816Z processor : 28 2025-05-07T19:43:03.4408064Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4408311Z cpu family : 6 2025-05-07T19:43:03.4408550Z model : 85 2025-05-07T19:43:03.4408826Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4409209Z stepping : 7 2025-05-07T19:43:03.4409418Z microcode : 0x5003901 2025-05-07T19:43:03.4409671Z cpu MHz : 1202.644 2025-05-07T19:43:03.4409913Z cache size : 36608 KB 2025-05-07T19:43:03.4410143Z physical id : 1 2025-05-07T19:43:03.4410383Z siblings : 48 2025-05-07T19:43:03.4410586Z core id : 4 2025-05-07T19:43:03.4410801Z cpu cores : 24 2025-05-07T19:43:03.4410995Z apicid : 72 2025-05-07T19:43:03.4411202Z initial apicid : 72 2025-05-07T19:43:03.4411394Z fpu : yes 2025-05-07T19:43:03.4411595Z fpu_exception : yes 2025-05-07T19:43:03.4411807Z cpuid level : 13 2025-05-07T19:43:03.4412016Z wp : yes 2025-05-07T19:43:03.4414204Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku 
ospke avx512_vnni 2025-05-07T19:43:03.4416753Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4417325Z bogomips : 5999.99 2025-05-07T19:43:03.4417553Z clflush size : 64 2025-05-07T19:43:03.4417756Z cache_alignment : 64 2025-05-07T19:43:03.4418027Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4418342Z power management: 2025-05-07T19:43:03.4418466Z 2025-05-07T19:43:03.4418560Z processor : 29 2025-05-07T19:43:03.4418765Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4419000Z cpu family : 6 2025-05-07T19:43:03.4419183Z model : 85 2025-05-07T19:43:03.4419533Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4419861Z stepping : 7 2025-05-07T19:43:03.4420251Z microcode : 0x5003901 2025-05-07T19:43:03.4420482Z cpu MHz : 2999.998 2025-05-07T19:43:03.4420716Z cache size : 36608 KB 2025-05-07T19:43:03.4420952Z physical id : 1 2025-05-07T19:43:03.4421166Z siblings : 48 2025-05-07T19:43:03.4421380Z core id : 5 2025-05-07T19:43:03.4421574Z cpu cores : 24 2025-05-07T19:43:03.4421786Z apicid : 74 2025-05-07T19:43:03.4421982Z initial apicid : 74 2025-05-07T19:43:03.4422201Z fpu : yes 2025-05-07T19:43:03.4422398Z fpu_exception : yes 2025-05-07T19:43:03.4422626Z cpuid level : 13 2025-05-07T19:43:03.4422835Z wp : yes 2025-05-07T19:43:03.4425329Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke 
avx512_vnni 2025-05-07T19:43:03.4428117Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4428713Z bogomips : 5999.99 2025-05-07T19:43:03.4428937Z clflush size : 64 2025-05-07T19:43:03.4429166Z cache_alignment : 64 2025-05-07T19:43:03.4429438Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4429768Z power management: 2025-05-07T19:43:03.4429899Z 2025-05-07T19:43:03.4429984Z processor : 30 2025-05-07T19:43:03.4430204Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4430435Z cpu family : 6 2025-05-07T19:43:03.4430651Z model : 85 2025-05-07T19:43:03.4430918Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4431282Z stepping : 7 2025-05-07T19:43:03.4431485Z microcode : 0x5003901 2025-05-07T19:43:03.4431711Z cpu MHz : 2999.998 2025-05-07T19:43:03.4432043Z cache size : 36608 KB 2025-05-07T19:43:03.4432254Z physical id : 1 2025-05-07T19:43:03.4432479Z siblings : 48 2025-05-07T19:43:03.4432671Z core id : 6 2025-05-07T19:43:03.4432880Z cpu cores : 24 2025-05-07T19:43:03.4433076Z apicid : 76 2025-05-07T19:43:03.4433292Z initial apicid : 76 2025-05-07T19:43:03.4433573Z fpu : yes 2025-05-07T19:43:03.4433812Z fpu_exception : yes 2025-05-07T19:43:03.4434087Z cpuid level : 13 2025-05-07T19:43:03.4434360Z wp : yes 2025-05-07T19:43:03.4436790Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4439589Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4440182Z bogomips : 5999.99 2025-05-07T19:43:03.4440390Z clflush size : 64 2025-05-07T19:43:03.4440605Z cache_alignment : 64 2025-05-07T19:43:03.4440879Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4441193Z power management: 2025-05-07T19:43:03.4441329Z 2025-05-07T19:43:03.4441422Z processor : 31 2025-05-07T19:43:03.4441650Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4441899Z cpu family : 6 2025-05-07T19:43:03.4442108Z model : 85 2025-05-07T19:43:03.4442397Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4442746Z stepping : 7 2025-05-07T19:43:03.4442971Z microcode : 0x5003901 2025-05-07T19:43:03.4443192Z cpu MHz : 1204.252 2025-05-07T19:43:03.4443424Z cache size : 36608 KB 2025-05-07T19:43:03.4443759Z physical id : 1 2025-05-07T19:43:03.4443957Z siblings : 48 2025-05-07T19:43:03.4444172Z core id : 7 2025-05-07T19:43:03.4444365Z cpu cores : 24 2025-05-07T19:43:03.4444567Z apicid : 78 2025-05-07T19:43:03.4444743Z initial apicid : 78 2025-05-07T19:43:03.4444946Z fpu : yes 2025-05-07T19:43:03.4445131Z fpu_exception : yes 2025-05-07T19:43:03.4445326Z cpuid level : 13 2025-05-07T19:43:03.4445519Z wp : yes 2025-05-07T19:43:03.4447757Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4450552Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4451119Z bogomips : 5999.99 2025-05-07T19:43:03.4451324Z clflush size : 64 2025-05-07T19:43:03.4451535Z cache_alignment : 64 2025-05-07T19:43:03.4451788Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4452102Z power management: 2025-05-07T19:43:03.4452228Z 2025-05-07T19:43:03.4452307Z processor : 32 2025-05-07T19:43:03.4452515Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4452732Z cpu family : 6 2025-05-07T19:43:03.4452924Z model : 85 2025-05-07T19:43:03.4453179Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4453523Z stepping : 7 2025-05-07T19:43:03.4453708Z microcode : 0x5003901 2025-05-07T19:43:03.4453919Z cpu MHz : 1212.950 2025-05-07T19:43:03.4454124Z cache size : 36608 KB 2025-05-07T19:43:03.4454345Z physical id : 1 2025-05-07T19:43:03.4454559Z siblings : 48 2025-05-07T19:43:03.4454745Z core id : 8 2025-05-07T19:43:03.4454942Z cpu cores : 24 2025-05-07T19:43:03.4455122Z apicid : 80 2025-05-07T19:43:03.4455329Z initial apicid : 80 2025-05-07T19:43:03.4455531Z fpu : yes 2025-05-07T19:43:03.4455734Z fpu_exception : yes 2025-05-07T19:43:03.4455934Z cpuid level : 13 2025-05-07T19:43:03.4456139Z wp : yes 2025-05-07T19:43:03.4458433Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4461380Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4461981Z bogomips : 5999.99 2025-05-07T19:43:03.4462187Z clflush size : 64 2025-05-07T19:43:03.4462418Z cache_alignment : 64 2025-05-07T19:43:03.4462689Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4463009Z power management: 2025-05-07T19:43:03.4463137Z 2025-05-07T19:43:03.4463225Z processor : 33 2025-05-07T19:43:03.4463426Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4463660Z cpu family : 6 2025-05-07T19:43:03.4463847Z model : 85 2025-05-07T19:43:03.4464116Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4464457Z stepping : 7 2025-05-07T19:43:03.4464660Z microcode : 0x5003901 2025-05-07T19:43:03.4464870Z cpu MHz : 1211.733 2025-05-07T19:43:03.4465084Z cache size : 36608 KB 2025-05-07T19:43:03.4465305Z physical id : 1 2025-05-07T19:43:03.4465502Z siblings : 48 2025-05-07T19:43:03.4465699Z core id : 9 2025-05-07T19:43:03.4465887Z cpu cores : 24 2025-05-07T19:43:03.4466108Z apicid : 82 2025-05-07T19:43:03.4466312Z initial apicid : 82 2025-05-07T19:43:03.4466533Z fpu : yes 2025-05-07T19:43:03.4466722Z fpu_exception : yes 2025-05-07T19:43:03.4466947Z cpuid level : 13 2025-05-07T19:43:03.4467154Z wp : yes 2025-05-07T19:43:03.4469562Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4472502Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4473099Z bogomips : 5999.99 2025-05-07T19:43:03.4473312Z clflush size : 64 2025-05-07T19:43:03.4473537Z cache_alignment : 64 2025-05-07T19:43:03.4473806Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4474127Z power management: 2025-05-07T19:43:03.4474249Z 2025-05-07T19:43:03.4474326Z processor : 34 2025-05-07T19:43:03.4474536Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4474762Z cpu family : 6 2025-05-07T19:43:03.4474957Z model : 85 2025-05-07T19:43:03.4475222Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4475579Z stepping : 7 2025-05-07T19:43:03.4475782Z microcode : 0x5003901 2025-05-07T19:43:03.4476011Z cpu MHz : 1202.343 2025-05-07T19:43:03.4476227Z cache size : 36608 KB 2025-05-07T19:43:03.4476436Z physical id : 1 2025-05-07T19:43:03.4476648Z siblings : 48 2025-05-07T19:43:03.4476845Z core id : 10 2025-05-07T19:43:03.4477049Z cpu cores : 24 2025-05-07T19:43:03.4477249Z apicid : 84 2025-05-07T19:43:03.4477450Z initial apicid : 84 2025-05-07T19:43:03.4477650Z fpu : yes 2025-05-07T19:43:03.4477853Z fpu_exception : yes 2025-05-07T19:43:03.4477936Z cpuid level : 13 2025-05-07T19:43:03.4478009Z wp : yes 2025-05-07T19:43:03.4480206Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4480607Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4480706Z bogomips : 5999.99 2025-05-07T19:43:03.4480789Z clflush size : 64 2025-05-07T19:43:03.4480874Z cache_alignment : 64 2025-05-07T19:43:03.4481000Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4481098Z power management: 2025-05-07T19:43:03.4481102Z 2025-05-07T19:43:03.4481179Z processor : 35 2025-05-07T19:43:03.4481277Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4481376Z cpu family : 6 2025-05-07T19:43:03.4481450Z model : 85 2025-05-07T19:43:03.4481610Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4481688Z stepping : 7 2025-05-07T19:43:03.4481791Z microcode : 0x5003901 2025-05-07T19:43:03.4481874Z cpu MHz : 1202.292 2025-05-07T19:43:03.4482087Z cache size : 36608 KB 2025-05-07T19:43:03.4482195Z physical id : 1 2025-05-07T19:43:03.4482278Z siblings : 48 2025-05-07T19:43:03.4482518Z core id : 11 2025-05-07T19:43:03.4482598Z cpu cores : 24 2025-05-07T19:43:03.4482703Z apicid : 86 2025-05-07T19:43:03.4482791Z initial apicid : 86 2025-05-07T19:43:03.4482869Z fpu : yes 2025-05-07T19:43:03.4482954Z fpu_exception : yes 2025-05-07T19:43:03.4483055Z cpuid level : 13 2025-05-07T19:43:03.4483163Z wp : yes 2025-05-07T19:43:03.4485409Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4485962Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4486046Z bogomips : 5999.99 2025-05-07T19:43:03.4486136Z clflush size : 64 2025-05-07T19:43:03.4486229Z cache_alignment : 64 2025-05-07T19:43:03.4486363Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4486446Z power management: 2025-05-07T19:43:03.4486450Z 2025-05-07T19:43:03.4486540Z processor : 36 2025-05-07T19:43:03.4486629Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4486705Z cpu family : 6 2025-05-07T19:43:03.4486799Z model : 85 2025-05-07T19:43:03.4486959Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4487043Z stepping : 7 2025-05-07T19:43:03.4487126Z microcode : 0x5003901 2025-05-07T19:43:03.4487225Z cpu MHz : 1202.590 2025-05-07T19:43:03.4487311Z cache size : 36608 KB 2025-05-07T19:43:03.4487390Z physical id : 1 2025-05-07T19:43:03.4487474Z siblings : 48 2025-05-07T19:43:03.4487569Z core id : 12 2025-05-07T19:43:03.4487646Z cpu cores : 24 2025-05-07T19:43:03.4487724Z apicid : 88 2025-05-07T19:43:03.4487816Z initial apicid : 88 2025-05-07T19:43:03.4487915Z fpu : yes 2025-05-07T19:43:03.4488014Z fpu_exception : yes 2025-05-07T19:43:03.4488109Z cpuid level : 13 2025-05-07T19:43:03.4488226Z wp : yes 2025-05-07T19:43:03.4490476Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4490923Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4491028Z bogomips : 5999.99 2025-05-07T19:43:03.4491126Z clflush size : 64 2025-05-07T19:43:03.4491227Z cache_alignment : 64 2025-05-07T19:43:03.4491398Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4491499Z power management: 2025-05-07T19:43:03.4491503Z 2025-05-07T19:43:03.4491597Z processor : 37 2025-05-07T19:43:03.4491714Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4491797Z cpu family : 6 2025-05-07T19:43:03.4491886Z model : 85 2025-05-07T19:43:03.4492055Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4492166Z stepping : 7 2025-05-07T19:43:03.4492268Z microcode : 0x5003901 2025-05-07T19:43:03.4492354Z cpu MHz : 1203.499 2025-05-07T19:43:03.4492464Z cache size : 36608 KB 2025-05-07T19:43:03.4492558Z physical id : 1 2025-05-07T19:43:03.4492646Z siblings : 48 2025-05-07T19:43:03.4492737Z core id : 13 2025-05-07T19:43:03.4492848Z cpu cores : 24 2025-05-07T19:43:03.4492939Z apicid : 90 2025-05-07T19:43:03.4493038Z initial apicid : 90 2025-05-07T19:43:03.4493158Z fpu : yes 2025-05-07T19:43:03.4493260Z fpu_exception : yes 2025-05-07T19:43:03.4493355Z cpuid level : 13 2025-05-07T19:43:03.4493448Z wp : yes 2025-05-07T19:43:03.4495817Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4496274Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4496402Z bogomips : 5999.99 2025-05-07T19:43:03.4496540Z clflush size : 64 2025-05-07T19:43:03.4496641Z cache_alignment : 64 2025-05-07T19:43:03.4496779Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4496895Z power management: 2025-05-07T19:43:03.4496899Z 2025-05-07T19:43:03.4496989Z processor : 38 2025-05-07T19:43:03.4497091Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4497211Z cpu family : 6 2025-05-07T19:43:03.4497297Z model : 85 2025-05-07T19:43:03.4497465Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4497555Z stepping : 7 2025-05-07T19:43:03.4497678Z microcode : 0x5003901 2025-05-07T19:43:03.4497773Z cpu MHz : 1196.390 2025-05-07T19:43:03.4497865Z cache size : 36608 KB 2025-05-07T19:43:03.4497977Z physical id : 1 2025-05-07T19:43:03.4498069Z siblings : 48 2025-05-07T19:43:03.4498157Z core id : 14 2025-05-07T19:43:03.4498246Z cpu cores : 24 2025-05-07T19:43:03.4498354Z apicid : 92 2025-05-07T19:43:03.4498446Z initial apicid : 92 2025-05-07T19:43:03.4498532Z fpu : yes 2025-05-07T19:43:03.4498654Z fpu_exception : yes 2025-05-07T19:43:03.4498745Z cpuid level : 13 2025-05-07T19:43:03.4498834Z wp : yes 2025-05-07T19:43:03.4501323Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4501739Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4501835Z bogomips : 5999.99 2025-05-07T19:43:03.4501954Z clflush size : 64 2025-05-07T19:43:03.4502055Z cache_alignment : 64 2025-05-07T19:43:03.4502197Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4502294Z power management: 2025-05-07T19:43:03.4502299Z 2025-05-07T19:43:03.4502425Z processor : 39 2025-05-07T19:43:03.4502524Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4502618Z cpu family : 6 2025-05-07T19:43:03.4502733Z model : 85 2025-05-07T19:43:03.4502906Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4502996Z stepping : 7 2025-05-07T19:43:03.4503091Z microcode : 0x5003901 2025-05-07T19:43:03.4503205Z cpu MHz : 1204.596 2025-05-07T19:43:03.4503303Z cache size : 36608 KB 2025-05-07T19:43:03.4503400Z physical id : 1 2025-05-07T19:43:03.4503514Z siblings : 48 2025-05-07T19:43:03.4503605Z core id : 15 2025-05-07T19:43:03.4503698Z cpu cores : 24 2025-05-07T19:43:03.4503789Z apicid : 94 2025-05-07T19:43:03.4503912Z initial apicid : 94 2025-05-07T19:43:03.4504004Z fpu : yes 2025-05-07T19:43:03.4504102Z fpu_exception : yes 2025-05-07T19:43:03.4504201Z cpuid level : 13 2025-05-07T19:43:03.4504315Z wp : yes 2025-05-07T19:43:03.4506593Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4507032Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4507193Z bogomips : 5999.99 2025-05-07T19:43:03.4507288Z clflush size : 64 2025-05-07T19:43:03.4507412Z cache_alignment : 64 2025-05-07T19:43:03.4507597Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4507696Z power management: 2025-05-07T19:43:03.4507700Z 2025-05-07T19:43:03.4507793Z processor : 40 2025-05-07T19:43:03.4507922Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4508014Z cpu family : 6 2025-05-07T19:43:03.4508109Z model : 85 2025-05-07T19:43:03.4508311Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4508407Z stepping : 7 2025-05-07T19:43:03.4508504Z microcode : 0x5003901 2025-05-07T19:43:03.4508599Z cpu MHz : 2999.998 2025-05-07T19:43:03.4508717Z cache size : 36608 KB 2025-05-07T19:43:03.4508814Z physical id : 1 2025-05-07T19:43:03.4508906Z siblings : 48 2025-05-07T19:43:03.4509021Z core id : 16 2025-05-07T19:43:03.4509113Z cpu cores : 24 2025-05-07T19:43:03.4509206Z apicid : 96 2025-05-07T19:43:03.4509304Z initial apicid : 96 2025-05-07T19:43:03.4509417Z fpu : yes 2025-05-07T19:43:03.4509516Z fpu_exception : yes 2025-05-07T19:43:03.4509609Z cpuid level : 13 2025-05-07T19:43:03.4509701Z wp : yes 2025-05-07T19:43:03.4512077Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4512456Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4512572Z bogomips : 5999.99 2025-05-07T19:43:03.4512659Z clflush size : 64 2025-05-07T19:43:03.4512748Z cache_alignment : 64 2025-05-07T19:43:03.4512880Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4512997Z power management: 2025-05-07T19:43:03.4513001Z 2025-05-07T19:43:03.4513088Z processor : 41 2025-05-07T19:43:03.4513187Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4513294Z cpu family : 6 2025-05-07T19:43:03.4513377Z model : 85 2025-05-07T19:43:03.4513535Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4513646Z stepping : 7 2025-05-07T19:43:03.4513735Z microcode : 0x5003901 2025-05-07T19:43:03.4513822Z cpu MHz : 2999.998 2025-05-07T19:43:03.4513910Z cache size : 36608 KB 2025-05-07T19:43:03.4514018Z physical id : 1 2025-05-07T19:43:03.4514102Z siblings : 48 2025-05-07T19:43:03.4514186Z core id : 17 2025-05-07T19:43:03.4514272Z cpu cores : 24 2025-05-07T19:43:03.4514379Z apicid : 98 2025-05-07T19:43:03.4514472Z initial apicid : 98 2025-05-07T19:43:03.4514553Z fpu : yes 2025-05-07T19:43:03.4514665Z fpu_exception : yes 2025-05-07T19:43:03.4514752Z cpuid level : 13 2025-05-07T19:43:03.4514833Z wp : yes 2025-05-07T19:43:03.4516928Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4517308Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4517436Z bogomips : 5999.99 2025-05-07T19:43:03.4517546Z clflush size : 64 2025-05-07T19:43:03.4517638Z cache_alignment : 64 2025-05-07T19:43:03.4517769Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4517859Z power management: 2025-05-07T19:43:03.4517929Z 2025-05-07T19:43:03.4518018Z processor : 42 2025-05-07T19:43:03.4518112Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4518196Z cpu family : 6 2025-05-07T19:43:03.4518305Z model : 85 2025-05-07T19:43:03.4518466Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4518552Z stepping : 7 2025-05-07T19:43:03.4518640Z microcode : 0x5003901 2025-05-07T19:43:03.4518749Z cpu MHz : 2999.998 2025-05-07T19:43:03.4518836Z cache size : 36608 KB 2025-05-07T19:43:03.4518925Z physical id : 1 2025-05-07T19:43:03.4519030Z siblings : 48 2025-05-07T19:43:03.4519112Z core id : 18 2025-05-07T19:43:03.4519199Z cpu cores : 24 2025-05-07T19:43:03.4519282Z apicid : 100 2025-05-07T19:43:03.4519392Z initial apicid : 100 2025-05-07T19:43:03.4519477Z fpu : yes 2025-05-07T19:43:03.4519567Z fpu_exception : yes 2025-05-07T19:43:03.4519674Z cpuid level : 13 2025-05-07T19:43:03.4519755Z wp : yes 2025-05-07T19:43:03.4521836Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4522235Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4522323Z bogomips : 5999.99 2025-05-07T19:43:03.4522413Z clflush size : 64 2025-05-07T19:43:03.4522526Z cache_alignment : 64 2025-05-07T19:43:03.4522657Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4522748Z power management: 2025-05-07T19:43:03.4522752Z 2025-05-07T19:43:03.4522843Z processor : 43 2025-05-07T19:43:03.4522957Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4523043Z cpu family : 6 2025-05-07T19:43:03.4523127Z model : 85 2025-05-07T19:43:03.4523311Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4523402Z stepping : 7 2025-05-07T19:43:03.4523493Z microcode : 0x5003901 2025-05-07T19:43:03.4523583Z cpu MHz : 1206.590 2025-05-07T19:43:03.4523700Z cache size : 36608 KB 2025-05-07T19:43:03.4523789Z physical id : 1 2025-05-07T19:43:03.4523878Z siblings : 48 2025-05-07T19:43:03.4523993Z core id : 19 2025-05-07T19:43:03.4524079Z cpu cores : 24 2025-05-07T19:43:03.4524168Z apicid : 102 2025-05-07T19:43:03.4524259Z initial apicid : 102 2025-05-07T19:43:03.4524366Z fpu : yes 2025-05-07T19:43:03.4524457Z fpu_exception : yes 2025-05-07T19:43:03.4524544Z cpuid level : 13 2025-05-07T19:43:03.4524650Z wp : yes 2025-05-07T19:43:03.4526764Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4527145Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4527253Z bogomips : 5999.99 2025-05-07T19:43:03.4527340Z clflush size : 64 2025-05-07T19:43:03.4527473Z cache_alignment : 64 2025-05-07T19:43:03.4527627Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4527718Z power management: 2025-05-07T19:43:03.4527723Z 2025-05-07T19:43:03.4527811Z processor : 44 2025-05-07T19:43:03.4527945Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4528059Z cpu family : 6 2025-05-07T19:43:03.4528150Z model : 85 2025-05-07T19:43:03.4528313Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4528431Z stepping : 7 2025-05-07T19:43:03.4528524Z microcode : 0x5003901 2025-05-07T19:43:03.4528612Z cpu MHz : 2999.998 2025-05-07T19:43:03.4528703Z cache size : 36608 KB 2025-05-07T19:43:03.4528817Z physical id : 1 2025-05-07T19:43:03.4528904Z siblings : 48 2025-05-07T19:43:03.4528990Z core id : 20 2025-05-07T19:43:03.4529106Z cpu cores : 24 2025-05-07T19:43:03.4529193Z apicid : 104 2025-05-07T19:43:03.4529289Z initial apicid : 104 2025-05-07T19:43:03.4529374Z fpu : yes 2025-05-07T19:43:03.4529492Z fpu_exception : yes 2025-05-07T19:43:03.4529583Z cpuid level : 13 2025-05-07T19:43:03.4529664Z wp : yes 2025-05-07T19:43:03.4531778Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4532163Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4532253Z bogomips : 5999.99 2025-05-07T19:43:03.4532376Z clflush size : 64 2025-05-07T19:43:03.4532470Z cache_alignment : 64 2025-05-07T19:43:03.4532612Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4532735Z power management: 2025-05-07T19:43:03.4532739Z 2025-05-07T19:43:03.4532832Z processor : 45 2025-05-07T19:43:03.4532931Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4533024Z cpu family : 6 2025-05-07T19:43:03.4533139Z model : 85 2025-05-07T19:43:03.4533302Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4533392Z stepping : 7 2025-05-07T19:43:03.4533516Z microcode : 0x5003901 2025-05-07T19:43:03.4533610Z cpu MHz : 2999.998 2025-05-07T19:43:03.4533707Z cache size : 36608 KB 2025-05-07T19:43:03.4533798Z physical id : 1 2025-05-07T19:43:03.4533908Z siblings : 48 2025-05-07T19:43:03.4533996Z core id : 21 2025-05-07T19:43:03.4534079Z cpu cores : 24 2025-05-07T19:43:03.4534197Z apicid : 106 2025-05-07T19:43:03.4534289Z initial apicid : 106 2025-05-07T19:43:03.4534374Z fpu : yes 2025-05-07T19:43:03.4534467Z fpu_exception : yes 2025-05-07T19:43:03.4534581Z cpuid level : 13 2025-05-07T19:43:03.4534669Z wp : yes 2025-05-07T19:43:03.4536753Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4537153Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4537239Z bogomips : 5999.99 2025-05-07T19:43:03.4537325Z clflush size : 64 2025-05-07T19:43:03.4537439Z cache_alignment : 64 2025-05-07T19:43:03.4537572Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4537704Z power management: 2025-05-07T19:43:03.4537708Z 2025-05-07T19:43:03.4537819Z processor : 46 2025-05-07T19:43:03.4537918Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4538009Z cpu family : 6 2025-05-07T19:43:03.4538094Z model : 85 2025-05-07T19:43:03.4538327Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4538412Z stepping : 7 2025-05-07T19:43:03.4538501Z microcode : 0x5003901 2025-05-07T19:43:03.4538614Z cpu MHz : 1203.516 2025-05-07T19:43:03.4538701Z cache size : 36608 KB 2025-05-07T19:43:03.4538786Z physical id : 1 2025-05-07T19:43:03.4538869Z siblings : 48 2025-05-07T19:43:03.4538972Z core id : 22 2025-05-07T19:43:03.4539057Z cpu cores : 24 2025-05-07T19:43:03.4539142Z apicid : 108 2025-05-07T19:43:03.4539258Z initial apicid : 108 2025-05-07T19:43:03.4539340Z fpu : yes 2025-05-07T19:43:03.4539490Z fpu_exception : yes 2025-05-07T19:43:03.4539582Z cpuid level : 13 2025-05-07T19:43:03.4539688Z wp : yes 2025-05-07T19:43:03.4542098Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4542535Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4542631Z bogomips : 5999.99 2025-05-07T19:43:03.4542722Z clflush size : 64 2025-05-07T19:43:03.4542819Z cache_alignment : 64 2025-05-07T19:43:03.4542985Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4543080Z power management: 2025-05-07T19:43:03.4543088Z 2025-05-07T19:43:03.4543180Z processor : 47 2025-05-07T19:43:03.4559331Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4559473Z cpu family : 6 2025-05-07T19:43:03.4559557Z model : 85 2025-05-07T19:43:03.4559745Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4559840Z stepping : 7 2025-05-07T19:43:03.4559925Z microcode : 0x5003901 2025-05-07T19:43:03.4560008Z cpu MHz : 2999.998 2025-05-07T19:43:03.4560089Z cache size : 36608 KB 2025-05-07T19:43:03.4560181Z physical id : 1 2025-05-07T19:43:03.4560260Z siblings : 48 2025-05-07T19:43:03.4560341Z core id : 23 2025-05-07T19:43:03.4560428Z cpu cores : 24 2025-05-07T19:43:03.4560506Z apicid : 110 2025-05-07T19:43:03.4560590Z initial apicid : 110 2025-05-07T19:43:03.4560746Z fpu : yes 2025-05-07T19:43:03.4560842Z fpu_exception : yes 2025-05-07T19:43:03.4560919Z cpuid level : 13 2025-05-07T19:43:03.4560994Z wp : yes 2025-05-07T19:43:03.4563314Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4563716Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4563801Z bogomips : 5999.99 2025-05-07T19:43:03.4563904Z clflush size : 64 2025-05-07T19:43:03.4564057Z cache_alignment : 64 2025-05-07T19:43:03.4564197Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4564313Z power management: 2025-05-07T19:43:03.4564319Z 2025-05-07T19:43:03.4565208Z processor : 48 2025-05-07T19:43:03.4565324Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4565403Z cpu family : 6 2025-05-07T19:43:03.4565479Z model : 85 2025-05-07T19:43:03.4565677Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4565814Z stepping : 7 2025-05-07T19:43:03.4565902Z microcode : 0x5003901 2025-05-07T19:43:03.4565983Z cpu MHz : 2999.998 2025-05-07T19:43:03.4566074Z cache size : 36608 KB 2025-05-07T19:43:03.4566160Z physical id : 0 2025-05-07T19:43:03.4566234Z siblings : 48 2025-05-07T19:43:03.4566329Z core id : 0 2025-05-07T19:43:03.4566408Z cpu cores : 24 2025-05-07T19:43:03.4566482Z apicid : 1 2025-05-07T19:43:03.4566561Z initial apicid : 1 2025-05-07T19:43:03.4566657Z fpu : yes 2025-05-07T19:43:03.4566747Z fpu_exception : yes 2025-05-07T19:43:03.4566832Z cpuid level : 13 2025-05-07T19:43:03.4566909Z wp : yes 2025-05-07T19:43:03.4569311Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4569722Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4569814Z bogomips : 5999.99 2025-05-07T19:43:03.4569897Z clflush size : 64 2025-05-07T19:43:03.4569985Z cache_alignment : 64 2025-05-07T19:43:03.4570122Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4570328Z power management: 2025-05-07T19:43:03.4570333Z 2025-05-07T19:43:03.4570410Z processor : 49 2025-05-07T19:43:03.4570498Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4570587Z cpu family : 6 2025-05-07T19:43:03.4570655Z model : 85 2025-05-07T19:43:03.4570812Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4570906Z stepping : 7 2025-05-07T19:43:03.4570998Z microcode : 0x5003901 2025-05-07T19:43:03.4571076Z cpu MHz : 2999.998 2025-05-07T19:43:03.4571159Z cache size : 36608 KB 2025-05-07T19:43:03.4571253Z physical id : 0 2025-05-07T19:43:03.4571332Z siblings : 48 2025-05-07T19:43:03.4571407Z core id : 1 2025-05-07T19:43:03.4571484Z cpu cores : 24 2025-05-07T19:43:03.4571571Z apicid : 3 2025-05-07T19:43:03.4571655Z initial apicid : 3 2025-05-07T19:43:03.4571728Z fpu : yes 2025-05-07T19:43:03.4571821Z fpu_exception : yes 2025-05-07T19:43:03.4571903Z cpuid level : 13 2025-05-07T19:43:03.4571977Z wp : yes 2025-05-07T19:43:03.4574166Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4574557Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4574639Z bogomips : 5999.99 2025-05-07T19:43:03.4574729Z clflush size : 64 2025-05-07T19:43:03.4574810Z cache_alignment : 64 2025-05-07T19:43:03.4574939Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4575020Z power management: 2025-05-07T19:43:03.4575036Z 2025-05-07T19:43:03.4575112Z processor : 50 2025-05-07T19:43:03.4575198Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4575278Z cpu family : 6 2025-05-07T19:43:03.4575415Z model : 85 2025-05-07T19:43:03.4575571Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4575653Z stepping : 7 2025-05-07T19:43:03.4575741Z microcode : 0x5003901 2025-05-07T19:43:03.4575873Z cpu MHz : 3231.404 2025-05-07T19:43:03.4575955Z cache size : 36608 KB 2025-05-07T19:43:03.4576033Z physical id : 0 2025-05-07T19:43:03.4576122Z siblings : 48 2025-05-07T19:43:03.4576198Z core id : 2 2025-05-07T19:43:03.4576273Z cpu cores : 24 2025-05-07T19:43:03.4576346Z apicid : 5 2025-05-07T19:43:03.4576439Z initial apicid : 5 2025-05-07T19:43:03.4576512Z fpu : yes 2025-05-07T19:43:03.4576591Z fpu_exception : yes 2025-05-07T19:43:03.4576678Z cpuid level : 13 2025-05-07T19:43:03.4576756Z wp : yes 2025-05-07T19:43:03.4578963Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4579451Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4579542Z bogomips : 5999.99 2025-05-07T19:43:03.4579622Z clflush size : 64 2025-05-07T19:43:03.4579715Z cache_alignment : 64 2025-05-07T19:43:03.4580009Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4580096Z power management: 2025-05-07T19:43:03.4580100Z 2025-05-07T19:43:03.4580190Z processor : 51 2025-05-07T19:43:03.4580299Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4580379Z cpu family : 6 2025-05-07T19:43:03.4580512Z model : 85 2025-05-07T19:43:03.4580695Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4580778Z stepping : 7 2025-05-07T19:43:03.4580864Z microcode : 0x5003901 2025-05-07T19:43:03.4580948Z cpu MHz : 2999.998 2025-05-07T19:43:03.4581048Z cache size : 36608 KB 2025-05-07T19:43:03.4581130Z physical id : 0 2025-05-07T19:43:03.4581209Z siblings : 48 2025-05-07T19:43:03.4581302Z core id : 3 2025-05-07T19:43:03.4581381Z cpu cores : 24 2025-05-07T19:43:03.4581458Z apicid : 7 2025-05-07T19:43:03.4581544Z initial apicid : 7 2025-05-07T19:43:03.4581639Z fpu : yes 2025-05-07T19:43:03.4581722Z fpu_exception : yes 2025-05-07T19:43:03.4581803Z cpuid level : 13 2025-05-07T19:43:03.4582530Z wp : yes 2025-05-07T19:43:03.4584777Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4585179Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4585275Z bogomips : 5999.99 2025-05-07T19:43:03.4585360Z clflush size : 64 2025-05-07T19:43:03.4585446Z cache_alignment : 64 2025-05-07T19:43:03.4585598Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4585683Z power management: 2025-05-07T19:43:03.4585688Z 2025-05-07T19:43:03.4585770Z processor : 52 2025-05-07T19:43:03.4585859Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4585950Z cpu family : 6 2025-05-07T19:43:03.4586026Z model : 85 2025-05-07T19:43:03.4586185Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4586391Z stepping : 7 2025-05-07T19:43:03.4586484Z microcode : 0x5003901 2025-05-07T19:43:03.4586565Z cpu MHz : 3250.118 2025-05-07T19:43:03.4586644Z cache size : 36608 KB 2025-05-07T19:43:03.4586810Z physical id : 0 2025-05-07T19:43:03.4586893Z siblings : 48 2025-05-07T19:43:03.4586975Z core id : 4 2025-05-07T19:43:03.4587067Z cpu cores : 24 2025-05-07T19:43:03.4587146Z apicid : 9 2025-05-07T19:43:03.4587232Z initial apicid : 9 2025-05-07T19:43:03.4587312Z fpu : yes 2025-05-07T19:43:03.4587416Z fpu_exception : yes 2025-05-07T19:43:03.4587498Z cpuid level : 13 2025-05-07T19:43:03.4587575Z wp : yes 2025-05-07T19:43:03.4589841Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4590250Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4590331Z bogomips : 5999.99 2025-05-07T19:43:03.4590432Z clflush size : 64 2025-05-07T19:43:03.4590516Z cache_alignment : 64 2025-05-07T19:43:03.4590647Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4590748Z power management: 2025-05-07T19:43:03.4590752Z 2025-05-07T19:43:03.4590837Z processor : 53 2025-05-07T19:43:03.4590932Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4591011Z cpu family : 6 2025-05-07T19:43:03.4591099Z model : 85 2025-05-07T19:43:03.4591261Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4591346Z stepping : 7 2025-05-07T19:43:03.4591444Z microcode : 0x5003901 2025-05-07T19:43:03.4591528Z cpu MHz : 2999.998 2025-05-07T19:43:03.4591614Z cache size : 36608 KB 2025-05-07T19:43:03.4591695Z physical id : 0 2025-05-07T19:43:03.4591780Z siblings : 48 2025-05-07T19:43:03.4591859Z core id : 5 2025-05-07T19:43:03.4591935Z cpu cores : 24 2025-05-07T19:43:03.4592023Z apicid : 11 2025-05-07T19:43:03.4592107Z initial apicid : 11 2025-05-07T19:43:03.4592186Z fpu : yes 2025-05-07T19:43:03.4592273Z fpu_exception : yes 2025-05-07T19:43:03.4592363Z cpuid level : 13 2025-05-07T19:43:03.4592437Z wp : yes 2025-05-07T19:43:03.4594745Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4595128Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4595204Z bogomips : 5999.99 2025-05-07T19:43:03.4595280Z clflush size : 64 2025-05-07T19:43:03.4595376Z cache_alignment : 64 2025-05-07T19:43:03.4595496Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4595569Z power management: 2025-05-07T19:43:03.4595573Z 2025-05-07T19:43:03.4595656Z processor : 54 2025-05-07T19:43:03.4595737Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4595808Z cpu family : 6 2025-05-07T19:43:03.4595876Z model : 85 2025-05-07T19:43:03.4596030Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4596102Z stepping : 7 2025-05-07T19:43:03.4596178Z microcode : 0x5003901 2025-05-07T19:43:03.4596306Z cpu MHz : 2999.998 2025-05-07T19:43:03.4596385Z cache size : 36608 KB 2025-05-07T19:43:03.4596458Z physical id : 0 2025-05-07T19:43:03.4596528Z siblings : 48 2025-05-07T19:43:03.4596610Z core id : 6 2025-05-07T19:43:03.4596735Z cpu cores : 24 2025-05-07T19:43:03.4596806Z apicid : 13 2025-05-07T19:43:03.4596883Z initial apicid : 13 2025-05-07T19:43:03.4596969Z fpu : yes 2025-05-07T19:43:03.4597050Z fpu_exception : yes 2025-05-07T19:43:03.4597123Z cpuid level : 13 2025-05-07T19:43:03.4597204Z wp : yes 2025-05-07T19:43:03.4599256Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4599629Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4599713Z bogomips : 5999.99 2025-05-07T19:43:03.4599790Z clflush size : 64 2025-05-07T19:43:03.4599873Z cache_alignment : 64 2025-05-07T19:43:03.4600012Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4600093Z power management: 2025-05-07T19:43:03.4600097Z 2025-05-07T19:43:03.4600170Z processor : 55 2025-05-07T19:43:03.4600268Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4600345Z cpu family : 6 2025-05-07T19:43:03.4600417Z model : 85 2025-05-07T19:43:03.4600564Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4600660Z stepping : 7 2025-05-07T19:43:03.4600738Z microcode : 0x5003901 2025-05-07T19:43:03.4600809Z cpu MHz : 3227.883 2025-05-07T19:43:03.4600905Z cache size : 36608 KB 2025-05-07T19:43:03.4600979Z physical id : 0 2025-05-07T19:43:03.4601049Z siblings : 48 2025-05-07T19:43:03.4601119Z core id : 7 2025-05-07T19:43:03.4601207Z cpu cores : 24 2025-05-07T19:43:03.4601282Z apicid : 15 2025-05-07T19:43:03.4601359Z initial apicid : 15 2025-05-07T19:43:03.4601426Z fpu : yes 2025-05-07T19:43:03.4601514Z fpu_exception : yes 2025-05-07T19:43:03.4601589Z cpuid level : 13 2025-05-07T19:43:03.4601656Z wp : yes 2025-05-07T19:43:03.4603728Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4604105Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4604183Z bogomips : 5999.99 2025-05-07T19:43:03.4604271Z clflush size : 64 2025-05-07T19:43:03.4604351Z cache_alignment : 64 2025-05-07T19:43:03.4604472Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4604564Z power management: 2025-05-07T19:43:03.4604568Z 2025-05-07T19:43:03.4604641Z processor : 56 2025-05-07T19:43:03.4604724Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4604801Z cpu family : 6 2025-05-07T19:43:03.4604889Z model : 85 2025-05-07T19:43:03.4605040Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4605116Z stepping : 7 2025-05-07T19:43:03.4605206Z microcode : 0x5003901 2025-05-07T19:43:03.4605279Z cpu MHz : 3268.081 2025-05-07T19:43:03.4605355Z cache size : 36608 KB 2025-05-07T19:43:03.4605474Z physical id : 0 2025-05-07T19:43:03.4605561Z siblings : 48 2025-05-07T19:43:03.4605628Z core id : 8 2025-05-07T19:43:03.4605700Z cpu cores : 24 2025-05-07T19:43:03.4605795Z apicid : 17 2025-05-07T19:43:03.4605917Z initial apicid : 17 2025-05-07T19:43:03.4605987Z fpu : yes 2025-05-07T19:43:03.4606064Z fpu_exception : yes 2025-05-07T19:43:03.4606150Z cpuid level : 13 2025-05-07T19:43:03.4606222Z wp : yes 2025-05-07T19:43:03.4608279Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4608653Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4608725Z bogomips : 5999.99 2025-05-07T19:43:03.4608805Z clflush size : 64 2025-05-07T19:43:03.4608896Z cache_alignment : 64 2025-05-07T19:43:03.4609015Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4609090Z power management: 2025-05-07T19:43:03.4609094Z 2025-05-07T19:43:03.4609179Z processor : 57 2025-05-07T19:43:03.4609257Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4609326Z cpu family : 6 2025-05-07T19:43:03.4609395Z model : 85 2025-05-07T19:43:03.4609554Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4609626Z stepping : 7 2025-05-07T19:43:03.4609699Z microcode : 0x5003901 2025-05-07T19:43:03.4609784Z cpu MHz : 3306.789 2025-05-07T19:43:03.4609857Z cache size : 36608 KB 2025-05-07T19:43:03.4609931Z physical id : 0 2025-05-07T19:43:03.4610011Z siblings : 48 2025-05-07T19:43:03.4610094Z core id : 9 2025-05-07T19:43:03.4610165Z cpu cores : 24 2025-05-07T19:43:03.4610235Z apicid : 19 2025-05-07T19:43:03.4610326Z initial apicid : 19 2025-05-07T19:43:03.4610394Z fpu : yes 2025-05-07T19:43:03.4610475Z fpu_exception : yes 2025-05-07T19:43:03.4610549Z cpuid level : 13 2025-05-07T19:43:03.4610636Z wp : yes 2025-05-07T19:43:03.4612687Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4613068Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4613149Z bogomips : 5999.99 2025-05-07T19:43:03.4613222Z clflush size : 64 2025-05-07T19:43:03.4613317Z cache_alignment : 64 2025-05-07T19:43:03.4613439Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4613519Z power management: 2025-05-07T19:43:03.4613523Z 2025-05-07T19:43:03.4613603Z processor : 58 2025-05-07T19:43:03.4613696Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4613771Z cpu family : 6 2025-05-07T19:43:03.4613841Z model : 85 2025-05-07T19:43:03.4613997Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4614072Z stepping : 7 2025-05-07T19:43:03.4614152Z microcode : 0x5003901 2025-05-07T19:43:03.4614225Z cpu MHz : 3239.242 2025-05-07T19:43:03.4614319Z cache size : 36608 KB 2025-05-07T19:43:03.4614395Z physical id : 0 2025-05-07T19:43:03.4614465Z siblings : 48 2025-05-07T19:43:03.4614591Z core id : 10 2025-05-07T19:43:03.4614667Z cpu cores : 24 2025-05-07T19:43:03.4614738Z apicid : 21 2025-05-07T19:43:03.4614814Z initial apicid : 21 2025-05-07T19:43:03.4614894Z fpu : yes 2025-05-07T19:43:03.4614972Z fpu_exception : yes 2025-05-07T19:43:03.4615090Z cpuid level : 13 2025-05-07T19:43:03.4615169Z wp : yes 2025-05-07T19:43:03.4617222Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4617599Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4617686Z bogomips : 5999.99 2025-05-07T19:43:03.4617759Z clflush size : 64 2025-05-07T19:43:03.4617838Z cache_alignment : 64 2025-05-07T19:43:03.4617968Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4618046Z power management: 2025-05-07T19:43:03.4618050Z 2025-05-07T19:43:03.4618127Z processor : 59 2025-05-07T19:43:03.4618213Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4618299Z cpu family : 6 2025-05-07T19:43:03.4618373Z model : 85 2025-05-07T19:43:03.4618526Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4618617Z stepping : 7 2025-05-07T19:43:03.4618694Z microcode : 0x5003901 2025-05-07T19:43:03.4618774Z cpu MHz : 2999.998 2025-05-07T19:43:03.4618849Z cache size : 36608 KB 2025-05-07T19:43:03.4618936Z physical id : 0 2025-05-07T19:43:03.4619007Z siblings : 48 2025-05-07T19:43:03.4619079Z core id : 11 2025-05-07T19:43:03.4619167Z cpu cores : 24 2025-05-07T19:43:03.4619241Z apicid : 23 2025-05-07T19:43:03.4619318Z initial apicid : 23 2025-05-07T19:43:03.4619458Z fpu : yes 2025-05-07T19:43:03.4619555Z fpu_exception : yes 2025-05-07T19:43:03.4619631Z cpuid level : 13 2025-05-07T19:43:03.4619708Z wp : yes 2025-05-07T19:43:03.4622114Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4622511Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4622598Z bogomips : 5999.99 2025-05-07T19:43:03.4622693Z clflush size : 64 2025-05-07T19:43:03.4622782Z cache_alignment : 64 2025-05-07T19:43:03.4622915Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4623005Z power management: 2025-05-07T19:43:03.4623009Z 2025-05-07T19:43:03.4623091Z processor : 60 2025-05-07T19:43:03.4623179Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4623255Z cpu family : 6 2025-05-07T19:43:03.4623337Z model : 85 2025-05-07T19:43:03.4623495Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4623578Z stepping : 7 2025-05-07T19:43:03.4623666Z microcode : 0x5003901 2025-05-07T19:43:03.4623739Z cpu MHz : 3145.828 2025-05-07T19:43:03.4623819Z cache size : 36608 KB 2025-05-07T19:43:03.4623897Z physical id : 0 2025-05-07T19:43:03.4623982Z siblings : 48 2025-05-07T19:43:03.4624055Z core id : 12 2025-05-07T19:43:03.4624131Z cpu cores : 24 2025-05-07T19:43:03.4624263Z apicid : 25 2025-05-07T19:43:03.4624360Z initial apicid : 25 2025-05-07T19:43:03.4624439Z fpu : yes 2025-05-07T19:43:03.4624520Z fpu_exception : yes 2025-05-07T19:43:03.4624617Z cpuid level : 13 2025-05-07T19:43:03.4624690Z wp : yes 2025-05-07T19:43:03.4626977Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4627384Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4627467Z bogomips : 5999.99 2025-05-07T19:43:03.4627558Z clflush size : 64 2025-05-07T19:43:03.4627657Z cache_alignment : 64 2025-05-07T19:43:03.4627787Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4627873Z power management: 2025-05-07T19:43:03.4627878Z 2025-05-07T19:43:03.4627962Z processor : 61 2025-05-07T19:43:03.4628048Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4628129Z cpu family : 6 2025-05-07T19:43:03.4628201Z model : 85 2025-05-07T19:43:03.4628375Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4628452Z stepping : 7 2025-05-07T19:43:03.4628533Z microcode : 0x5003901 2025-05-07T19:43:03.4628617Z cpu MHz : 3118.774 2025-05-07T19:43:03.4628696Z cache size : 36608 KB 2025-05-07T19:43:03.4628776Z physical id : 0 2025-05-07T19:43:03.4628849Z siblings : 48 2025-05-07T19:43:03.4628932Z core id : 13 2025-05-07T19:43:03.4629007Z cpu cores : 24 2025-05-07T19:43:03.4629084Z apicid : 27 2025-05-07T19:43:03.4629162Z initial apicid : 27 2025-05-07T19:43:03.4629245Z fpu : yes 2025-05-07T19:43:03.4629332Z fpu_exception : yes 2025-05-07T19:43:03.4629414Z cpuid level : 13 2025-05-07T19:43:03.4629494Z wp : yes 2025-05-07T19:43:03.4631728Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4632229Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4632314Z bogomips : 5999.99 2025-05-07T19:43:03.4632396Z clflush size : 64 2025-05-07T19:43:03.4632476Z cache_alignment : 64 2025-05-07T19:43:03.4632610Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4632686Z power management: 2025-05-07T19:43:03.4632690Z 2025-05-07T19:43:03.4632769Z processor : 62 2025-05-07T19:43:03.4632860Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4632937Z cpu family : 6 2025-05-07T19:43:03.4633010Z model : 85 2025-05-07T19:43:03.4633160Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4633252Z stepping : 7 2025-05-07T19:43:03.4633333Z microcode : 0x5003901 2025-05-07T19:43:03.4633406Z cpu MHz : 3169.542 2025-05-07T19:43:03.4633481Z cache size : 36608 KB 2025-05-07T19:43:03.4633566Z physical id : 0 2025-05-07T19:43:03.4633645Z siblings : 48 2025-05-07T19:43:03.4633714Z core id : 14 2025-05-07T19:43:03.4633799Z cpu cores : 24 2025-05-07T19:43:03.4633871Z apicid : 29 2025-05-07T19:43:03.4633950Z initial apicid : 29 2025-05-07T19:43:03.4634018Z fpu : yes 2025-05-07T19:43:03.4634170Z fpu_exception : yes 2025-05-07T19:43:03.4634245Z cpuid level : 13 2025-05-07T19:43:03.4634313Z wp : yes 2025-05-07T19:43:03.4636442Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4636812Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4636888Z bogomips : 5999.99 2025-05-07T19:43:03.4636976Z clflush size : 64 2025-05-07T19:43:03.4637065Z cache_alignment : 64 2025-05-07T19:43:03.4637182Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4637274Z power management: 2025-05-07T19:43:03.4637278Z 2025-05-07T19:43:03.4637352Z processor : 63 2025-05-07T19:43:03.4637439Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4637508Z cpu family : 6 2025-05-07T19:43:03.4637588Z model : 85 2025-05-07T19:43:03.4637744Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4637816Z stepping : 7 2025-05-07T19:43:03.4637900Z microcode : 0x5003901 2025-05-07T19:43:03.4637973Z cpu MHz : 3209.957 2025-05-07T19:43:03.4638042Z cache size : 36608 KB 2025-05-07T19:43:03.4638121Z physical id : 0 2025-05-07T19:43:03.4638207Z siblings : 48 2025-05-07T19:43:03.4638285Z core id : 15 2025-05-07T19:43:03.4638357Z cpu cores : 24 2025-05-07T19:43:03.4638440Z apicid : 31 2025-05-07T19:43:03.4638518Z initial apicid : 31 2025-05-07T19:43:03.4638592Z fpu : yes 2025-05-07T19:43:03.4638669Z fpu_exception : yes 2025-05-07T19:43:03.4638755Z cpuid level : 13 2025-05-07T19:43:03.4638832Z wp : yes 2025-05-07T19:43:03.4640899Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4641281Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4641359Z bogomips : 5999.99 2025-05-07T19:43:03.4641436Z clflush size : 64 2025-05-07T19:43:03.4641524Z cache_alignment : 64 2025-05-07T19:43:03.4641650Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4641727Z power management: 2025-05-07T19:43:03.4641731Z 2025-05-07T19:43:03.4641819Z processor : 64 2025-05-07T19:43:03.4641903Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4641980Z cpu family : 6 2025-05-07T19:43:03.4642053Z model : 85 2025-05-07T19:43:03.4642211Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4642291Z stepping : 7 2025-05-07T19:43:03.4642369Z microcode : 0x5003901 2025-05-07T19:43:03.4642460Z cpu MHz : 3249.107 2025-05-07T19:43:03.4642540Z cache size : 36608 KB 2025-05-07T19:43:03.4642614Z physical id : 0 2025-05-07T19:43:03.4642688Z siblings : 48 2025-05-07T19:43:03.4642770Z core id : 16 2025-05-07T19:43:03.4642848Z cpu cores : 24 2025-05-07T19:43:03.4642919Z apicid : 33 2025-05-07T19:43:03.4643006Z initial apicid : 33 2025-05-07T19:43:03.4643075Z fpu : yes 2025-05-07T19:43:03.4643180Z fpu_exception : yes 2025-05-07T19:43:03.4643258Z cpuid level : 13 2025-05-07T19:43:03.4643383Z wp : yes 2025-05-07T19:43:03.4645488Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4645862Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4645943Z bogomips : 5999.99 2025-05-07T19:43:03.4646019Z clflush size : 64 2025-05-07T19:43:03.4646101Z cache_alignment : 64 2025-05-07T19:43:03.4646230Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4646319Z power management: 2025-05-07T19:43:03.4646322Z 2025-05-07T19:43:03.4646395Z processor : 65 2025-05-07T19:43:03.4646487Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4646558Z cpu family : 6 2025-05-07T19:43:03.4646641Z model : 85 2025-05-07T19:43:03.4646788Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4646864Z stepping : 7 2025-05-07T19:43:03.4646941Z microcode : 0x5003901 2025-05-07T19:43:03.4647014Z cpu MHz : 3244.452 2025-05-07T19:43:03.4647101Z cache size : 36608 KB 2025-05-07T19:43:03.4647178Z physical id : 0 2025-05-07T19:43:03.4647250Z siblings : 48 2025-05-07T19:43:03.4647316Z core id : 17 2025-05-07T19:43:03.4647389Z cpu cores : 24 2025-05-07T19:43:03.4647457Z apicid : 35 2025-05-07T19:43:03.4647534Z initial apicid : 35 2025-05-07T19:43:03.4647606Z fpu : yes 2025-05-07T19:43:03.4647682Z fpu_exception : yes 2025-05-07T19:43:03.4647757Z cpuid level : 13 2025-05-07T19:43:03.4647824Z wp : yes 2025-05-07T19:43:03.4649900Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4650261Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4650347Z bogomips : 5999.99 2025-05-07T19:43:03.4650419Z clflush size : 64 2025-05-07T19:43:03.4650495Z cache_alignment : 64 2025-05-07T19:43:03.4650618Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4650703Z power management: 2025-05-07T19:43:03.4650710Z 2025-05-07T19:43:03.4650785Z processor : 66 2025-05-07T19:43:03.4650867Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4650945Z cpu family : 6 2025-05-07T19:43:03.4651014Z model : 85 2025-05-07T19:43:03.4651171Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4651243Z stepping : 7 2025-05-07T19:43:03.4651325Z microcode : 0x5003901 2025-05-07T19:43:03.4651399Z cpu MHz : 3783.987 2025-05-07T19:43:03.4651475Z cache size : 36608 KB 2025-05-07T19:43:03.4651557Z physical id : 0 2025-05-07T19:43:03.4651628Z siblings : 48 2025-05-07T19:43:03.4651704Z core id : 18 2025-05-07T19:43:03.4651776Z cpu cores : 24 2025-05-07T19:43:03.4651852Z apicid : 37 2025-05-07T19:43:03.4651928Z initial apicid : 37 2025-05-07T19:43:03.4651999Z fpu : yes 2025-05-07T19:43:03.4652075Z fpu_exception : yes 2025-05-07T19:43:03.4652159Z cpuid level : 13 2025-05-07T19:43:03.4652232Z wp : yes 2025-05-07T19:43:03.4654332Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4654748Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4654827Z bogomips : 5999.99 2025-05-07T19:43:03.4654912Z clflush size : 64 2025-05-07T19:43:03.4654989Z cache_alignment : 64 2025-05-07T19:43:03.4655108Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4655185Z power management: 2025-05-07T19:43:03.4655189Z 2025-05-07T19:43:03.4655270Z processor : 67 2025-05-07T19:43:03.4655353Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4655424Z cpu family : 6 2025-05-07T19:43:03.4655505Z model : 85 2025-05-07T19:43:03.4655656Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4655731Z stepping : 7 2025-05-07T19:43:03.4655809Z microcode : 0x5003901 2025-05-07T19:43:03.4655895Z cpu MHz : 2999.998 2025-05-07T19:43:03.4655969Z cache size : 36608 KB 2025-05-07T19:43:03.4656041Z physical id : 0 2025-05-07T19:43:03.4656120Z siblings : 48 2025-05-07T19:43:03.4656189Z core id : 19 2025-05-07T19:43:03.4656257Z cpu cores : 24 2025-05-07T19:43:03.4656328Z apicid : 39 2025-05-07T19:43:03.4656410Z initial apicid : 39 2025-05-07T19:43:03.4656476Z fpu : yes 2025-05-07T19:43:03.4656551Z fpu_exception : yes 2025-05-07T19:43:03.4656621Z cpuid level : 13 2025-05-07T19:43:03.4656692Z wp : yes 2025-05-07T19:43:03.4658751Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4659126Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4659202Z bogomips : 5999.99 2025-05-07T19:43:03.4659275Z clflush size : 64 2025-05-07T19:43:03.4659406Z cache_alignment : 64 2025-05-07T19:43:03.4659537Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4659612Z power management: 2025-05-07T19:43:03.4659616Z 2025-05-07T19:43:03.4659687Z processor : 68 2025-05-07T19:43:03.4659939Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4660014Z cpu family : 6 2025-05-07T19:43:03.4660085Z model : 85 2025-05-07T19:43:03.4660246Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4660322Z stepping : 7 2025-05-07T19:43:03.4660404Z microcode : 0x5003901 2025-05-07T19:43:03.4660477Z cpu MHz : 3257.803 2025-05-07T19:43:03.4660558Z cache size : 36608 KB 2025-05-07T19:43:03.4660650Z physical id : 0 2025-05-07T19:43:03.4660744Z siblings : 48 2025-05-07T19:43:03.4660834Z core id : 20 2025-05-07T19:43:03.4660950Z cpu cores : 24 2025-05-07T19:43:03.4661040Z apicid : 41 2025-05-07T19:43:03.4661136Z initial apicid : 41 2025-05-07T19:43:03.4661255Z fpu : yes 2025-05-07T19:43:03.4661358Z fpu_exception : yes 2025-05-07T19:43:03.4661450Z cpuid level : 13 2025-05-07T19:43:03.4661546Z wp : yes 2025-05-07T19:43:03.4663862Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4664322Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4664434Z bogomips : 5999.99 2025-05-07T19:43:03.4664526Z clflush size : 64 2025-05-07T19:43:03.4664620Z cache_alignment : 64 2025-05-07T19:43:03.4664762Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4664877Z power management: 2025-05-07T19:43:03.4664881Z 2025-05-07T19:43:03.4664973Z processor : 69 2025-05-07T19:43:03.4665073Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4665192Z cpu family : 6 2025-05-07T19:43:03.4665283Z model : 85 2025-05-07T19:43:03.4665455Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4665552Z stepping : 7 2025-05-07T19:43:03.4665675Z microcode : 0x5003901 2025-05-07T19:43:03.4665771Z cpu MHz : 2999.998 2025-05-07T19:43:03.4665866Z cache size : 36608 KB 2025-05-07T19:43:03.4665985Z physical id : 0 2025-05-07T19:43:03.4666078Z siblings : 48 2025-05-07T19:43:03.4666169Z core id : 21 2025-05-07T19:43:03.4666259Z cpu cores : 24 2025-05-07T19:43:03.4666371Z apicid : 43 2025-05-07T19:43:03.4666468Z initial apicid : 43 2025-05-07T19:43:03.4666556Z fpu : yes 2025-05-07T19:43:03.4666673Z fpu_exception : yes 2025-05-07T19:43:03.4666764Z cpuid level : 13 2025-05-07T19:43:03.4666851Z wp : yes 2025-05-07T19:43:03.4669121Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4669531Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4669624Z bogomips : 5999.99 2025-05-07T19:43:03.4669740Z clflush size : 64 2025-05-07T19:43:03.4669835Z cache_alignment : 64 2025-05-07T19:43:03.4669975Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4670068Z power management: 2025-05-07T19:43:03.4670073Z 2025-05-07T19:43:03.4670188Z processor : 70 2025-05-07T19:43:03.4670285Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4670374Z cpu family : 6 2025-05-07T19:43:03.4670489Z model : 85 2025-05-07T19:43:03.4670658Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4670747Z stepping : 7 2025-05-07T19:43:03.4670841Z microcode : 0x5003901 2025-05-07T19:43:03.4670953Z cpu MHz : 2999.998 2025-05-07T19:43:03.4671049Z cache size : 36608 KB 2025-05-07T19:43:03.4671141Z physical id : 0 2025-05-07T19:43:03.4671254Z siblings : 48 2025-05-07T19:43:03.4671342Z core id : 22 2025-05-07T19:43:03.4671431Z cpu cores : 24 2025-05-07T19:43:03.4671522Z apicid : 45 2025-05-07T19:43:03.4671637Z initial apicid : 45 2025-05-07T19:43:03.4671723Z fpu : yes 2025-05-07T19:43:03.4671817Z fpu_exception : yes 2025-05-07T19:43:03.4671932Z cpuid level : 13 2025-05-07T19:43:03.4672120Z wp : yes 2025-05-07T19:43:03.4674252Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4674735Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4674824Z bogomips : 5999.99 2025-05-07T19:43:03.4674911Z clflush size : 64 2025-05-07T19:43:03.4675031Z cache_alignment : 64 2025-05-07T19:43:03.4675162Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4675251Z power management: 2025-05-07T19:43:03.4675255Z 2025-05-07T19:43:03.4675344Z processor : 71 2025-05-07T19:43:03.4675466Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4675552Z cpu family : 6 2025-05-07T19:43:03.4675633Z model : 85 2025-05-07T19:43:03.4675815Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4675903Z stepping : 7 2025-05-07T19:43:03.4675989Z microcode : 0x5003901 2025-05-07T19:43:03.4676074Z cpu MHz : 2999.998 2025-05-07T19:43:03.4676187Z cache size : 36608 KB 2025-05-07T19:43:03.4676276Z physical id : 0 2025-05-07T19:43:03.4676361Z siblings : 48 2025-05-07T19:43:03.4676464Z core id : 23 2025-05-07T19:43:03.4676547Z cpu cores : 24 2025-05-07T19:43:03.4676629Z apicid : 47 2025-05-07T19:43:03.4676720Z initial apicid : 47 2025-05-07T19:43:03.4676825Z fpu : yes 2025-05-07T19:43:03.4676912Z fpu_exception : yes 2025-05-07T19:43:03.4676998Z cpuid level : 13 2025-05-07T19:43:03.4677102Z wp : yes 2025-05-07T19:43:03.4679167Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4679547Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4679654Z bogomips : 5999.99 2025-05-07T19:43:03.4679740Z clflush size : 64 2025-05-07T19:43:03.4679829Z cache_alignment : 64 2025-05-07T19:43:03.4679986Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4680078Z power management: 2025-05-07T19:43:03.4680082Z 2025-05-07T19:43:03.4680168Z processor : 72 2025-05-07T19:43:03.4680263Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4680372Z cpu family : 6 2025-05-07T19:43:03.4680458Z model : 85 2025-05-07T19:43:03.4680616Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4680733Z stepping : 7 2025-05-07T19:43:03.4680823Z microcode : 0x5003901 2025-05-07T19:43:03.4680909Z cpu MHz : 1202.348 2025-05-07T19:43:03.4681000Z cache size : 36608 KB 2025-05-07T19:43:03.4681112Z physical id : 1 2025-05-07T19:43:03.4681199Z siblings : 48 2025-05-07T19:43:03.4681284Z core id : 0 2025-05-07T19:43:03.4681398Z cpu cores : 24 2025-05-07T19:43:03.4681483Z apicid : 65 2025-05-07T19:43:03.4681572Z initial apicid : 65 2025-05-07T19:43:03.4681653Z fpu : yes 2025-05-07T19:43:03.4681759Z fpu_exception : yes 2025-05-07T19:43:03.4681843Z cpuid level : 13 2025-05-07T19:43:03.4682048Z wp : yes 2025-05-07T19:43:03.4684509Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4685759Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4685865Z bogomips : 5999.99 2025-05-07T19:43:03.4685984Z clflush size : 64 2025-05-07T19:43:03.4686080Z cache_alignment : 64 2025-05-07T19:43:03.4686228Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4686348Z power management: 2025-05-07T19:43:03.4686353Z 2025-05-07T19:43:03.4686450Z processor : 73 2025-05-07T19:43:03.4686540Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4686615Z cpu family : 6 2025-05-07T19:43:03.4686693Z model : 85 2025-05-07T19:43:03.4686852Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4686930Z stepping : 7 2025-05-07T19:43:03.4687022Z microcode : 0x5003901 2025-05-07T19:43:03.4687103Z cpu MHz : 1203.352 2025-05-07T19:43:03.4687180Z cache size : 36608 KB 2025-05-07T19:43:03.4687259Z physical id : 1 2025-05-07T19:43:03.4687344Z siblings : 48 2025-05-07T19:43:03.4687425Z core id : 1 2025-05-07T19:43:03.4687502Z cpu cores : 24 2025-05-07T19:43:03.4687586Z apicid : 67 2025-05-07T19:43:03.4687667Z initial apicid : 67 2025-05-07T19:43:03.4687741Z fpu : yes 2025-05-07T19:43:03.4687820Z fpu_exception : yes 2025-05-07T19:43:03.4687911Z cpuid level : 13 2025-05-07T19:43:03.4687984Z wp : yes 2025-05-07T19:43:03.4690218Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4690626Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4690705Z bogomips : 5999.99 2025-05-07T19:43:03.4690782Z clflush size : 64 2025-05-07T19:43:03.4690873Z cache_alignment : 64 2025-05-07T19:43:03.4691002Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4691083Z power management: 2025-05-07T19:43:03.4691088Z 2025-05-07T19:43:03.4691176Z processor : 74 2025-05-07T19:43:03.4691262Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4691336Z cpu family : 6 2025-05-07T19:43:03.4691409Z model : 85 2025-05-07T19:43:03.4691572Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4691647Z stepping : 7 2025-05-07T19:43:03.4691726Z microcode : 0x5003901 2025-05-07T19:43:03.4691811Z cpu MHz : 1187.019 2025-05-07T19:43:03.4691888Z cache size : 36608 KB 2025-05-07T19:43:03.4691965Z physical id : 1 2025-05-07T19:43:03.4692037Z siblings : 48 2025-05-07T19:43:03.4692114Z core id : 2 2025-05-07T19:43:03.4692192Z cpu cores : 24 2025-05-07T19:43:03.4692266Z apicid : 69 2025-05-07T19:43:03.4692345Z initial apicid : 69 2025-05-07T19:43:03.4692427Z fpu : yes 2025-05-07T19:43:03.4692505Z fpu_exception : yes 2025-05-07T19:43:03.4692581Z cpuid level : 13 2025-05-07T19:43:03.4692657Z wp : yes 2025-05-07T19:43:03.4694984Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4695555Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4695634Z bogomips : 5999.99 2025-05-07T19:43:03.4695709Z clflush size : 64 2025-05-07T19:43:03.4695787Z cache_alignment : 64 2025-05-07T19:43:03.4695910Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4695984Z power management: 2025-05-07T19:43:03.4695987Z 2025-05-07T19:43:03.4696059Z processor : 75 2025-05-07T19:43:03.4696147Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4696218Z cpu family : 6 2025-05-07T19:43:03.4696286Z model : 85 2025-05-07T19:43:03.4696429Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4696510Z stepping : 7 2025-05-07T19:43:03.4696584Z microcode : 0x5003901 2025-05-07T19:43:03.4696654Z cpu MHz : 1213.753 2025-05-07T19:43:03.4696729Z cache size : 36608 KB 2025-05-07T19:43:03.4696805Z physical id : 1 2025-05-07T19:43:03.4696873Z siblings : 48 2025-05-07T19:43:03.4696946Z core id : 3 2025-05-07T19:43:03.4697027Z cpu cores : 24 2025-05-07T19:43:03.4697094Z apicid : 71 2025-05-07T19:43:03.4697175Z initial apicid : 71 2025-05-07T19:43:03.4697240Z fpu : yes 2025-05-07T19:43:03.4697322Z fpu_exception : yes 2025-05-07T19:43:03.4697392Z cpuid level : 13 2025-05-07T19:43:03.4697461Z wp : yes 2025-05-07T19:43:03.4699590Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4700142Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4700222Z bogomips : 5999.99 2025-05-07T19:43:03.4700306Z clflush size : 64 2025-05-07T19:43:03.4700388Z cache_alignment : 64 2025-05-07T19:43:03.4700516Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4700607Z power management: 2025-05-07T19:43:03.4700612Z 2025-05-07T19:43:03.4700691Z processor : 76 2025-05-07T19:43:03.4700775Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4700847Z cpu family : 6 2025-05-07T19:43:03.4700926Z model : 85 2025-05-07T19:43:03.4701081Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4701155Z stepping : 7 2025-05-07T19:43:03.4701241Z microcode : 0x5003901 2025-05-07T19:43:03.4701316Z cpu MHz : 2999.998 2025-05-07T19:43:03.4701394Z cache size : 36608 KB 2025-05-07T19:43:03.4701472Z physical id : 1 2025-05-07T19:43:03.4701552Z siblings : 48 2025-05-07T19:43:03.4701626Z core id : 4 2025-05-07T19:43:03.4701703Z cpu cores : 24 2025-05-07T19:43:03.4701794Z apicid : 73 2025-05-07T19:43:03.4701879Z initial apicid : 73 2025-05-07T19:43:03.4701958Z fpu : yes 2025-05-07T19:43:03.4702038Z fpu_exception : yes 2025-05-07T19:43:03.4702120Z cpuid level : 13 2025-05-07T19:43:03.4702192Z wp : yes 2025-05-07T19:43:03.4704413Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4704875Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4704961Z bogomips : 5999.99 2025-05-07T19:43:03.4705101Z clflush size : 64 2025-05-07T19:43:03.4705190Z cache_alignment : 64 2025-05-07T19:43:03.4705318Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4705400Z power management: 2025-05-07T19:43:03.4705404Z 2025-05-07T19:43:03.4705490Z processor : 77 2025-05-07T19:43:03.4705581Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4705660Z cpu family : 6 2025-05-07T19:43:03.4705736Z model : 85 2025-05-07T19:43:03.4705906Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4705987Z stepping : 7 2025-05-07T19:43:03.4706071Z microcode : 0x5003901 2025-05-07T19:43:03.4706164Z cpu MHz : 1203.060 2025-05-07T19:43:03.4706248Z cache size : 36608 KB 2025-05-07T19:43:03.4706328Z physical id : 1 2025-05-07T19:43:03.4706409Z siblings : 48 2025-05-07T19:43:03.4706490Z core id : 5 2025-05-07T19:43:03.4706566Z cpu cores : 24 2025-05-07T19:43:03.4706639Z apicid : 75 2025-05-07T19:43:03.4706734Z initial apicid : 75 2025-05-07T19:43:03.4706811Z fpu : yes 2025-05-07T19:43:03.4706894Z fpu_exception : yes 2025-05-07T19:43:03.4706981Z cpuid level : 13 2025-05-07T19:43:03.4707069Z wp : yes 2025-05-07T19:43:03.4709300Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4709715Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4709800Z bogomips : 5999.99 2025-05-07T19:43:03.4709879Z clflush size : 64 2025-05-07T19:43:03.4709964Z cache_alignment : 64 2025-05-07T19:43:03.4710100Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4710179Z power management: 2025-05-07T19:43:03.4710184Z 2025-05-07T19:43:03.4710266Z processor : 78 2025-05-07T19:43:03.4710361Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4710434Z cpu family : 6 2025-05-07T19:43:03.4710511Z model : 85 2025-05-07T19:43:03.4710671Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4710755Z stepping : 7 2025-05-07T19:43:03.4710833Z microcode : 0x5003901 2025-05-07T19:43:03.4710907Z cpu MHz : 1204.234 2025-05-07T19:43:03.4711002Z cache size : 36608 KB 2025-05-07T19:43:03.4711081Z physical id : 1 2025-05-07T19:43:03.4711159Z siblings : 48 2025-05-07T19:43:03.4711234Z core id : 6 2025-05-07T19:43:03.4711322Z cpu cores : 24 2025-05-07T19:43:03.4711402Z apicid : 77 2025-05-07T19:43:03.4711482Z initial apicid : 77 2025-05-07T19:43:03.4711560Z fpu : yes 2025-05-07T19:43:03.4711647Z fpu_exception : yes 2025-05-07T19:43:03.4711726Z cpuid level : 13 2025-05-07T19:43:03.4711803Z wp : yes 2025-05-07T19:43:03.4713999Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4714365Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4714503Z bogomips : 5999.99 2025-05-07T19:43:03.4714573Z clflush size : 64 2025-05-07T19:43:03.4714648Z cache_alignment : 64 2025-05-07T19:43:03.4714818Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4714898Z power management: 2025-05-07T19:43:03.4714902Z 2025-05-07T19:43:03.4714973Z processor : 79 2025-05-07T19:43:03.4715060Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4715143Z cpu family : 6 2025-05-07T19:43:03.4715213Z model : 85 2025-05-07T19:43:03.4715364Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4715433Z stepping : 7 2025-05-07T19:43:03.4715524Z microcode : 0x5003901 2025-05-07T19:43:03.4715601Z cpu MHz : 2999.998 2025-05-07T19:43:03.4715673Z cache size : 36608 KB 2025-05-07T19:43:03.4715749Z physical id : 1 2025-05-07T19:43:03.4715817Z siblings : 48 2025-05-07T19:43:03.4715895Z core id : 7 2025-05-07T19:43:03.4715966Z cpu cores : 24 2025-05-07T19:43:03.4716058Z apicid : 79 2025-05-07T19:43:03.4716131Z initial apicid : 79 2025-05-07T19:43:03.4716202Z fpu : yes 2025-05-07T19:43:03.4716275Z fpu_exception : yes 2025-05-07T19:43:03.4716356Z cpuid level : 13 2025-05-07T19:43:03.4716426Z wp : yes 2025-05-07T19:43:03.4718483Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4718852Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4718928Z bogomips : 5999.99 2025-05-07T19:43:03.4719000Z clflush size : 64 2025-05-07T19:43:03.4719080Z cache_alignment : 64 2025-05-07T19:43:03.4719198Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4719274Z power management: 2025-05-07T19:43:03.4719278Z 2025-05-07T19:43:03.4719352Z processor : 80 2025-05-07T19:43:03.4719431Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4719501Z cpu family : 6 2025-05-07T19:43:03.4719570Z model : 85 2025-05-07T19:43:03.4719715Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4719782Z stepping : 7 2025-05-07T19:43:03.4719858Z microcode : 0x5003901 2025-05-07T19:43:03.4719941Z cpu MHz : 1194.746 2025-05-07T19:43:03.4720019Z cache size : 36608 KB 2025-05-07T19:43:03.4720092Z physical id : 1 2025-05-07T19:43:03.4720166Z siblings : 48 2025-05-07T19:43:03.4720235Z core id : 8 2025-05-07T19:43:03.4720305Z cpu cores : 24 2025-05-07T19:43:03.4720377Z apicid : 81 2025-05-07T19:43:03.4720456Z initial apicid : 81 2025-05-07T19:43:03.4720522Z fpu : yes 2025-05-07T19:43:03.4720594Z fpu_exception : yes 2025-05-07T19:43:03.4720662Z cpuid level : 13 2025-05-07T19:43:03.4720729Z wp : yes 2025-05-07T19:43:03.4722779Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4723145Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4723262Z bogomips : 5999.99 2025-05-07T19:43:03.4723333Z clflush size : 64 2025-05-07T19:43:03.4723406Z cache_alignment : 64 2025-05-07T19:43:03.4723524Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4723596Z power management: 2025-05-07T19:43:03.4723646Z 2025-05-07T19:43:03.4723717Z processor : 81 2025-05-07T19:43:03.4723799Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4723868Z cpu family : 6 2025-05-07T19:43:03.4723934Z model : 85 2025-05-07T19:43:03.4724075Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4724150Z stepping : 7 2025-05-07T19:43:03.4724223Z microcode : 0x5003901 2025-05-07T19:43:03.4724291Z cpu MHz : 1192.685 2025-05-07T19:43:03.4724369Z cache size : 36608 KB 2025-05-07T19:43:03.4724438Z physical id : 1 2025-05-07T19:43:03.4724505Z siblings : 48 2025-05-07T19:43:03.4724569Z core id : 9 2025-05-07T19:43:03.4724640Z cpu cores : 24 2025-05-07T19:43:03.4724708Z apicid : 83 2025-05-07T19:43:03.4724779Z initial apicid : 83 2025-05-07T19:43:03.4724851Z fpu : yes 2025-05-07T19:43:03.4724924Z fpu_exception : yes 2025-05-07T19:43:03.4724995Z cpuid level : 13 2025-05-07T19:43:03.4725060Z wp : yes 2025-05-07T19:43:03.4727122Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4727489Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4727573Z bogomips : 5999.99 2025-05-07T19:43:03.4727649Z clflush size : 64 2025-05-07T19:43:03.4727724Z cache_alignment : 64 2025-05-07T19:43:03.4727844Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4727924Z power management: 2025-05-07T19:43:03.4727928Z 2025-05-07T19:43:03.4728000Z processor : 82 2025-05-07T19:43:03.4728077Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4728159Z cpu family : 6 2025-05-07T19:43:03.4728232Z model : 85 2025-05-07T19:43:03.4728375Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4728443Z stepping : 7 2025-05-07T19:43:03.4728523Z microcode : 0x5003901 2025-05-07T19:43:03.4728590Z cpu MHz : 2999.998 2025-05-07T19:43:03.4728664Z cache size : 36608 KB 2025-05-07T19:43:03.4728744Z physical id : 1 2025-05-07T19:43:03.4728811Z siblings : 48 2025-05-07T19:43:03.4728878Z core id : 10 2025-05-07T19:43:03.4728946Z cpu cores : 24 2025-05-07T19:43:03.4729020Z apicid : 85 2025-05-07T19:43:03.4729092Z initial apicid : 85 2025-05-07T19:43:03.4729157Z fpu : yes 2025-05-07T19:43:03.4729239Z fpu_exception : yes 2025-05-07T19:43:03.4729308Z cpuid level : 13 2025-05-07T19:43:03.4729375Z wp : yes 2025-05-07T19:43:03.4731439Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4731801Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4731872Z bogomips : 5999.99 2025-05-07T19:43:03.4731954Z clflush size : 64 2025-05-07T19:43:03.4732028Z cache_alignment : 64 2025-05-07T19:43:03.4732216Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4732288Z power management: 2025-05-07T19:43:03.4732293Z 2025-05-07T19:43:03.4732369Z processor : 83 2025-05-07T19:43:03.4732500Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4732571Z cpu family : 6 2025-05-07T19:43:03.4732640Z model : 85 2025-05-07T19:43:03.4732782Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4732851Z stepping : 7 2025-05-07T19:43:03.4732923Z microcode : 0x5003901 2025-05-07T19:43:03.4732998Z cpu MHz : 2999.998 2025-05-07T19:43:03.4733068Z cache size : 36608 KB 2025-05-07T19:43:03.4733137Z physical id : 1 2025-05-07T19:43:03.4733211Z siblings : 48 2025-05-07T19:43:03.4733278Z core id : 11 2025-05-07T19:43:03.4733345Z cpu cores : 24 2025-05-07T19:43:03.4733413Z apicid : 87 2025-05-07T19:43:03.4733491Z initial apicid : 87 2025-05-07T19:43:03.4733557Z fpu : yes 2025-05-07T19:43:03.4733631Z fpu_exception : yes 2025-05-07T19:43:03.4733711Z cpuid level : 13 2025-05-07T19:43:03.4733778Z wp : yes 2025-05-07T19:43:03.4735828Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4736194Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4736269Z bogomips : 5999.99 2025-05-07T19:43:03.4736339Z clflush size : 64 2025-05-07T19:43:03.4736418Z cache_alignment : 64 2025-05-07T19:43:03.4736533Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4736615Z power management: 2025-05-07T19:43:03.4736619Z 2025-05-07T19:43:03.4736691Z processor : 84 2025-05-07T19:43:03.4736784Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4736862Z cpu family : 6 2025-05-07T19:43:03.4736931Z model : 85 2025-05-07T19:43:03.4737081Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4737158Z stepping : 7 2025-05-07T19:43:03.4737231Z microcode : 0x5003901 2025-05-07T19:43:03.4737305Z cpu MHz : 2999.998 2025-05-07T19:43:03.4737386Z cache size : 36608 KB 2025-05-07T19:43:03.4737457Z physical id : 1 2025-05-07T19:43:03.4737527Z siblings : 48 2025-05-07T19:43:03.4737608Z core id : 12 2025-05-07T19:43:03.4737678Z cpu cores : 24 2025-05-07T19:43:03.4737746Z apicid : 89 2025-05-07T19:43:03.4737817Z initial apicid : 89 2025-05-07T19:43:03.4737891Z fpu : yes 2025-05-07T19:43:03.4737965Z fpu_exception : yes 2025-05-07T19:43:03.4738034Z cpuid level : 13 2025-05-07T19:43:03.4738099Z wp : yes 2025-05-07T19:43:03.4740433Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4740830Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4740916Z bogomips : 5999.99 2025-05-07T19:43:03.4740999Z clflush size : 64 2025-05-07T19:43:03.4741109Z cache_alignment : 64 2025-05-07T19:43:03.4741244Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4741379Z power management: 2025-05-07T19:43:03.4741384Z 2025-05-07T19:43:03.4741463Z processor : 85 2025-05-07T19:43:03.4741547Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4741627Z cpu family : 6 2025-05-07T19:43:03.4741701Z model : 85 2025-05-07T19:43:03.4741920Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4742002Z stepping : 7 2025-05-07T19:43:03.4742083Z microcode : 0x5003901 2025-05-07T19:43:03.4742162Z cpu MHz : 2999.998 2025-05-07T19:43:03.4742244Z cache size : 36608 KB 2025-05-07T19:43:03.4742331Z physical id : 1 2025-05-07T19:43:03.4742405Z siblings : 48 2025-05-07T19:43:03.4742479Z core id : 13 2025-05-07T19:43:03.4742575Z cpu cores : 24 2025-05-07T19:43:03.4742650Z apicid : 91 2025-05-07T19:43:03.4742734Z initial apicid : 91 2025-05-07T19:43:03.4742807Z fpu : yes 2025-05-07T19:43:03.4742900Z fpu_exception : yes 2025-05-07T19:43:03.4742980Z cpuid level : 13 2025-05-07T19:43:03.4743052Z wp : yes 2025-05-07T19:43:03.4745295Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4745697Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4745777Z bogomips : 5999.99 2025-05-07T19:43:03.4745864Z clflush size : 64 2025-05-07T19:43:03.4745952Z cache_alignment : 64 2025-05-07T19:43:03.4746079Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4746172Z power management: 2025-05-07T19:43:03.4746180Z 2025-05-07T19:43:03.4746260Z processor : 86 2025-05-07T19:43:03.4746344Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4746419Z cpu family : 6 2025-05-07T19:43:03.4746497Z model : 85 2025-05-07T19:43:03.4746657Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4746736Z stepping : 7 2025-05-07T19:43:03.4746819Z microcode : 0x5003901 2025-05-07T19:43:03.4746893Z cpu MHz : 1219.809 2025-05-07T19:43:03.4746972Z cache size : 36608 KB 2025-05-07T19:43:03.4747048Z physical id : 1 2025-05-07T19:43:03.4747134Z siblings : 48 2025-05-07T19:43:03.4747208Z core id : 14 2025-05-07T19:43:03.4747280Z cpu cores : 24 2025-05-07T19:43:03.4747355Z apicid : 93 2025-05-07T19:43:03.4747445Z initial apicid : 93 2025-05-07T19:43:03.4747517Z fpu : yes 2025-05-07T19:43:03.4747597Z fpu_exception : yes 2025-05-07T19:43:03.4747694Z cpuid level : 13 2025-05-07T19:43:03.4747768Z wp : yes 2025-05-07T19:43:03.4750008Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4750414Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4750496Z bogomips : 5999.99 2025-05-07T19:43:03.4750576Z clflush size : 64 2025-05-07T19:43:03.4750669Z cache_alignment : 64 2025-05-07T19:43:03.4750793Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4750874Z power management: 2025-05-07T19:43:03.4750878Z 2025-05-07T19:43:03.4750986Z processor : 87 2025-05-07T19:43:03.4751121Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4751196Z cpu family : 6 2025-05-07T19:43:03.4751268Z model : 85 2025-05-07T19:43:03.4751434Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4751564Z stepping : 7 2025-05-07T19:43:03.4751647Z microcode : 0x5003901 2025-05-07T19:43:03.4751736Z cpu MHz : 1204.250 2025-05-07T19:43:03.4751815Z cache size : 36608 KB 2025-05-07T19:43:03.4751889Z physical id : 1 2025-05-07T19:43:03.4751963Z siblings : 48 2025-05-07T19:43:03.4752150Z core id : 15 2025-05-07T19:43:03.4752220Z cpu cores : 24 2025-05-07T19:43:03.4752289Z apicid : 95 2025-05-07T19:43:03.4752364Z initial apicid : 95 2025-05-07T19:43:03.4752441Z fpu : yes 2025-05-07T19:43:03.4752514Z fpu_exception : yes 2025-05-07T19:43:03.4752586Z cpuid level : 13 2025-05-07T19:43:03.4752661Z wp : yes 2025-05-07T19:43:03.4754713Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4755082Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4755163Z bogomips : 5999.99 2025-05-07T19:43:03.4755240Z clflush size : 64 2025-05-07T19:43:03.4755313Z cache_alignment : 64 2025-05-07T19:43:03.4755436Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4755511Z power management: 2025-05-07T19:43:03.4755515Z 2025-05-07T19:43:03.4755587Z processor : 88 2025-05-07T19:43:03.4755667Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4755748Z cpu family : 6 2025-05-07T19:43:03.4755816Z model : 85 2025-05-07T19:43:03.4755956Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4756040Z stepping : 7 2025-05-07T19:43:03.4756119Z microcode : 0x5003901 2025-05-07T19:43:03.4756191Z cpu MHz : 1202.048 2025-05-07T19:43:03.4756265Z cache size : 36608 KB 2025-05-07T19:43:03.4756345Z physical id : 1 2025-05-07T19:43:03.4756415Z siblings : 48 2025-05-07T19:43:03.4756484Z core id : 16 2025-05-07T19:43:03.4756569Z cpu cores : 24 2025-05-07T19:43:03.4756635Z apicid : 97 2025-05-07T19:43:03.4756710Z initial apicid : 97 2025-05-07T19:43:03.4756780Z fpu : yes 2025-05-07T19:43:03.4756864Z fpu_exception : yes 2025-05-07T19:43:03.4756935Z cpuid level : 13 2025-05-07T19:43:03.4757003Z wp : yes 2025-05-07T19:43:03.4759075Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4759441Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4759514Z bogomips : 5999.99 2025-05-07T19:43:03.4759601Z clflush size : 64 2025-05-07T19:43:03.4759679Z cache_alignment : 64 2025-05-07T19:43:03.4759795Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4759879Z power management: 2025-05-07T19:43:03.4759883Z 2025-05-07T19:43:03.4759956Z processor : 89 2025-05-07T19:43:03.4760036Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4760104Z cpu family : 6 2025-05-07T19:43:03.4760225Z model : 85 2025-05-07T19:43:03.4760374Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4760447Z stepping : 7 2025-05-07T19:43:03.4760525Z microcode : 0x5003901 2025-05-07T19:43:03.4760639Z cpu MHz : 1203.706 2025-05-07T19:43:03.4760714Z cache size : 36608 KB 2025-05-07T19:43:03.4760783Z physical id : 1 2025-05-07T19:43:03.4760859Z siblings : 48 2025-05-07T19:43:03.4760929Z core id : 17 2025-05-07T19:43:03.4761006Z cpu cores : 24 2025-05-07T19:43:03.4761082Z apicid : 99 2025-05-07T19:43:03.4761158Z initial apicid : 99 2025-05-07T19:43:03.4761227Z fpu : yes 2025-05-07T19:43:03.4761306Z fpu_exception : yes 2025-05-07T19:43:03.4761384Z cpuid level : 13 2025-05-07T19:43:03.4761450Z wp : yes 2025-05-07T19:43:03.4763507Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4763886Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4763958Z bogomips : 5999.99 2025-05-07T19:43:03.4764027Z clflush size : 64 2025-05-07T19:43:03.4764105Z cache_alignment : 64 2025-05-07T19:43:03.4764229Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4764306Z power management: 2025-05-07T19:43:03.4764310Z 2025-05-07T19:43:03.4764396Z processor : 90 2025-05-07T19:43:03.4764478Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4764545Z cpu family : 6 2025-05-07T19:43:03.4764611Z model : 85 2025-05-07T19:43:03.4764767Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4764834Z stepping : 7 2025-05-07T19:43:03.4764905Z microcode : 0x5003901 2025-05-07T19:43:03.4764976Z cpu MHz : 1202.152 2025-05-07T19:43:03.4765054Z cache size : 36608 KB 2025-05-07T19:43:03.4765123Z physical id : 1 2025-05-07T19:43:03.4765192Z siblings : 48 2025-05-07T19:43:03.4765268Z core id : 18 2025-05-07T19:43:03.4765345Z cpu cores : 24 2025-05-07T19:43:03.4765413Z apicid : 101 2025-05-07T19:43:03.4765495Z initial apicid : 101 2025-05-07T19:43:03.4765564Z fpu : yes 2025-05-07T19:43:03.4765638Z fpu_exception : yes 2025-05-07T19:43:03.4765711Z cpuid level : 13 2025-05-07T19:43:03.4765782Z wp : yes 2025-05-07T19:43:03.4767845Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4768219Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4768292Z bogomips : 5999.99 2025-05-07T19:43:03.4768365Z clflush size : 64 2025-05-07T19:43:03.4768440Z cache_alignment : 64 2025-05-07T19:43:03.4768564Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4768642Z power management: 2025-05-07T19:43:03.4768646Z 2025-05-07T19:43:03.4768718Z processor : 91 2025-05-07T19:43:03.4768804Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4768873Z cpu family : 6 2025-05-07T19:43:03.4768944Z model : 85 2025-05-07T19:43:03.4769111Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4769238Z stepping : 7 2025-05-07T19:43:03.4769314Z microcode : 0x5003901 2025-05-07T19:43:03.4769386Z cpu MHz : 1195.443 2025-05-07T19:43:03.4769466Z cache size : 36608 KB 2025-05-07T19:43:03.4769585Z physical id : 1 2025-05-07T19:43:03.4769656Z siblings : 48 2025-05-07T19:43:03.4769726Z core id : 19 2025-05-07T19:43:03.4769801Z cpu cores : 24 2025-05-07T19:43:03.4769870Z apicid : 103 2025-05-07T19:43:03.4769950Z initial apicid : 103 2025-05-07T19:43:03.4770032Z fpu : yes 2025-05-07T19:43:03.4770106Z fpu_exception : yes 2025-05-07T19:43:03.4770179Z cpuid level : 13 2025-05-07T19:43:03.4770246Z wp : yes 2025-05-07T19:43:03.4772314Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4772680Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4772758Z bogomips : 5999.99 2025-05-07T19:43:03.4772837Z clflush size : 64 2025-05-07T19:43:03.4772908Z cache_alignment : 64 2025-05-07T19:43:03.4773027Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4773111Z power management: 2025-05-07T19:43:03.4773115Z 2025-05-07T19:43:03.4773188Z processor : 92 2025-05-07T19:43:03.4773267Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4773352Z cpu family : 6 2025-05-07T19:43:03.4773420Z model : 85 2025-05-07T19:43:03.4773569Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4773643Z stepping : 7 2025-05-07T19:43:03.4773723Z microcode : 0x5003901 2025-05-07T19:43:03.4773791Z cpu MHz : 1202.476 2025-05-07T19:43:03.4773867Z cache size : 36608 KB 2025-05-07T19:43:03.4773948Z physical id : 1 2025-05-07T19:43:03.4774018Z siblings : 48 2025-05-07T19:43:03.4774086Z core id : 20 2025-05-07T19:43:03.4774157Z cpu cores : 24 2025-05-07T19:43:03.4774231Z apicid : 105 2025-05-07T19:43:03.4774304Z initial apicid : 105 2025-05-07T19:43:03.4774370Z fpu : yes 2025-05-07T19:43:03.4774450Z fpu_exception : yes 2025-05-07T19:43:03.4774531Z cpuid level : 13 2025-05-07T19:43:03.4774598Z wp : yes 2025-05-07T19:43:03.4776652Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4777032Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4777102Z bogomips : 5999.99 2025-05-07T19:43:03.4777188Z clflush size : 64 2025-05-07T19:43:03.4777271Z cache_alignment : 64 2025-05-07T19:43:03.4777390Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4777464Z power management: 2025-05-07T19:43:03.4777468Z 2025-05-07T19:43:03.4777550Z processor : 93 2025-05-07T19:43:03.4777628Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4777698Z cpu family : 6 2025-05-07T19:43:03.4777777Z model : 85 2025-05-07T19:43:03.4777922Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4777997Z stepping : 7 2025-05-07T19:43:03.4778131Z microcode : 0x5003901 2025-05-07T19:43:03.4778214Z cpu MHz : 1202.289 2025-05-07T19:43:03.4778291Z cache size : 36608 KB 2025-05-07T19:43:03.4778371Z physical id : 1 2025-05-07T19:43:03.4778454Z siblings : 48 2025-05-07T19:43:03.4778524Z core id : 21 2025-05-07T19:43:03.4778641Z cpu cores : 24 2025-05-07T19:43:03.4778715Z apicid : 107 2025-05-07T19:43:03.4778799Z initial apicid : 107 2025-05-07T19:43:03.4778867Z fpu : yes 2025-05-07T19:43:03.4778942Z fpu_exception : yes 2025-05-07T19:43:03.4779014Z cpuid level : 13 2025-05-07T19:43:03.4779096Z wp : yes 2025-05-07T19:43:03.4781721Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4782265Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4782351Z bogomips : 5999.99 2025-05-07T19:43:03.4782433Z clflush size : 64 2025-05-07T19:43:03.4782518Z cache_alignment : 64 2025-05-07T19:43:03.4782659Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4782743Z power management: 2025-05-07T19:43:03.4782747Z 2025-05-07T19:43:03.4782829Z processor : 94 2025-05-07T19:43:03.4782928Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4783008Z cpu family : 6 2025-05-07T19:43:03.4783080Z model : 85 2025-05-07T19:43:03.4783250Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4783330Z stepping : 7 2025-05-07T19:43:03.4783411Z microcode : 0x5003901 2025-05-07T19:43:03.4783496Z cpu MHz : 2999.998 2025-05-07T19:43:03.4783591Z cache size : 36608 KB 2025-05-07T19:43:03.4783671Z physical id : 1 2025-05-07T19:43:03.4783751Z siblings : 48 2025-05-07T19:43:03.4783829Z core id : 22 2025-05-07T19:43:03.4783911Z cpu cores : 24 2025-05-07T19:43:03.4783991Z apicid : 109 2025-05-07T19:43:03.4784078Z initial apicid : 109 2025-05-07T19:43:03.4784161Z fpu : yes 2025-05-07T19:43:03.4784242Z fpu_exception : yes 2025-05-07T19:43:03.4784319Z cpuid level : 13 2025-05-07T19:43:03.4784391Z wp : yes 2025-05-07T19:43:03.4786632Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4787033Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4787125Z bogomips : 5999.99 2025-05-07T19:43:03.4787205Z clflush size : 64 2025-05-07T19:43:03.4787291Z cache_alignment : 64 2025-05-07T19:43:03.4787420Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4787510Z power management: 2025-05-07T19:43:03.4787514Z 2025-05-07T19:43:03.4787589Z processor : 95 2025-05-07T19:43:03.4787675Z vendor_id : GenuineIntel 2025-05-07T19:43:03.4787759Z cpu family : 6 2025-05-07T19:43:03.4787832Z model : 85 2025-05-07T19:43:03.4787987Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.4788063Z stepping : 7 2025-05-07T19:43:03.4788158Z microcode : 0x5003901 2025-05-07T19:43:03.4788235Z cpu MHz : 1204.205 2025-05-07T19:43:03.4788404Z cache size : 36608 KB 2025-05-07T19:43:03.4788497Z physical id : 1 2025-05-07T19:43:03.4788573Z siblings : 48 2025-05-07T19:43:03.4788648Z core id : 23 2025-05-07T19:43:03.4788724Z cpu cores : 24 2025-05-07T19:43:03.4788814Z apicid : 111 2025-05-07T19:43:03.4788962Z initial apicid : 111 2025-05-07T19:43:03.4789038Z fpu : yes 2025-05-07T19:43:03.4789130Z fpu_exception : yes 2025-05-07T19:43:03.4789209Z cpuid level : 13 2025-05-07T19:43:03.4789279Z wp : yes 2025-05-07T19:43:03.4791523Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:03.4791922Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.4792007Z bogomips : 5999.99 2025-05-07T19:43:03.4792091Z clflush size : 64 2025-05-07T19:43:03.4792177Z cache_alignment : 64 2025-05-07T19:43:03.4792306Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.4792386Z power management: 2025-05-07T19:43:03.4792390Z 2025-05-07T19:43:03.4792403Z 2025-05-07T19:43:03.4792517Z ################################################################################ 2025-05-07T19:43:03.4792611Z [INFO] Print PCI info ... 2025-05-07T19:43:03.4792690Z + lspci -v 2025-05-07T19:43:03.4792694Z 2025-05-07T19:43:03.4792884Z 00:00.0 Host bridge: Intel Corporation 440FX - 82441FX PMC [Natoma] 2025-05-07T19:43:03.4792997Z Subsystem: Amazon.com, Inc. Device 1237 2025-05-07T19:43:03.4793114Z Flags: bus master, medium devsel, latency 0 2025-05-07T19:43:03.4793122Z 2025-05-07T19:43:03.4793333Z 00:01.0 ISA bridge: Intel Corporation 82371SB PIIX3 ISA [Natoma/Triton II] 2025-05-07T19:43:03.4793417Z Physical Slot: 1 2025-05-07T19:43:03.4793527Z Flags: bus master, fast devsel, latency 0 2025-05-07T19:43:03.4793532Z 2025-05-07T19:43:03.4793801Z 00:01.3 Non-VGA unclassified device: Intel Corporation 82371AB/EB/MB PIIX4 ACPI (rev 08) 2025-05-07T19:43:03.4793996Z Physical Slot: 1 2025-05-07T19:43:03.4794116Z Flags: bus master, fast devsel, latency 0, IRQ 9 2025-05-07T19:43:03.4794121Z 2025-05-07T19:43:03.4794496Z 00:03.0 VGA compatible controller: Amazon.com, Inc. 
Device 1111 (prog-if 00 [VGA controller]) 2025-05-07T19:43:03.4794570Z Physical Slot: 3 2025-05-07T19:43:03.4794669Z Flags: bus master, fast devsel, latency 0 2025-05-07T19:43:03.4794794Z Memory at c0000000 (32-bit, prefetchable) [size=4M] 2025-05-07T19:43:03.4794916Z Expansion ROM at 000c0000 [disabled] [size=128K] 2025-05-07T19:43:03.4794920Z 2025-05-07T19:43:03.4795220Z 00:04.0 Non-Volatile memory controller: Amazon.com, Inc. NVMe EBS Controller (prog-if 02 [NVM Express]) 2025-05-07T19:43:03.4795318Z Subsystem: Amazon.com, Inc. Device 0000 2025-05-07T19:43:03.4795399Z Physical Slot: 4 2025-05-07T19:43:03.4795517Z Flags: bus master, fast devsel, latency 0, IRQ 11 2025-05-07T19:43:03.4795659Z Memory at c0514000 (32-bit, non-prefetchable) [size=16K] 2025-05-07T19:43:03.4795760Z Capabilities: 2025-05-07T19:43:03.4795845Z Kernel driver in use: nvme 2025-05-07T19:43:03.4795849Z 2025-05-07T19:43:03.4796057Z 00:05.0 Ethernet controller: Amazon.com, Inc. Elastic Network Adapter (ENA) 2025-05-07T19:43:03.4796142Z Physical Slot: 5 2025-05-07T19:43:03.4796264Z Flags: bus master, fast devsel, latency 0 2025-05-07T19:43:03.4796402Z Memory at c0510000 (32-bit, non-prefetchable) [size=16K] 2025-05-07T19:43:03.4796528Z Memory at c0400000 (32-bit, prefetchable) [size=1M] 2025-05-07T19:43:03.4796682Z Memory at c0500000 (32-bit, non-prefetchable) [size=64K] 2025-05-07T19:43:03.4796823Z Capabilities: 2025-05-07T19:43:03.4796908Z Kernel driver in use: ena 2025-05-07T19:43:03.4796912Z 2025-05-07T19:43:03.4796916Z 2025-05-07T19:43:03.4797081Z ################################################################################ 2025-05-07T19:43:03.4797186Z [INFO] Print Linux distribution info ... 
2025-05-07T19:43:03.4797258Z + uname -a 2025-05-07T19:43:03.4797262Z 2025-05-07T19:43:03.4797650Z Linux 8b1652485f5b 6.1.130-139.222.amzn2023.x86_64 #1 SMP PREEMPT_DYNAMIC Tue Mar 11 01:10:58 UTC 2025 x86_64 x86_64 x86_64 GNU/Linux 2025-05-07T19:43:03.4797655Z 2025-05-07T19:43:03.4797731Z + uname -m 2025-05-07T19:43:03.4797735Z 2025-05-07T19:43:03.4797808Z x86_64 2025-05-07T19:43:03.4797812Z 2025-05-07T19:43:03.4797901Z + cat /proc/version 2025-05-07T19:43:03.4797906Z 2025-05-07T19:43:03.4798647Z Linux version 6.1.130-139.222.amzn2023.x86_64 (mockbuild@ip-10-0-55-76) (gcc (GCC) 11.5.0 20240719 (Red Hat 11.5.0-5), GNU ld version 2.39-6.amzn2023.0.11) #1 SMP PREEMPT_DYNAMIC Tue Mar 11 01:10:58 UTC 2025 2025-05-07T19:43:03.4798655Z 2025-05-07T19:43:03.4798744Z + cat /etc/os-release 2025-05-07T19:43:03.4798748Z 2025-05-07T19:43:03.4798843Z NAME="Amazon Linux" 2025-05-07T19:43:03.4798921Z VERSION="2023" 2025-05-07T19:43:03.4799004Z ID="amzn" 2025-05-07T19:43:03.4799082Z ID_LIKE="fedora" 2025-05-07T19:43:03.4799173Z VERSION_ID="2023" 2025-05-07T19:43:03.4799267Z PLATFORM_ID="platform:al2023" 2025-05-07T19:43:03.4799372Z PRETTY_NAME="Amazon Linux 2023.7.20250428" 2025-05-07T19:43:03.4799456Z ANSI_COLOR="0;33" 2025-05-07T19:43:03.4799576Z CPE_NAME="cpe:2.3:o:amazon:amazon_linux:2023" 2025-05-07T19:43:03.4799753Z HOME_URL="https://aws.amazon.com/linux/amazon-linux-2023/" 2025-05-07T19:43:03.4799915Z DOCUMENTATION_URL="https://docs.aws.amazon.com/linux/" 2025-05-07T19:43:03.4800081Z SUPPORT_URL="https://aws.amazon.com/premiumsupport/" 2025-05-07T19:43:03.4800270Z BUG_REPORT_URL="https://github.com/amazonlinux/amazon-linux-2023" 2025-05-07T19:43:03.4800347Z VENDOR_NAME="AWS" 2025-05-07T19:43:03.4800471Z VENDOR_URL="https://aws.amazon.com/" 2025-05-07T19:43:03.4800554Z SUPPORT_END="2029-06-30" 2025-05-07T19:43:03.4800558Z 2025-05-07T19:43:03.4840794Z ##[group]Run . $PRELUDE; print_gpu_info 2025-05-07T19:43:03.4840940Z . 
$PRELUDE; print_gpu_info 2025-05-07T19:43:03.4841248Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:43:03.4841329Z env: 2025-05-07T19:43:03.4841441Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:43:03.4841523Z BUILD_ENV: build_binary 2025-05-07T19:43:03.4841619Z BUILD_TARGET: default 2025-05-07T19:43:03.4841710Z BUILD_VARIANT: cuda 2025-05-07T19:43:03.4841795Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:43:03.4841880Z ##[endgroup] 2025-05-07T19:43:03.9835089Z ################################################################################ 2025-05-07T19:43:03.9862068Z [INFO] Printing general display info ... 2025-05-07T19:43:03.9862561Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:43:04.0719987Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:43:04.0730935Z /usr/bin/sudo 2025-05-07T19:43:04.0744263Z which: no apt-get in (/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin) 2025-05-07T19:43:04.0755896Z /usr/bin/yum 2025-05-07T19:43:04.0759858Z [INSTALL] Updating system repositories ... 2025-05-07T19:43:04.0785876Z [EXEC] [ATTEMPT 0/3] + sudo yum update -y 2025-05-07T19:43:04.2965552Z Last metadata expiration check: 0:00:18 ago on Wed May 7 19:42:46 2025. 2025-05-07T19:43:04.3924026Z Dependencies resolved. 2025-05-07T19:43:04.4137870Z Nothing to do. 2025-05-07T19:43:04.4138199Z Complete! 2025-05-07T19:43:04.4884175Z [INSTALL] Installing system package(s): hostname lshw ... 2025-05-07T19:43:04.4906023Z [EXEC] [ATTEMPT 0/3] + sudo yum install -y hostname lshw 2025-05-07T19:43:04.7139889Z Last metadata expiration check: 0:00:18 ago on Wed May 7 19:42:46 2025. 2025-05-07T19:43:04.7662200Z Dependencies resolved. 
2025-05-07T19:43:04.7825852Z ================================================================================ 2025-05-07T19:43:04.7827742Z Package Arch Version Repository Size 2025-05-07T19:43:04.7828985Z ================================================================================ 2025-05-07T19:43:04.7829876Z Installing: 2025-05-07T19:43:04.7830789Z hostname x86_64 3.23-4.amzn2023.0.3 amazonlinux 28 k 2025-05-07T19:43:04.7832187Z lshw x86_64 B.02.19.2-7.amzn2023.0.3 amazonlinux 319 k 2025-05-07T19:43:04.7832752Z 2025-05-07T19:43:04.7832846Z Transaction Summary 2025-05-07T19:43:04.7833219Z ================================================================================ 2025-05-07T19:43:04.7833522Z Install 2 Packages 2025-05-07T19:43:04.7833671Z 2025-05-07T19:43:04.7833769Z Total download size: 347 k 2025-05-07T19:43:04.7834016Z Installed size: 883 k 2025-05-07T19:43:04.7834272Z Downloading Packages: 2025-05-07T19:43:05.0965255Z (1/2): hostname-3.23-4.amzn2023.0.3.x86_64.rpm 1.5 MB/s | 28 kB 00:00 2025-05-07T19:43:05.1004997Z (2/2): lshw-B.02.19.2-7.amzn2023.0.3.x86_64.rpm 14 MB/s | 319 kB 00:00 2025-05-07T19:43:05.1015027Z -------------------------------------------------------------------------------- 2025-05-07T19:43:05.1015536Z Total 1.1 MB/s | 347 kB 00:00 2025-05-07T19:43:05.1261900Z Running transaction check 2025-05-07T19:43:05.1318229Z Transaction check succeeded. 2025-05-07T19:43:05.1319364Z Running transaction test 2025-05-07T19:43:05.1480828Z Transaction test succeeded. 
2025-05-07T19:43:05.1481235Z Running transaction 2025-05-07T19:43:05.1760827Z Preparing : 1/1 2025-05-07T19:43:05.1832067Z Installing : lshw-B.02.19.2-7.amzn2023.0.3.x86_64 1/2 2025-05-07T19:43:05.1870585Z Installing : hostname-3.23-4.amzn2023.0.3.x86_64 2/2 2025-05-07T19:43:06.2290984Z Running scriptlet: hostname-3.23-4.amzn2023.0.3.x86_64 2/2 2025-05-07T19:43:06.2293385Z Verifying : hostname-3.23-4.amzn2023.0.3.x86_64 1/2 2025-05-07T19:43:06.2663648Z Verifying : lshw-B.02.19.2-7.amzn2023.0.3.x86_64 2/2 2025-05-07T19:43:06.2664049Z 2025-05-07T19:43:06.2664148Z Installed: 2025-05-07T19:43:06.2664509Z hostname-3.23-4.amzn2023.0.3.x86_64 lshw-B.02.19.2-7.amzn2023.0.3.x86_64 2025-05-07T19:43:06.2664869Z 2025-05-07T19:43:06.2664955Z Complete! 2025-05-07T19:43:06.3093346Z + hostname 2025-05-07T19:43:06.3093610Z 2025-05-07T19:43:06.3100524Z 8b1652485f5b 2025-05-07T19:43:06.3100988Z 2025-05-07T19:43:06.3101293Z + sudo lshw -C display 2025-05-07T19:43:06.3101801Z 2025-05-07T19:43:06.5069592Z *-display UNCLAIMED 2025-05-07T19:43:06.5070539Z description: VGA compatible controller 2025-05-07T19:43:06.5071512Z product: Amazon.com, Inc. 2025-05-07T19:43:06.5072383Z vendor: Amazon.com, Inc. 2025-05-07T19:43:06.5073217Z physical id: 3 2025-05-07T19:43:06.5073749Z bus info: pci@0000:00:03.0 2025-05-07T19:43:06.5074049Z version: 00 2025-05-07T19:43:06.5074289Z width: 32 bits 2025-05-07T19:43:06.5074548Z clock: 33MHz 2025-05-07T19:43:06.5074809Z capabilities: vga_controller bus_master 2025-05-07T19:43:06.5075156Z configuration: latency=0 2025-05-07T19:43:06.5075489Z resources: memory:c0000000-c03fffff memory:c0000-dffff 2025-05-07T19:43:06.5091625Z 2025-05-07T19:43:06.5091885Z ################################################################################ 2025-05-07T19:43:06.5092314Z [INFO] Printing NVIDIA GPU info ... 
2025-05-07T19:43:06.5202428Z lspci: Unable to load libkmod resources: error -2 2025-05-07T19:43:06.5223972Z which: no nvidia-smi in (/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin) 2025-05-07T19:43:06.5224618Z [CHECK] nvidia-smi not found 2025-05-07T19:43:06.5224958Z ################################################################################ 2025-05-07T19:43:06.5225616Z [INFO] Printing AMD GPU info ... 2025-05-07T19:43:06.5364502Z lspci: Unable to load libkmod resources: error -2 2025-05-07T19:43:06.5394451Z which: no rocminfo in (/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin) 2025-05-07T19:43:06.5394977Z [CHECK] rocminfo not found 2025-05-07T19:43:06.5406405Z which: no rocm-smi in (/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin) 2025-05-07T19:43:06.5407459Z [CHECK] rocm-smi not found 2025-05-07T19:43:06.5513924Z ##[group]Run . $PRELUDE; setup_miniconda $HOME/miniconda 2025-05-07T19:43:06.5514420Z . $PRELUDE; setup_miniconda $HOME/miniconda 2025-05-07T19:43:06.5514988Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:43:06.5515342Z env: 2025-05-07T19:43:06.5515570Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:43:06.5515900Z BUILD_ENV: build_binary 2025-05-07T19:43:06.5516154Z BUILD_TARGET: default 2025-05-07T19:43:06.5516417Z BUILD_VARIANT: cuda 2025-05-07T19:43:06.5516698Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:43:06.5516951Z ##[endgroup] 2025-05-07T19:43:06.9797159Z ################################################################################ 2025-05-07T19:43:06.9798229Z # Setup Miniconda 2025-05-07T19:43:06.9798832Z # 2025-05-07T19:43:06.9822711Z # [2025-05-07T19:43:06.981Z] + setup_miniconda /github/home/miniconda 2025-05-07T19:43:06.9823180Z ################################################################################ 2025-05-07T19:43:06.9823576Z 2025-05-07T19:43:06.9844176Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:43:07.0726982Z [CHECK] 
Network does not appear to be blocked. 2025-05-07T19:43:07.0728077Z + mkdir -p /github/home/miniconda 2025-05-07T19:43:07.0728651Z 2025-05-07T19:43:07.0741639Z 2025-05-07T19:43:07.0742200Z [SETUP] Downloading the Miniconda installer ... 2025-05-07T19:43:07.0770922Z [EXEC] [ATTEMPT 0/3] + wget -q https://repo.anaconda.com/miniconda/Miniconda3-latest-Linux-x86_64.sh -O miniconda.sh 2025-05-07T19:43:08.0369257Z [SETUP] Installing Miniconda ... 2025-05-07T19:43:08.0369761Z + bash miniconda.sh -b -p /github/home/miniconda -u 2025-05-07T19:43:08.0370057Z 2025-05-07T19:43:08.0527484Z PREFIX=/github/home/miniconda 2025-05-07T19:43:08.4062921Z Unpacking payload ... 2025-05-07T19:43:08.8859747Z entry_point.py:256: DeprecationWarning: Python 3.14 will, by default, filter extracted tar archives and reject files or modify their metadata. Use the filter argument to control this behavior. 2025-05-07T19:43:09.5581334Z entry_point.py:256: DeprecationWarning: Python 3.14 will, by default, filter extracted tar archives and reject files or modify their metadata. Use the filter argument to control this behavior. 2025-05-07T19:43:11.4156925Z 2025-05-07T19:43:11.4157565Z Installing base environment... 2025-05-07T19:43:11.4157909Z 2025-05-07T19:43:12.4077718Z Preparing transaction: ...working... done 2025-05-07T19:43:15.2668432Z Executing transaction: ...working... done 2025-05-07T19:43:15.8201485Z entry_point.py:256: DeprecationWarning: Python 3.14 will, by default, filter extracted tar archives and reject files or modify their metadata. Use the filter argument to control this behavior. 2025-05-07T19:43:15.8896136Z installation finished. 2025-05-07T19:43:15.8898054Z 2025-05-07T19:43:15.8898843Z + rm -f miniconda.sh 2025-05-07T19:43:15.8899072Z 2025-05-07T19:43:15.9040048Z 2025-05-07T19:43:15.9040327Z [SETUP] Reloading the bash configuration ... 
2025-05-07T19:43:15.9040845Z + /github/home/miniconda/bin/conda init bash 2025-05-07T19:43:16.2701215Z 2025-05-07T19:43:16.2701877Z no change /github/home/miniconda/condabin/conda 2025-05-07T19:43:16.2703061Z no change /github/home/miniconda/bin/conda 2025-05-07T19:43:16.2704175Z no change /github/home/miniconda/bin/conda-env 2025-05-07T19:43:16.2705270Z no change /github/home/miniconda/bin/activate 2025-05-07T19:43:16.2706391Z no change /github/home/miniconda/bin/deactivate 2025-05-07T19:43:16.2707190Z no change /github/home/miniconda/etc/profile.d/conda.sh 2025-05-07T19:43:16.2708081Z no change /github/home/miniconda/etc/fish/conf.d/conda.fish 2025-05-07T19:43:16.2708593Z no change /github/home/miniconda/shell/condabin/Conda.psm1 2025-05-07T19:43:16.2709182Z no change /github/home/miniconda/shell/condabin/conda-hook.ps1 2025-05-07T19:43:16.2709757Z no change /github/home/miniconda/lib/python3.13/site-packages/xontrib/conda.xsh 2025-05-07T19:43:16.2710474Z no change /github/home/miniconda/etc/profile.d/conda.csh 2025-05-07T19:43:16.2710884Z modified /github/home/.bashrc 2025-05-07T19:43:16.2711076Z 2025-05-07T19:43:16.2711302Z ==> For changes to take effect, close and re-open your current shell. <== 2025-05-07T19:43:16.2711651Z 2025-05-07T19:43:16.3235907Z 2025-05-07T19:43:16.3236185Z + . /github/home/.bashrc 2025-05-07T19:43:16.3236513Z 2025-05-07T19:43:17.1127808Z 2025-05-07T19:43:17.1128428Z [SETUP] Installing libmamba-solver (required since Anaconda 2024.02-1) and libarchive ... 
2025-05-07T19:43:17.1154581Z [EXEC] [ATTEMPT 0/3] + conda install --solver=classic -c conda-forge --override-channels -y conda-libmamba-solver libmamba libmambapy libarchive 2025-05-07T19:43:28.8534844Z Collecting package metadata (current_repodata.json): - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - done 2025-05-07T19:43:30.3138374Z Solving environment: | / - \ | / - \ | / - done 2025-05-07T19:43:30.4029405Z 2025-05-07T19:43:30.4030081Z ## Package Plan ## 2025-05-07T19:43:30.4030655Z 2025-05-07T19:43:30.4031057Z environment location: /github/home/miniconda 2025-05-07T19:43:30.4031781Z 2025-05-07T19:43:30.4032057Z added / updated specs: 2025-05-07T19:43:30.4032874Z - conda-libmamba-solver 2025-05-07T19:43:30.4033690Z - libarchive 2025-05-07T19:43:30.4034300Z - libmamba 2025-05-07T19:43:30.4034922Z - libmambapy 2025-05-07T19:43:30.4035300Z 2025-05-07T19:43:30.4035313Z 2025-05-07T19:43:30.4035670Z The following packages will be downloaded: 2025-05-07T19:43:30.4036374Z 2025-05-07T19:43:30.4036711Z package | build 2025-05-07T19:43:30.4037499Z ---------------------------|----------------- 2025-05-07T19:43:30.4038011Z ca-certificates-2025.4.26 | hbd8a1cb_0 149 KB conda-forge 2025-05-07T19:43:30.4038576Z certifi-2025.4.26 | pyhd8ed1ab_0 154 KB conda-forge 2025-05-07T19:43:30.4039062Z conda-25.3.1 | py313h78bf25f_1 1.1 MB conda-forge 2025-05-07T19:43:30.4039622Z conda-libmamba-solver-25.4.0| pyhd8ed1ab_0 41 KB conda-forge 2025-05-07T19:43:30.4040127Z ------------------------------------------------------------ 2025-05-07T19:43:30.4040551Z Total: 1.4 MB 2025-05-07T19:43:30.4040791Z 2025-05-07T19:43:30.4040923Z The following packages will be UPDATED: 2025-05-07T19:43:30.4041188Z 2025-05-07T19:43:30.4045611Z ca-certificates pkgs/main/linux-64::ca-certificates-2~ --> conda-forge/noarch::ca-certificates-2025.4.26-hbd8a1cb_0 
2025-05-07T19:43:30.4046523Z conda pkgs/main::conda-25.3.1-py313h06a4308~ --> conda-forge::conda-25.3.1-py313h78bf25f_1 2025-05-07T19:43:30.4046963Z 2025-05-07T19:43:30.4047211Z The following packages will be SUPERSEDED by a higher-priority channel: 2025-05-07T19:43:30.4047600Z 2025-05-07T19:43:30.4047956Z certifi pkgs/main/linux-64::certifi-2025.4.26~ --> conda-forge/noarch::certifi-2025.4.26-pyhd8ed1ab_0 2025-05-07T19:43:30.4048862Z conda-libmamba-so~ pkgs/main::conda-libmamba-solver-25.4~ --> conda-forge::conda-libmamba-solver-25.4.0-pyhd8ed1ab_0 2025-05-07T19:43:30.4049410Z 2025-05-07T19:43:30.4049414Z 2025-05-07T19:43:30.4049671Z 2025-05-07T19:43:30.4049835Z Downloading and Extracting Packages: ...working... 2025-05-07T19:43:30.4050268Z conda-25.3.1 | 1.1 MB | | 0% 2025-05-07T19:43:30.4050514Z 2025-05-07T19:43:30.4050860Z certifi-2025.4.26 | 154 KB | | 0%  2025-05-07T19:43:30.4051124Z 2025-05-07T19:43:30.4051128Z 2025-05-07T19:43:30.4051381Z ca-certificates-2025 | 149 KB | | 0%  2025-05-07T19:43:30.4051693Z 2025-05-07T19:43:30.4051697Z 2025-05-07T19:43:30.4051865Z 2025-05-07T19:43:30.4512684Z conda-libmamba-solve | 41 KB | | 0%  2025-05-07T19:43:30.4513239Z 2025-05-07T19:43:30.4513304Z 2025-05-07T19:43:30.4513309Z 2025-05-07T19:43:30.4597054Z conda-libmamba-solve | 41 KB | ########## | 100%  2025-05-07T19:43:30.4598021Z 2025-05-07T19:43:30.4598035Z 2025-05-07T19:43:30.4624354Z ca-certificates-2025 | 149 KB | ########## | 100%  2025-05-07T19:43:30.4625256Z 2025-05-07T19:43:30.4625270Z 2025-05-07T19:43:30.4625312Z 2025-05-07T19:43:30.4678523Z conda-libmamba-solve | 41 KB | ########## | 100%  2025-05-07T19:43:30.4679465Z 2025-05-07T19:43:30.4679479Z 2025-05-07T19:43:30.4760924Z ca-certificates-2025 | 149 KB | ########## | 100%  2025-05-07T19:43:30.4761854Z 2025-05-07T19:43:30.4836827Z certifi-2025.4.26 | 154 KB | ########## | 100%  2025-05-07T19:43:30.4837668Z 2025-05-07T19:43:30.5044473Z certifi-2025.4.26 | 154 KB | ########## | 100%  
2025-05-07T19:43:30.5242162Z conda-25.3.1 | 1.1 MB | ###5 | 35% 2025-05-07T19:43:30.6202452Z conda-25.3.1 | 1.1 MB | ########## | 100% 2025-05-07T19:43:30.6203581Z conda-25.3.1 | 1.1 MB | ########## | 100% 2025-05-07T19:43:30.6204595Z conda-25.3.1 | 1.1 MB | ########## | 100% 2025-05-07T19:43:30.6205074Z 2025-05-07T19:43:30.6205292Z 2025-05-07T19:43:30.6205587Z  2025-05-07T19:43:30.6205829Z 2025-05-07T19:43:30.6205833Z 2025-05-07T19:43:30.6206014Z  2025-05-07T19:43:30.6206268Z 2025-05-07T19:43:30.6206272Z 2025-05-07T19:43:30.6206275Z 2025-05-07T19:43:30.6206476Z  done 2025-05-07T19:43:30.7215549Z Preparing transaction: | done 2025-05-07T19:43:30.8223186Z Verifying transaction: - done 2025-05-07T19:43:32.1253202Z Executing transaction: | / - \ | / - \ | / - \ | done 2025-05-07T19:43:33.6999233Z [SETUP] Updating Miniconda base packages ... 2025-05-07T19:43:33.7022792Z [EXEC] [ATTEMPT 0/3] + conda update -n base -c defaults --update-deps -y conda 2025-05-07T19:43:34.4201228Z Channels: 2025-05-07T19:43:34.4201788Z - defaults 2025-05-07T19:43:34.4202042Z Platform: linux-64 2025-05-07T19:43:35.4938983Z Collecting package metadata (repodata.json): - \ | / - \ done 2025-05-07T19:43:35.6247018Z Solving environment: / - Channels: 2025-05-07T19:43:35.6247432Z - defaults 2025-05-07T19:43:35.6247698Z Platform: linux-64 2025-05-07T19:43:35.9020602Z Collecting package metadata (repodata.json): | / - \ done 2025-05-07T19:43:36.1104887Z Solving environment: / - \ done 2025-05-07T19:43:36.2262277Z | done 2025-05-07T19:43:36.2927899Z 2025-05-07T19:43:36.2928376Z ## Package Plan ## 2025-05-07T19:43:36.2928602Z 2025-05-07T19:43:36.2928752Z environment location: /github/home/miniconda 2025-05-07T19:43:36.2929053Z 2025-05-07T19:43:36.2929192Z added / updated specs: 2025-05-07T19:43:36.2929509Z - conda 2025-05-07T19:43:36.2929639Z 2025-05-07T19:43:36.2929669Z 2025-05-07T19:43:36.2929801Z The following packages will be downloaded: 2025-05-07T19:43:36.2930038Z 2025-05-07T19:43:36.2930165Z 
package | build 2025-05-07T19:43:36.2930551Z ---------------------------|----------------- 2025-05-07T19:43:36.2930950Z pip-25.1 | pyhc872135_2 1.3 MB 2025-05-07T19:43:36.2931680Z tzdata-2025b | h04d1e81_0 116 KB 2025-05-07T19:43:36.2932134Z ------------------------------------------------------------ 2025-05-07T19:43:36.2932502Z Total: 1.4 MB 2025-05-07T19:43:36.2932905Z 2025-05-07T19:43:36.2933141Z The following packages will be UPDATED: 2025-05-07T19:43:36.2933355Z 2025-05-07T19:43:36.2933875Z pip pkgs/main/linux-64::pip-25.0-py313h06~ --> pkgs/main/noarch::pip-25.1-pyhc872135_2 2025-05-07T19:43:36.2934413Z tzdata 2025a-h04d1e81_0 --> 2025b-h04d1e81_0 2025-05-07T19:43:36.2934670Z 2025-05-07T19:43:36.2934674Z 2025-05-07T19:43:36.2934706Z 2025-05-07T19:43:36.2934855Z Downloading and Extracting Packages: ...working... 2025-05-07T19:43:36.2935223Z pip-25.1 | 1.3 MB | | 0% 2025-05-07T19:43:36.2935477Z 2025-05-07T19:43:36.3303174Z tzdata-2025b | 116 KB | | 0%  2025-05-07T19:43:36.3303492Z 2025-05-07T19:43:36.3543303Z tzdata-2025b | 116 KB | ########## | 100%  2025-05-07T19:43:36.5217275Z pip-25.1 | 1.3 MB | ########## | 100% 2025-05-07T19:43:36.5217590Z 2025-05-07T19:43:36.5218068Z tzdata-2025b | 116 KB | ########## | 100%  2025-05-07T19:43:36.5218335Z 2025-05-07T19:43:36.5345950Z tzdata-2025b | 116 KB | ########## | 100%  2025-05-07T19:43:36.5347160Z pip-25.1 | 1.3 MB | ########## | 100% 2025-05-07T19:43:36.5347753Z pip-25.1 | 1.3 MB | ########## | 100% 2025-05-07T19:43:36.5348091Z 2025-05-07T19:43:36.5348336Z 2025-05-07T19:43:36.5348535Z  done 2025-05-07T19:43:36.6357629Z Preparing transaction: - done 2025-05-07T19:43:36.7367344Z Verifying transaction: | done 2025-05-07T19:43:38.7400670Z Executing transaction: - \ | / - \ | / - \ | / - \ | / - \ | / done 2025-05-07T19:43:39.2997466Z [SETUP] Cleaning up Conda packages ... 
2025-05-07T19:43:39.2998514Z + conda clean --packages --tarball -y 2025-05-07T19:43:39.2999153Z 2025-05-07T19:43:39.7375135Z Will remove 99 (117.8 MB) tarball(s). 2025-05-07T19:43:39.7375629Z Will remove 11 (16.0 MB) package(s). 2025-05-07T19:43:39.7926658Z 2025-05-07T19:43:39.7929924Z + conda clean --all -y 2025-05-07T19:43:39.7930455Z 2025-05-07T19:43:40.2342452Z There are no unused tarball(s) to remove. 2025-05-07T19:43:40.2343062Z Will remove 1 index cache(s). 2025-05-07T19:43:40.2343422Z There are no unused package(s) to remove. 2025-05-07T19:43:40.2343767Z There are no tempfile(s) to remove. 2025-05-07T19:43:40.2344120Z There are no logfile(s) to remove. 2025-05-07T19:43:40.2888163Z 2025-05-07T19:43:40.2888720Z + conda info 2025-05-07T19:43:40.2889146Z 2025-05-07T19:43:40.8453365Z 2025-05-07T19:43:40.8453688Z active environment : base 2025-05-07T19:43:40.8454165Z active env location : /github/home/miniconda 2025-05-07T19:43:40.8454591Z shell level : 1 2025-05-07T19:43:40.8454922Z user config file : /github/home/.condarc 2025-05-07T19:43:40.8455545Z populated config files : /github/home/miniconda/.condarc 2025-05-07T19:43:40.8455946Z conda version : 25.3.1 2025-05-07T19:43:40.8456283Z conda-build version : not installed 2025-05-07T19:43:40.8456613Z python version : 3.13.2.final.0 2025-05-07T19:43:40.8456974Z solver : libmamba (default) 2025-05-07T19:43:40.8457354Z virtual packages : __archspec=1=cascadelake 2025-05-07T19:43:40.8457706Z __conda=25.3.1=0 2025-05-07T19:43:40.8458038Z __glibc=2.34=0 2025-05-07T19:43:40.8458347Z __linux=6.1.130=0 2025-05-07T19:43:40.8458678Z __unix=0=0 2025-05-07T19:43:40.8459043Z base environment : /github/home/miniconda (writable) 2025-05-07T19:43:40.8459629Z conda av data dir : /github/home/miniconda/etc/conda 2025-05-07T19:43:40.8460287Z conda av metadata url : None 2025-05-07T19:43:40.8460725Z channel URLs : https://repo.anaconda.com/pkgs/main/linux-64 2025-05-07T19:43:40.8461223Z https://repo.anaconda.com/pkgs/main/noarch 
2025-05-07T19:43:40.8461639Z https://repo.anaconda.com/pkgs/r/linux-64 2025-05-07T19:43:40.8462074Z https://repo.anaconda.com/pkgs/r/noarch 2025-05-07T19:43:40.8462598Z package cache : /github/home/miniconda/pkgs 2025-05-07T19:43:40.8462996Z /github/home/.conda/pkgs 2025-05-07T19:43:40.8463369Z envs directories : /github/home/miniconda/envs 2025-05-07T19:43:40.8463764Z /github/home/.conda/envs 2025-05-07T19:43:40.8464120Z platform : linux-64 2025-05-07T19:43:40.8465060Z user-agent : conda/25.3.1 requests/2.32.3 CPython/3.13.2 Linux/6.1.130-139.222.amzn2023.x86_64 amzn/2023.7.20250428 glibc/2.34 solver/libmamba conda-libmamba-solver/25.4.0 libmambapy/2.0.5 aau/0.7.0 c/. s/. e/. 2025-05-07T19:43:40.8466143Z UID:GID : 0:0 2025-05-07T19:43:40.8466407Z netrc file : None 2025-05-07T19:43:40.8466695Z offline mode : False 2025-05-07T19:43:40.8466868Z 2025-05-07T19:43:40.9031036Z 2025-05-07T19:43:40.9031923Z [SETUP] Exporting Miniconda variables ... 2025-05-07T19:43:40.9033801Z [SETUP] Saving Miniconda variables to /__w/_temp/_runner_file_commands/add_path_e0648f08-abc8-461b-9716-2492ee8e7325 ... 2025-05-07T19:43:40.9034911Z [SETUP] Successfully set up Miniconda at /github/home/miniconda 2025-05-07T19:43:40.9171165Z ##[group]Run . $PRELUDE; create_conda_environment $BUILD_ENV 3.9 2025-05-07T19:43:40.9171764Z . 
$PRELUDE; create_conda_environment $BUILD_ENV 3.9 2025-05-07T19:43:40.9172654Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:43:40.9173034Z env: 2025-05-07T19:43:40.9173283Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:43:40.9173660Z BUILD_ENV: build_binary 2025-05-07T19:43:40.9173954Z BUILD_TARGET: default 2025-05-07T19:43:40.9174256Z BUILD_VARIANT: cuda 2025-05-07T19:43:40.9174523Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:43:40.9174827Z ##[endgroup] 2025-05-07T19:43:41.3168375Z ################################################################################ 2025-05-07T19:43:41.3168825Z # Create Conda Environment 2025-05-07T19:43:41.3169114Z # 2025-05-07T19:43:41.3183495Z # [2025-05-07T19:43:41.317Z] + create_conda_environment build_binary 3.9 2025-05-07T19:43:41.3184688Z ################################################################################ 2025-05-07T19:43:41.3184936Z 2025-05-07T19:43:41.3203014Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:43:41.4026417Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:43:41.4027601Z [SETUP] Listing existing Conda environments ... 2025-05-07T19:43:41.4028566Z + conda info --envs 2025-05-07T19:43:41.4029051Z 2025-05-07T19:43:41.9745402Z 2025-05-07T19:43:41.9745911Z # conda environments: 2025-05-07T19:43:41.9746934Z # 2025-05-07T19:43:41.9747397Z base /github/home/miniconda 2025-05-07T19:43:41.9747656Z 2025-05-07T19:43:42.0333147Z 2025-05-07T19:43:42.0334088Z [SETUP] Deleting the prefix directory if it exists ... 2025-05-07T19:43:43.6672111Z + rm -rf /github/home/miniconda/envs/build_binary 2025-05-07T19:43:43.6672999Z 2025-05-07T19:43:43.6685459Z 2025-05-07T19:43:43.6693335Z [SETUP] Creating new Conda environment (Python 3.9) ... 
2025-05-07T19:43:43.6717862Z [EXEC] [ATTEMPT 0/3] + conda create -y -n build_binary python=3.9 2025-05-07T19:43:44.2486165Z Channels: 2025-05-07T19:43:44.2486821Z - defaults 2025-05-07T19:43:44.2487465Z Platform: linux-64 2025-05-07T19:43:45.6136718Z Collecting package metadata (repodata.json): - \ | / - \ | / done 2025-05-07T19:43:45.7143556Z Solving environment: \ done 2025-05-07T19:43:45.7431791Z 2025-05-07T19:43:45.7432793Z ## Package Plan ## 2025-05-07T19:43:45.7433272Z 2025-05-07T19:43:45.7433880Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:43:45.7434732Z 2025-05-07T19:43:45.7434845Z added / updated specs: 2025-05-07T19:43:45.7435124Z - python=3.9 2025-05-07T19:43:45.7435304Z 2025-05-07T19:43:45.7435308Z 2025-05-07T19:43:45.7435445Z The following packages will be downloaded: 2025-05-07T19:43:45.7435688Z 2025-05-07T19:43:45.7435892Z package | build 2025-05-07T19:43:45.7436248Z ---------------------------|----------------- 2025-05-07T19:43:45.7436700Z _libgcc_mutex-0.1 | main 3 KB 2025-05-07T19:43:45.7437147Z _openmp_mutex-5.1 | 1_gnu 21 KB 2025-05-07T19:43:45.7437632Z ca-certificates-2025.2.25 | h06a4308_0 129 KB 2025-05-07T19:43:45.7438108Z python-3.9.21 | he870216_1 25.1 MB 2025-05-07T19:43:45.7438599Z setuptools-78.1.1 | py39h06a4308_0 1.7 MB 2025-05-07T19:43:45.7439063Z wheel-0.45.1 | py39h06a4308_0 114 KB 2025-05-07T19:43:45.7439467Z ------------------------------------------------------------ 2025-05-07T19:43:45.7439851Z Total: 27.1 MB 2025-05-07T19:43:45.7440085Z 2025-05-07T19:43:45.7440228Z The following NEW packages will be INSTALLED: 2025-05-07T19:43:45.7440506Z 2025-05-07T19:43:45.7440736Z _libgcc_mutex pkgs/main/linux-64::_libgcc_mutex-0.1-main 2025-05-07T19:43:45.7441250Z _openmp_mutex pkgs/main/linux-64::_openmp_mutex-5.1-1_gnu 2025-05-07T19:43:45.7442033Z ca-certificates pkgs/main/linux-64::ca-certificates-2025.2.25-h06a4308_0 2025-05-07T19:43:45.7442668Z ld_impl_linux-64 
pkgs/main/linux-64::ld_impl_linux-64-2.40-h12ee557_0 2025-05-07T19:43:45.7443184Z libffi pkgs/main/linux-64::libffi-3.4.4-h6a678d5_1 2025-05-07T19:43:45.7443694Z libgcc-ng pkgs/main/linux-64::libgcc-ng-11.2.0-h1234567_1 2025-05-07T19:43:45.7444204Z libgomp pkgs/main/linux-64::libgomp-11.2.0-h1234567_1 2025-05-07T19:43:45.7444709Z libstdcxx-ng pkgs/main/linux-64::libstdcxx-ng-11.2.0-h1234567_1 2025-05-07T19:43:45.7445220Z ncurses pkgs/main/linux-64::ncurses-6.4-h6a678d5_0 2025-05-07T19:43:45.7445683Z openssl pkgs/main/linux-64::openssl-3.0.16-h5eee18b_0 2025-05-07T19:43:45.7446158Z pip pkgs/main/noarch::pip-25.1-pyhc872135_2 2025-05-07T19:43:45.7446622Z python pkgs/main/linux-64::python-3.9.21-he870216_1 2025-05-07T19:43:45.7447093Z readline pkgs/main/linux-64::readline-8.2-h5eee18b_0 2025-05-07T19:43:45.7447635Z setuptools pkgs/main/linux-64::setuptools-78.1.1-py39h06a4308_0 2025-05-07T19:43:45.7448144Z sqlite pkgs/main/linux-64::sqlite-3.45.3-h5eee18b_0 2025-05-07T19:43:45.7448600Z tk pkgs/main/linux-64::tk-8.6.14-h39e8969_0 2025-05-07T19:43:45.7449160Z tzdata pkgs/main/noarch::tzdata-2025b-h04d1e81_0 2025-05-07T19:43:45.7449715Z wheel pkgs/main/linux-64::wheel-0.45.1-py39h06a4308_0 2025-05-07T19:43:45.7450148Z xz pkgs/main/linux-64::xz-5.6.4-h5eee18b_1 2025-05-07T19:43:45.7450528Z zlib pkgs/main/linux-64::zlib-1.2.13-h5eee18b_1 2025-05-07T19:43:45.7450807Z 2025-05-07T19:43:45.7450811Z 2025-05-07T19:43:45.7450815Z 2025-05-07T19:43:45.7450966Z Downloading and Extracting Packages: ...working... 
2025-05-07T19:43:45.7451381Z python-3.9.21 | 25.1 MB | | 0% 2025-05-07T19:43:45.7452420Z setuptools-78.1.1 | 1.7 MB | | 0% 2025-05-07T19:43:45.7453024Z ca-certificates-2025 | 129 KB | | 0% 2025-05-07T19:43:45.7453563Z wheel-0.45.1 | 114 KB | | 0% 2025-05-07T19:43:45.7462582Z _openmp_mutex-5.1 | 21 KB | | 0% 2025-05-07T19:43:45.7756140Z _libgcc_mutex-0.1 | 3 KB | | 0% 2025-05-07T19:43:45.7936473Z wheel-0.45.1 | 114 KB | ########## | 100% 2025-05-07T19:43:45.8027803Z _libgcc_mutex-0.1 | 3 KB | ########## | 100% 2025-05-07T19:43:45.8060513Z _openmp_mutex-5.1 | 21 KB | ########## | 100% 2025-05-07T19:43:45.8089903Z ca-certificates-2025 | 129 KB | ########## | 100% 2025-05-07T19:43:45.8501104Z python-3.9.21 | 25.1 MB | #6 | 17% 2025-05-07T19:43:45.8501716Z setuptools-78.1.1 | 1.7 MB | ########## | 100% 2025-05-07T19:43:46.1015967Z python-3.9.21 | 25.1 MB | ######3 | 64% 2025-05-07T19:43:46.1079027Z python-3.9.21 | 25.1 MB | ########## | 100%
2025-05-07T19:43:46.6016897Z done 2025-05-07T19:43:46.8127585Z Preparing transaction: / - done 2025-05-07T19:43:47.9523363Z Verifying transaction: | / - \ | / - \ |
/ - done 2025-05-07T19:43:50.0649948Z Executing transaction: | / - \ | / - \ | / - \ | / - \ | / - \ | done 2025-05-07T19:43:50.0688834Z # 2025-05-07T19:43:50.0689577Z # To activate this environment, use 2025-05-07T19:43:50.0690457Z # 2025-05-07T19:43:50.0691051Z # $ conda activate build_binary 2025-05-07T19:43:50.0691876Z # 2025-05-07T19:43:50.0692532Z # To deactivate an active environment, use 2025-05-07T19:43:50.0693443Z # 2025-05-07T19:43:50.0693983Z # $ conda deactivate 2025-05-07T19:43:50.0694486Z 2025-05-07T19:43:50.1552741Z [SETUP] Upgrading PIP to latest ... 2025-05-07T19:43:50.1585424Z [EXEC] [ATTEMPT 0/3] + conda run -n build_binary pip install --upgrade pip 2025-05-07T19:43:52.8984175Z WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager, possibly rendering your system unusable. It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv. Use the --root-user-action option if you know what you are doing and want to suppress this warning. 
2025-05-07T19:43:52.8985801Z 2025-05-07T19:43:52.8986242Z Requirement already satisfied: pip in /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages (25.1) 2025-05-07T19:43:52.8986904Z Collecting pip 2025-05-07T19:43:52.8987545Z Downloading pip-25.1.1-py3-none-any.whl.metadata (3.6 kB) 2025-05-07T19:43:52.8988054Z Downloading pip-25.1.1-py3-none-any.whl (1.8 MB) 2025-05-07T19:43:52.8988971Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 1.8/1.8 MB 104.8 MB/s eta 0:00:00 2025-05-07T19:43:52.8989402Z Installing collected packages: pip 2025-05-07T19:43:52.8989731Z Attempting uninstall: pip 2025-05-07T19:43:52.8990107Z Found existing installation: pip 25.1 2025-05-07T19:43:52.8990477Z Uninstalling pip-25.1: 2025-05-07T19:43:52.8990783Z Successfully uninstalled pip-25.1 2025-05-07T19:43:52.8991150Z Successfully installed pip-25.1.1 2025-05-07T19:43:52.8991357Z 2025-05-07T19:43:52.9585864Z [SETUP] Upgrading pyOpenSSL ... 2025-05-07T19:43:52.9614088Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y pyOpenSSL>22.1.0 2025-05-07T19:43:53.6236234Z Channels: 2025-05-07T19:43:53.6236751Z - conda-forge 2025-05-07T19:43:53.6237525Z Platform: linux-64 2025-05-07T19:44:03.2738255Z Collecting package metadata (repodata.json): - \ | / - \ | / - \ | / - \ | / done 2025-05-07T19:44:05.0258290Z Solving environment: \ | / - \ done 2025-05-07T19:44:05.0722148Z 2025-05-07T19:44:05.0722718Z ## Package Plan ## 2025-05-07T19:44:05.0723263Z 2025-05-07T19:44:05.0723922Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:44:05.0724903Z 2025-05-07T19:44:05.0725220Z added / updated specs: 2025-05-07T19:44:05.0725995Z - pyopenssl[version='>22.1.0'] 2025-05-07T19:44:05.0726605Z 2025-05-07T19:44:05.0726618Z 2025-05-07T19:44:05.0726972Z The following packages will be downloaded: 2025-05-07T19:44:05.0727680Z 2025-05-07T19:44:05.0727843Z package | build 2025-05-07T19:44:05.0728202Z ---------------------------|----------------- 
2025-05-07T19:44:05.0728636Z cffi-1.17.1 | py39h15c3d72_0 236 KB conda-forge 2025-05-07T19:44:05.0729140Z cryptography-44.0.3 | py39h7170ec2_0 1.5 MB conda-forge 2025-05-07T19:44:05.0729645Z libgcc-15.1.0 | h767d61c_2 810 KB conda-forge 2025-05-07T19:44:05.0730127Z libgcc-ng-15.1.0 | h69a702a_2 34 KB conda-forge 2025-05-07T19:44:05.0730611Z libgomp-15.1.0 | h767d61c_2 442 KB conda-forge 2025-05-07T19:44:05.0731445Z openssl-3.5.0 | h7b32b05_1 3.0 MB conda-forge 2025-05-07T19:44:05.0731920Z pycparser-2.22 | pyh29332c3_1 108 KB conda-forge 2025-05-07T19:44:05.0732442Z pyopenssl-25.0.0 | pyhd8ed1ab_0 120 KB conda-forge 2025-05-07T19:44:05.0732922Z python_abi-3.9 | 2_cp39 4 KB conda-forge 2025-05-07T19:44:05.0733462Z typing-extensions-4.13.2 | h0e9735f_0 88 KB conda-forge 2025-05-07T19:44:05.0734004Z typing_extensions-4.13.2 | pyh29332c3_0 51 KB conda-forge 2025-05-07T19:44:05.0734522Z ------------------------------------------------------------ 2025-05-07T19:44:05.0734931Z Total: 6.3 MB 2025-05-07T19:44:05.0735167Z 2025-05-07T19:44:05.0735316Z The following NEW packages will be INSTALLED: 2025-05-07T19:44:05.0735570Z 2025-05-07T19:44:05.0735840Z cffi conda-forge/linux-64::cffi-1.17.1-py39h15c3d72_0 2025-05-07T19:44:05.0736384Z cryptography conda-forge/linux-64::cryptography-44.0.3-py39h7170ec2_0 2025-05-07T19:44:05.0736964Z libgcc conda-forge/linux-64::libgcc-15.1.0-h767d61c_2 2025-05-07T19:44:05.0737497Z pycparser conda-forge/noarch::pycparser-2.22-pyh29332c3_1 2025-05-07T19:44:05.0738030Z pyopenssl conda-forge/noarch::pyopenssl-25.0.0-pyhd8ed1ab_0 2025-05-07T19:44:05.0738573Z python_abi conda-forge/linux-64::python_abi-3.9-2_cp39 2025-05-07T19:44:05.0739137Z typing-extensions conda-forge/noarch::typing-extensions-4.13.2-h0e9735f_0 2025-05-07T19:44:05.0740127Z typing_extensions conda-forge/noarch::typing_extensions-4.13.2-pyh29332c3_0 2025-05-07T19:44:05.0740507Z 2025-05-07T19:44:05.0740669Z The following packages will be UPDATED: 2025-05-07T19:44:05.0740896Z 
2025-05-07T19:44:05.0743541Z ca-certificates pkgs/main/linux-64::ca-certificates-2~ --> conda-forge/noarch::ca-certificates-2025.4.26-hbd8a1cb_0 2025-05-07T19:44:05.0744469Z libgcc-ng pkgs/main::libgcc-ng-11.2.0-h1234567_1 --> conda-forge::libgcc-ng-15.1.0-h69a702a_2 2025-05-07T19:44:05.0745284Z libgomp pkgs/main::libgomp-11.2.0-h1234567_1 --> conda-forge::libgomp-15.1.0-h767d61c_2 2025-05-07T19:44:05.0746091Z openssl pkgs/main::openssl-3.0.16-h5eee18b_0 --> conda-forge::openssl-3.5.0-h7b32b05_1 2025-05-07T19:44:05.0746558Z 2025-05-07T19:44:05.0746563Z 2025-05-07T19:44:05.0746596Z 2025-05-07T19:44:05.0746764Z Downloading and Extracting Packages: ...working...
2025-05-07T19:44:05.0747171Z openssl-3.5.0 | 3.0 MB | | 0% 2025-05-07T19:44:05.0747813Z cryptography-44.0.3 | 1.5 MB | | 0% 2025-05-07T19:44:05.0748337Z libgcc-15.1.0 | 810 KB | | 0% 2025-05-07T19:44:05.0748842Z libgomp-15.1.0 | 442 KB | | 0% 2025-05-07T19:44:05.0761244Z cffi-1.17.1 | 236 KB | | 0% 2025-05-07T19:44:05.0765186Z pyopenssl-25.0.0 | 120 KB | | 0% 2025-05-07T19:44:05.0767091Z pycparser-2.22 | 108 KB | | 0% 2025-05-07T19:44:05.0769087Z typing-extensions-4. | 88 KB | | 0% 2025-05-07T19:44:05.0770581Z typing_extensions-4. | 51 KB | | 0% 2025-05-07T19:44:05.0771236Z libgcc-ng-15.1.0 | 34 KB | | 0% 2025-05-07T19:44:05.1546622Z python_abi-3.9 | 4 KB | | 0% 2025-05-07T19:44:05.1625270Z cryptography-44.0.3 | 1.5 MB | ########## | 100% 2025-05-07T19:44:05.1768172Z cffi-1.17.1 | 236 KB | ########## | 100% 2025-05-07T19:44:05.1770112Z libgcc-15.1.0 | 810 KB | ########## | 100% 2025-05-07T19:44:05.1798838Z libgomp-15.1.0 | 442 KB | ########## | 100% 2025-05-07T19:44:05.1953559Z openssl-3.5.0 | 3.0 MB | 7 | 7% 2025-05-07T19:44:05.2008859Z pyopenssl-25.0.0 | 120 KB | #3 | 13% 2025-05-07T19:44:05.2092341Z pyopenssl-25.0.0 | 120 KB | ########## | 100% 2025-05-07T19:44:05.2102937Z typing_extensions-4. | 51 KB | ###1 | 31% 2025-05-07T19:44:05.2116552Z pycparser-2.22 | 108 KB | #4 | 15% 2025-05-07T19:44:05.2159099Z typing_extensions-4. | 51 KB | ########## | 100% 2025-05-07T19:44:05.2263152Z pycparser-2.22 | 108 KB | ########## | 100% 2025-05-07T19:44:05.2348193Z libgcc-ng-15.1.0 | 34 KB | ####7 | 47% 2025-05-07T19:44:05.2493883Z libgcc-ng-15.1.0 | 34 KB | ########## | 100% 2025-05-07T19:44:05.2499016Z typing-extensions-4. | 88 KB | #8 | 18% 2025-05-07T19:44:05.2519457Z python_abi-3.9 | 4 KB | ########## | 100% 2025-05-07T19:44:05.2702741Z typing-extensions-4. | 88 KB | ########## | 100% 2025-05-07T19:44:05.3206648Z openssl-3.5.0 | 3.0 MB | #######3 | 73% 2025-05-07T19:44:05.3611188Z openssl-3.5.0 | 3.0 MB | ########## | 100%
2025-05-07T19:44:05.4300694Z done 2025-05-07T19:44:05.5303813Z Preparing transaction: / done 2025-05-07T19:44:05.6313113Z Verifying transaction: \ done 2025-05-07T19:44:07.0348973Z Executing transaction: / - \ | / - \ | / - \ | / - done 2025-05-07T19:44:07.1320855Z [SETUP] Testing pyOpenSSL import ... 2025-05-07T19:44:08.8300809Z [CHECK] Python (sub-)package 'OpenSSL' found ... 2025-05-07T19:44:08.8320452Z [SETUP] Installing libxcrypt ...
2025-05-07T19:44:08.8345358Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y libxcrypt 2025-05-07T19:44:09.4990422Z Channels: 2025-05-07T19:44:09.4990744Z - conda-forge 2025-05-07T19:44:09.4990985Z Platform: linux-64 2025-05-07T19:44:12.6855691Z Collecting package metadata (repodata.json): - \ | / done 2025-05-07T19:44:13.1204243Z Solving environment: \ done 2025-05-07T19:44:13.1676288Z 2025-05-07T19:44:13.1676759Z ## Package Plan ## 2025-05-07T19:44:13.1676977Z 2025-05-07T19:44:13.1677210Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:44:13.1677553Z 2025-05-07T19:44:13.1677673Z added / updated specs: 2025-05-07T19:44:13.1677938Z - libxcrypt 2025-05-07T19:44:13.1678095Z 2025-05-07T19:44:13.1678101Z 2025-05-07T19:44:13.1678236Z The following packages will be downloaded: 2025-05-07T19:44:13.1678475Z 2025-05-07T19:44:13.1678630Z package | build 2025-05-07T19:44:13.1679300Z ---------------------------|----------------- 2025-05-07T19:44:13.1679789Z libxcrypt-4.4.36 | hd590300_1 98 KB conda-forge 2025-05-07T19:44:13.1680247Z ------------------------------------------------------------ 2025-05-07T19:44:13.1680659Z Total: 98 KB 2025-05-07T19:44:13.1680917Z 2025-05-07T19:44:13.1681060Z The following NEW packages will be INSTALLED: 2025-05-07T19:44:13.1681336Z 2025-05-07T19:44:13.1681595Z libxcrypt conda-forge/linux-64::libxcrypt-4.4.36-hd590300_1 2025-05-07T19:44:13.1682088Z 2025-05-07T19:44:13.1682094Z 2025-05-07T19:44:13.1682100Z 2025-05-07T19:44:13.1682304Z Downloading and Extracting Packages: ...working... 
2025-05-07T19:44:13.2857840Z libxcrypt-4.4.36 | 98 KB | | 0% 2025-05-07T19:44:13.2877778Z libxcrypt-4.4.36 | 98 KB | #6 | 16% 2025-05-07T19:44:13.2985671Z libxcrypt-4.4.36 | 98 KB | ########## | 100% 2025-05-07T19:44:13.2989012Z done 2025-05-07T19:44:13.4003102Z Preparing transaction: / done 2025-05-07T19:44:13.5008469Z Verifying transaction: \ done 2025-05-07T19:44:13.6018178Z Executing transaction: / done 2025-05-07T19:44:16.8997296Z [SETUP] Copying over ... 2025-05-07T19:44:16.8998073Z + cp /github/home/miniconda/envs/build_binary/include/crypt.h /github/home/miniconda/envs/build_binary/include/python3.9/crypt.h 2025-05-07T19:44:16.8998685Z 2025-05-07T19:44:16.9021244Z 2025-05-07T19:44:18.5021229Z [SETUP] Installed Python version: Python 3.9.21 2025-05-07T19:44:18.5021847Z [SETUP] Successfully created Conda environment: build_binary 2025-05-07T19:44:18.5088064Z ##[group]Run . $PRELUDE; install_cxx_compiler $BUILD_ENV gcc 2025-05-07T19:44:18.5088599Z .
$PRELUDE; install_cxx_compiler $BUILD_ENV gcc 2025-05-07T19:44:18.5089296Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:44:18.5089672Z env: 2025-05-07T19:44:18.5089936Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:44:18.5090306Z BUILD_ENV: build_binary 2025-05-07T19:44:18.5090585Z BUILD_TARGET: default 2025-05-07T19:44:18.5090878Z BUILD_VARIANT: cuda 2025-05-07T19:44:18.5091141Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:44:18.5091689Z ##[endgroup] 2025-05-07T19:44:18.9335332Z ################################################################################ 2025-05-07T19:44:18.9336420Z # Install C/C++ Compilers 2025-05-07T19:44:18.9337164Z # 2025-05-07T19:44:18.9349165Z # [2025-05-07T19:44:18.934Z] + install_cxx_compiler build_binary gcc 2025-05-07T19:44:18.9349750Z ################################################################################ 2025-05-07T19:44:18.9349998Z 2025-05-07T19:44:18.9367004Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:44:19.0184136Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:44:19.0190236Z [INSTALL] Installing GLIBC (architecture = 64) ... 
2025-05-07T19:44:19.0214370Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y sysroot_linux-64=2.17 2025-05-07T19:44:19.6832764Z Channels: 2025-05-07T19:44:19.6833664Z - conda-forge 2025-05-07T19:44:19.6834352Z Platform: linux-64 2025-05-07T19:44:22.7678216Z Collecting package metadata (repodata.json): - \ | / done 2025-05-07T19:44:23.1954163Z Solving environment: \ done 2025-05-07T19:44:23.2413160Z 2025-05-07T19:44:23.2413691Z ## Package Plan ## 2025-05-07T19:44:23.2414047Z 2025-05-07T19:44:23.2414309Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:44:23.2414672Z 2025-05-07T19:44:23.2414786Z added / updated specs: 2025-05-07T19:44:23.2415119Z - sysroot_linux-64=2.17 2025-05-07T19:44:23.2415302Z 2025-05-07T19:44:23.2415306Z 2025-05-07T19:44:23.2415447Z The following packages will be downloaded: 2025-05-07T19:44:23.2415774Z 2025-05-07T19:44:23.2415904Z package | build 2025-05-07T19:44:23.2416280Z ---------------------------|----------------- 2025-05-07T19:44:23.2416736Z kernel-headers_linux-64-3.10.0| he073ed8_18 921 KB conda-forge 2025-05-07T19:44:23.2417301Z sysroot_linux-64-2.17 | h0157908_18 14.5 MB conda-forge 2025-05-07T19:44:23.2417764Z ------------------------------------------------------------ 2025-05-07T19:44:23.2418166Z Total: 15.4 MB 2025-05-07T19:44:23.2418396Z 2025-05-07T19:44:23.2418536Z The following NEW packages will be INSTALLED: 2025-05-07T19:44:23.2418810Z 2025-05-07T19:44:23.2419129Z kernel-headers_li~ conda-forge/noarch::kernel-headers_linux-64-3.10.0-he073ed8_18 2025-05-07T19:44:23.2419941Z sysroot_linux-64 conda-forge/noarch::sysroot_linux-64-2.17-h0157908_18 2025-05-07T19:44:23.2420287Z 2025-05-07T19:44:23.2420291Z 2025-05-07T19:44:23.2420295Z 2025-05-07T19:44:23.2420454Z Downloading and Extracting Packages: ...working... 
2025-05-07T19:44:23.2420896Z sysroot_linux-64-2.1 | 14.5 MB | | 0% 2025-05-07T19:44:23.4405029Z kernel-headers_linux | 921 KB | | 0% 2025-05-07T19:44:23.4562903Z kernel-headers_linux | 921 KB | 1 | 2% 2025-05-07T19:44:23.5950712Z kernel-headers_linux | 921 KB | ########## | 100% 2025-05-07T19:44:23.5951177Z sysroot_linux-64-2.1 | 14.5 MB | ########## | 100% 2025-05-07T19:44:24.0592171Z done 2025-05-07T19:44:24.1601362Z Preparing transaction: / done 2025-05-07T19:44:24.3609737Z Verifying transaction: \ | done 2025-05-07T19:44:24.4620016Z Executing transaction: - done 2025-05-07T19:44:24.5441862Z [CHECK] LD_LIBRARY_PATH = 2025-05-07T19:44:24.5442365Z [CHECK] CONDA_PREFIX is not set. 2025-05-07T19:44:26.1778537Z [CHECK] libstdc++.so.6 found in CONDA_PREFIX PATH (symbolic link): /github/home/miniconda/envs/build_binary/lib/libstdc++.so.6 2025-05-07T19:44:26.1800088Z [INSTALL] Installing GCC (11.4.0, 64) through Conda ...
2025-05-07T19:44:26.1824795Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y gxx_linux-64=11.4.0 2025-05-07T19:44:26.8919154Z Channels: 2025-05-07T19:44:26.8919862Z - conda-forge 2025-05-07T19:44:26.8920527Z Platform: linux-64 2025-05-07T19:44:29.9682755Z Collecting package metadata (repodata.json): - \ | / done 2025-05-07T19:44:31.1040544Z Solving environment: \ | / done 2025-05-07T19:44:31.1550844Z 2025-05-07T19:44:31.1551467Z ## Package Plan ## 2025-05-07T19:44:31.1551939Z 2025-05-07T19:44:31.1552551Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:44:31.1553573Z 2025-05-07T19:44:31.1553849Z added / updated specs: 2025-05-07T19:44:31.1554629Z - gxx_linux-64=11.4.0 2025-05-07T19:44:31.1555098Z 2025-05-07T19:44:31.1555111Z 2025-05-07T19:44:31.1555468Z The following packages will be downloaded: 2025-05-07T19:44:31.1556164Z 2025-05-07T19:44:31.1556495Z package | build 2025-05-07T19:44:31.1557473Z ---------------------------|----------------- 2025-05-07T19:44:31.1558736Z binutils_impl_linux-64-2.40| ha1999f0_7 6.0 MB conda-forge 2025-05-07T19:44:31.1560275Z binutils_linux-64-2.40 | hb3c18ed_4 28 KB conda-forge 2025-05-07T19:44:31.1560785Z gcc_impl_linux-64-11.4.0 | h00c12a0_13 53.0 MB conda-forge 2025-05-07T19:44:31.1561307Z gcc_linux-64-11.4.0 | ha077dfb_4 31 KB conda-forge 2025-05-07T19:44:31.1561789Z gxx_impl_linux-64-11.4.0 | h634f3ee_13 11.2 MB conda-forge 2025-05-07T19:44:31.1562307Z gxx_linux-64-11.4.0 | h35bfe5d_4 29 KB conda-forge 2025-05-07T19:44:31.1562859Z ld_impl_linux-64-2.40 | hf3520f5_7 691 KB conda-forge 2025-05-07T19:44:31.1563368Z libgcc-devel_linux-64-11.4.0| h8f596e0_113 2.3 MB conda-forge 2025-05-07T19:44:31.1563917Z libsanitizer-11.4.0 | h5763a12_13 3.5 MB conda-forge 2025-05-07T19:44:31.1564409Z libstdcxx-15.1.0 | h8f9b012_2 3.7 MB conda-forge 2025-05-07T19:44:31.1565069Z libstdcxx-devel_linux-64-11.4.0| h8f596e0_113 11.1 MB conda-forge 2025-05-07T19:44:31.1565799Z 
libstdcxx-ng-15.1.0 | h4852527_2 34 KB conda-forge 2025-05-07T19:44:31.1566239Z ------------------------------------------------------------ 2025-05-07T19:44:31.1566638Z Total: 91.6 MB 2025-05-07T19:44:31.1566871Z 2025-05-07T19:44:31.1567011Z The following NEW packages will be INSTALLED: 2025-05-07T19:44:31.1567291Z 2025-05-07T19:44:31.1567609Z binutils_impl_lin~ conda-forge/linux-64::binutils_impl_linux-64-2.40-ha1999f0_7 2025-05-07T19:44:31.1568252Z binutils_linux-64 conda-forge/linux-64::binutils_linux-64-2.40-hb3c18ed_4 2025-05-07T19:44:31.1568847Z gcc_impl_linux-64 conda-forge/linux-64::gcc_impl_linux-64-11.4.0-h00c12a0_13 2025-05-07T19:44:31.1569771Z gcc_linux-64 conda-forge/linux-64::gcc_linux-64-11.4.0-ha077dfb_4 2025-05-07T19:44:31.1570342Z gxx_impl_linux-64 conda-forge/linux-64::gxx_impl_linux-64-11.4.0-h634f3ee_13 2025-05-07T19:44:31.1570927Z gxx_linux-64 conda-forge/linux-64::gxx_linux-64-11.4.0-h35bfe5d_4 2025-05-07T19:44:31.1571540Z libgcc-devel_linu~ conda-forge/noarch::libgcc-devel_linux-64-11.4.0-h8f596e0_113 2025-05-07T19:44:31.1572157Z libsanitizer conda-forge/linux-64::libsanitizer-11.4.0-h5763a12_13 2025-05-07T19:44:31.1593622Z libstdcxx conda-forge/linux-64::libstdcxx-15.1.0-h8f9b012_2 2025-05-07T19:44:31.1594614Z libstdcxx-devel_l~ conda-forge/noarch::libstdcxx-devel_linux-64-11.4.0-h8f596e0_113 2025-05-07T19:44:31.1595151Z 2025-05-07T19:44:31.1595286Z The following packages will be UPDATED: 2025-05-07T19:44:31.1595516Z 2025-05-07T19:44:31.1596046Z ld_impl_linux-64 pkgs/main::ld_impl_linux-64-2.40-h12e~ --> conda-forge::ld_impl_linux-64-2.40-hf3520f5_7 2025-05-07T19:44:31.1596888Z libstdcxx-ng pkgs/main::libstdcxx-ng-11.2.0-h12345~ --> conda-forge::libstdcxx-ng-15.1.0-h4852527_2 2025-05-07T19:44:31.1597350Z 2025-05-07T19:44:31.1597354Z 2025-05-07T19:44:31.1597383Z 2025-05-07T19:44:31.1597541Z Downloading and Extracting Packages: ...working... 
2025-05-07T19:44:31.1597978Z gcc_impl_linux-64-11 | 53.0 MB | | 0%
2025-05-07T19:44:31.1598601Z gxx_impl_linux-64-11 | 11.2 MB | | 0%
2025-05-07T19:44:31.1599185Z libstdcxx-devel_linu | 11.1 MB | | 0%
2025-05-07T19:44:31.1599752Z binutils_impl_linux- | 6.0 MB | | 0%
2025-05-07T19:44:31.1600304Z libstdcxx-15.1.0 | 3.7 MB | | 0%
2025-05-07T19:44:31.1600914Z libsanitizer-11.4.0 | 3.5 MB | | 0%
2025-05-07T19:44:31.1601551Z libgcc-devel_linux-6 | 2.3 MB | | 0%
2025-05-07T19:44:31.1602300Z ld_impl_linux-64-2.4 | 691 KB | | 0%
2025-05-07T19:44:31.1602929Z libstdcxx-ng-15.1.0 | 34 KB | | 0%
2025-05-07T19:44:31.1603550Z gcc_linux-64-11.4.0 | 31 KB | | 0%
2025-05-07T19:44:31.1604149Z gxx_linux-64-11.4.0 | 29 KB | | 0%
2025-05-07T19:44:31.3071744Z binutils_linux-64-2. | 28 KB | | 0%
2025-05-07T19:44:31.3073734Z libstdcxx-15.1.0 | 3.7 MB | | 0%
2025-05-07T19:44:31.3075733Z libstdcxx-devel_linu | 11.1 MB | | 0%
2025-05-07T19:44:31.3586216Z binutils_impl_linux- | 6.0 MB | | 0%
2025-05-07T19:44:31.3736528Z gxx_impl_linux-64-11 | 11.2 MB | | 0%
2025-05-07T19:44:31.4078171Z gcc_impl_linux-64-11 | 53.0 MB | | 0%
2025-05-07T19:44:31.4181157Z libstdcxx-devel_linu | 11.1 MB | #######2 | 73%
2025-05-07T19:44:31.4184010Z binutils_impl_linux- | 6.0 MB | ########## | 100%
2025-05-07T19:44:31.4585749Z binutils_impl_linux- | 6.0 MB | ########## | 100%
2025-05-07T19:44:31.4597525Z gxx_impl_linux-64-11 | 11.2 MB | #######5 | 76%
2025-05-07T19:44:31.4626986Z libsanitizer-11.4.0 | 3.5 MB | | 0%
2025-05-07T19:44:31.4736385Z libstdcxx-15.1.0 | 3.7 MB | ########9 | 90%
2025-05-07T19:44:31.5124870Z gcc_impl_linux-64-11 | 53.0 MB | #4 | 14%
2025-05-07T19:44:31.5390823Z libstdcxx-15.1.0 | 3.7 MB | ########## | 100%
2025-05-07T19:44:31.5461976Z libsanitizer-11.4.0 | 3.5 MB | ########## | 100%
2025-05-07T19:44:31.5749435Z libgcc-devel_linux-6 | 2.3 MB | | 1%
2025-05-07T19:44:31.5822062Z gcc_impl_linux-64-11 | 53.0 MB | ###2 | 32%
2025-05-07T19:44:31.5960341Z ld_impl_linux-64-2.4 | 691 KB | 2 | 2%
2025-05-07T19:44:31.6012464Z ld_impl_linux-64-2.4 | 691 KB | ########## | 100%
2025-05-07T19:44:31.6014133Z libstdcxx-devel_linu | 11.1 MB | ########## | 100%
2025-05-07T19:44:31.6184166Z libgcc-devel_linux-6 | 2.3 MB | ########## | 100%
2025-05-07T19:44:31.6368669Z gxx_impl_linux-64-11 | 11.2 MB | ########## | 100%
2025-05-07T19:44:31.6382223Z gcc_linux-64-11.4.0 | 31 KB | #####2 | 52%
2025-05-07T19:44:31.6407303Z gcc_linux-64-11.4.0 | 31 KB | ########## | 100%
2025-05-07T19:44:31.6419770Z libstdcxx-ng-15.1.0 | 34 KB | ####7 | 47%
2025-05-07T19:44:31.6547683Z libstdcxx-ng-15.1.0 | 34 KB | ########## | 100%
2025-05-07T19:44:31.6558836Z gxx_linux-64-11.4.0 | 29 KB | #####5 | 55%
2025-05-07T19:44:31.6560841Z binutils_linux-64-2. | 28 KB | #####6 | 56%
2025-05-07T19:44:31.6571280Z gxx_linux-64-11.4.0 | 29 KB | ########## | 100%
2025-05-07T19:44:31.6656635Z binutils_linux-64-2. | 28 KB | ########## | 100%
2025-05-07T19:44:31.6750330Z libstdcxx-15.1.0 | 3.7 MB | ########## | 100%
2025-05-07T19:44:31.6935089Z gcc_impl_linux-64-11 | 53.0 MB | ####7 | 47%
2025-05-07T19:44:31.6936959Z ld_impl_linux-64-2.4 | 691 KB | ########## | 100%
2025-05-07T19:44:31.7041205Z ld_impl_linux-64-2.4 | 691 KB | ########## | 100%
2025-05-07T19:44:31.7043021Z libsanitizer-11.4.0 | 3.5 MB | ########## | 100%
2025-05-07T19:44:31.7806241Z libsanitizer-11.4.0 | 3.5 MB | ########## | 100%
2025-05-07T19:44:31.7806967Z libgcc-devel_linux-6 | 2.3 MB | ########## | 100%
2025-05-07T19:44:31.7861815Z libgcc-devel_linux-6 | 2.3 MB | ########## | 100%
2025-05-07T19:44:31.7863534Z binutils_impl_linux- | 6.0 MB | ########## | 100%
2025-05-07T19:44:31.8166902Z gcc_impl_linux-64-11 | 53.0 MB | ######6 | 67%
2025-05-07T19:44:31.8168390Z gcc_linux-64-11.4.0 | 31 KB | ########## | 100%
2025-05-07T19:44:31.8503128Z gcc_linux-64-11.4.0 | 31 KB | ########## | 100%
2025-05-07T19:44:31.8504990Z libstdcxx-ng-15.1.0 | 34 KB | ########## | 100%
2025-05-07T19:44:31.8742409Z libstdcxx-ng-15.1.0 | 34 KB | ########## | 100%
2025-05-07T19:44:31.8744349Z gxx_linux-64-11.4.0 | 29 KB | ########## | 100%
2025-05-07T19:44:31.8942822Z gxx_linux-64-11.4.0 | 29 KB | ########## | 100%
2025-05-07T19:44:31.8945232Z binutils_linux-64-2. | 28 KB | ########## | 100%
2025-05-07T19:44:31.9863124Z binutils_linux-64-2. | 28 KB | ########## | 100%
2025-05-07T19:44:32.0864067Z gxx_impl_linux-64-11 | 11.2 MB | ########## | 100%
2025-05-07T19:44:32.1098924Z gcc_impl_linux-64-11 | 53.0 MB | ########1 | 82%
2025-05-07T19:44:32.4272175Z libstdcxx-devel_linu | 11.1 MB | ########## | 100%
2025-05-07T19:44:32.6691536Z gcc_impl_linux-64-11 | 53.0 MB | #########3 | 94%
2025-05-07T19:44:33.1959982Z gcc_impl_linux-64-11 | 53.0 MB | ########## | 100%
2025-05-07T19:44:33.1964558Z gcc_impl_linux-64-11 | 53.0 MB | ########## | 100%
2025-05-07T19:44:33.1973458Z done
2025-05-07T19:44:33.2978642Z Preparing transaction: \ done
2025-05-07T19:44:33.5985495Z Verifying transaction: / - \ done
2025-05-07T19:44:33.6999242Z Executing transaction: / done
2025-05-07T19:44:33.7870140Z [INSTALL] Setting the C/C++ compiler symlinks ...
2025-05-07T19:44:37.4742246Z + ln -sf /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-cc /github/home/miniconda/envs/build_binary/bin/cc 2025-05-07T19:44:37.4744498Z 2025-05-07T19:44:37.4765578Z 2025-05-07T19:44:37.4784152Z + ln -sf /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-cc /github/home/miniconda/envs/build_binary/bin/gcc 2025-05-07T19:44:37.4803577Z 2025-05-07T19:44:37.4803585Z 2025-05-07T19:44:37.4823936Z + ln -sf /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ /github/home/miniconda/envs/build_binary/bin/c++ 2025-05-07T19:44:37.4825746Z 2025-05-07T19:44:37.4836856Z 2025-05-07T19:44:37.4855714Z + ln -sf /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ /github/home/miniconda/envs/build_binary/bin/g++ 2025-05-07T19:44:37.4856383Z 2025-05-07T19:44:37.4864287Z 2025-05-07T19:44:39.2680393Z /github/home/miniconda/envs/build_binary/bin/cc 2025-05-07T19:44:39.2680747Z 2025-05-07T19:44:39.3438781Z [CHECK] Binary cc found in PATH 2025-05-07T19:44:41.1339000Z /github/home/miniconda/envs/build_binary/bin/gcc 2025-05-07T19:44:41.1339353Z 2025-05-07T19:44:41.1918478Z [CHECK] Binary gcc found in PATH 2025-05-07T19:44:42.9632331Z /github/home/miniconda/envs/build_binary/bin/c++ 2025-05-07T19:44:42.9633187Z 2025-05-07T19:44:43.0202205Z [CHECK] Binary c++ found in PATH 2025-05-07T19:44:44.8110285Z /github/home/miniconda/envs/build_binary/bin/g++ 2025-05-07T19:44:44.8110615Z 2025-05-07T19:44:44.8842673Z [CHECK] Binary g++ found in PATH 2025-05-07T19:44:44.8845345Z [INFO] Printing out all preprocessor defines in the C compiler ... 
2025-05-07T19:44:44.8846722Z + conda run -n build_binary cc -dM -E - 2025-05-07T19:44:44.8847364Z 2025-05-07T19:44:46.6848781Z #define __DBL_MIN_EXP__ (-1021) 2025-05-07T19:44:46.6849297Z #define __UINT_LEAST16_MAX__ 0xffff 2025-05-07T19:44:46.6849889Z #define __ATOMIC_ACQUIRE 2 2025-05-07T19:44:46.6850199Z #define __FLT128_MAX_10_EXP__ 4932 2025-05-07T19:44:46.6850602Z #define __FLT_MIN__ 1.17549435082228750796873653722224568e-38F 2025-05-07T19:44:46.6851042Z #define __GCC_IEC_559_COMPLEX 2 2025-05-07T19:44:46.6851384Z #define __UINT_LEAST8_TYPE__ unsigned char 2025-05-07T19:44:46.6851730Z #define __SIZEOF_FLOAT80__ 16 2025-05-07T19:44:46.6852048Z #define __INTMAX_C(c) c ## L 2025-05-07T19:44:46.6852327Z #define __CHAR_BIT__ 8 2025-05-07T19:44:46.6852624Z #define __UINT8_MAX__ 0xff 2025-05-07T19:44:46.6852927Z #define __SCHAR_WIDTH__ 8 2025-05-07T19:44:46.6853212Z #define __WINT_MAX__ 0xffffffffU 2025-05-07T19:44:46.6853561Z #define __FLT32_MIN_EXP__ (-125) 2025-05-07T19:44:46.6853867Z #define __ORDER_LITTLE_ENDIAN__ 1234 2025-05-07T19:44:46.6854229Z #define __SIZE_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:46.6854568Z #define __WCHAR_MAX__ 0x7fffffff 2025-05-07T19:44:46.6854917Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_1 1 2025-05-07T19:44:46.6855283Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_2 1 2025-05-07T19:44:46.6855673Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_4 1 2025-05-07T19:44:46.6856158Z #define __DBL_DENORM_MIN__ ((double)4.94065645841246544176568792868221372e-324L) 2025-05-07T19:44:46.6856642Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_8 1 2025-05-07T19:44:46.6857024Z #define __GCC_ATOMIC_CHAR_LOCK_FREE 2 2025-05-07T19:44:46.6857342Z #define __GCC_IEC_559 2 2025-05-07T19:44:46.6857645Z #define __FLT32X_DECIMAL_DIG__ 17 2025-05-07T19:44:46.6858062Z #define __FLT_EVAL_METHOD__ 0 2025-05-07T19:44:46.6858379Z #define __FLT64_DECIMAL_DIG__ 17 2025-05-07T19:44:46.6858960Z #define __GCC_ATOMIC_CHAR32_T_LOCK_FREE 2 2025-05-07T19:44:46.6859376Z #define 
__UINT_FAST64_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:46.6859878Z #define __SIG_ATOMIC_TYPE__ int 2025-05-07T19:44:46.6860214Z #define __DBL_MIN_10_EXP__ (-307) 2025-05-07T19:44:46.6860523Z #define __FINITE_MATH_ONLY__ 0 2025-05-07T19:44:46.6860851Z #define __FLT32X_MAX_EXP__ 1024 2025-05-07T19:44:46.6861148Z #define __FLT32_HAS_DENORM__ 1 2025-05-07T19:44:46.6861467Z #define __UINT_FAST8_MAX__ 0xff 2025-05-07T19:44:46.6861757Z #define __FLT32_MAX_10_EXP__ 38 2025-05-07T19:44:46.6862070Z #define __DEC64_MAX_EXP__ 385 2025-05-07T19:44:46.6862479Z #define __INT8_C(c) c 2025-05-07T19:44:46.6862775Z #define __INT_LEAST8_WIDTH__ 8 2025-05-07T19:44:46.6863104Z #define __UINT_LEAST64_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:46.6863500Z #define __SHRT_MAX__ 0x7fff 2025-05-07T19:44:46.6863882Z #define __LDBL_MAX__ 1.18973149535723176502126385303097021e+4932L 2025-05-07T19:44:46.6864281Z #define __FLT64X_MAX_10_EXP__ 4932 2025-05-07T19:44:46.6864623Z #define __LDBL_IS_IEC_60559__ 2 2025-05-07T19:44:46.6864921Z #define __FLT64X_HAS_QUIET_NAN__ 1 2025-05-07T19:44:46.6865257Z #define __UINT_LEAST8_MAX__ 0xff 2025-05-07T19:44:46.6865565Z #define __GCC_ATOMIC_BOOL_LOCK_FREE 2 2025-05-07T19:44:46.6866029Z #define __FLT128_DENORM_MIN__ 6.47517511943802511092443895822764655e-4966F128 2025-05-07T19:44:46.6866490Z #define __UINTMAX_TYPE__ long unsigned int 2025-05-07T19:44:46.6866842Z #define __linux 1 2025-05-07T19:44:46.6867122Z #define __DEC32_EPSILON__ 1E-6DF 2025-05-07T19:44:46.6867429Z #define __FLT_EVAL_METHOD_TS_18661_3__ 0 2025-05-07T19:44:46.6867778Z #define __unix 1 2025-05-07T19:44:46.6868030Z #define __UINT32_MAX__ 0xffffffffU 2025-05-07T19:44:46.6868366Z #define __FLT128_MIN_EXP__ (-16381) 2025-05-07T19:44:46.6868666Z #define __WINT_MIN__ 0U 2025-05-07T19:44:46.6868967Z #define __FLT128_MIN_10_EXP__ (-4931) 2025-05-07T19:44:46.6869282Z #define __FLT32X_IS_IEC_60559__ 2 2025-05-07T19:44:46.6869613Z #define __INT_LEAST16_WIDTH__ 16 2025-05-07T19:44:46.6869919Z 
#define __SCHAR_MAX__ 0x7f 2025-05-07T19:44:46.6870228Z #define __FLT128_MANT_DIG__ 113 2025-05-07T19:44:46.6870572Z #define __WCHAR_MIN__ (-__WCHAR_MAX__ - 1) 2025-05-07T19:44:46.6870910Z #define __INT64_C(c) c ## L 2025-05-07T19:44:46.6871233Z #define __GCC_ATOMIC_POINTER_LOCK_FREE 2 2025-05-07T19:44:46.6871563Z #define __FLT32X_MANT_DIG__ 53 2025-05-07T19:44:46.6871886Z #define __USER_LABEL_PREFIX__ 2025-05-07T19:44:46.6872370Z #define __FLT64X_EPSILON__ 1.08420217248550443400745280086994171e-19F64x 2025-05-07T19:44:46.6872790Z #define __STDC_HOSTED__ 1 2025-05-07T19:44:46.6873058Z #define __DEC64_MIN_EXP__ (-382) 2025-05-07T19:44:46.6873357Z #define __DBL_DIG__ 15 2025-05-07T19:44:46.6873602Z #define __FLT32_DIG__ 6 2025-05-07T19:44:46.6873945Z #define __FLT_EPSILON__ 1.19209289550781250000000000000000000e-7F 2025-05-07T19:44:46.6874340Z #define __SHRT_WIDTH__ 16 2025-05-07T19:44:46.6874600Z #define __FLT32_IS_IEC_60559__ 2 2025-05-07T19:44:46.6874979Z #define __LDBL_MIN__ 3.36210314311209350626267781732175260e-4932L 2025-05-07T19:44:46.6875340Z #define __STDC_UTF_16__ 1 2025-05-07T19:44:46.6875627Z #define __DBL_IS_IEC_60559__ 2 2025-05-07T19:44:46.6875901Z #define __DEC32_MAX__ 9.999999E96DF 2025-05-07T19:44:46.6876317Z #define __FLT64X_DENORM_MIN__ 3.64519953188247460252840593361941982e-4951F64x 2025-05-07T19:44:46.6876733Z #define __FLT32X_HAS_INFINITY__ 1 2025-05-07T19:44:46.6877040Z #define __INT32_MAX__ 0x7fffffff 2025-05-07T19:44:46.6877332Z #define __unix__ 1 2025-05-07T19:44:46.6877563Z #define __INT_WIDTH__ 32 2025-05-07T19:44:46.6877848Z #define __SIZEOF_LONG__ 8 2025-05-07T19:44:46.6878107Z #define __STDC_IEC_559__ 1 2025-05-07T19:44:46.6878395Z #define __STDC_ISO_10646__ 201103L 2025-05-07T19:44:46.6878673Z #define __UINT16_C(c) c 2025-05-07T19:44:46.6878948Z #define __DECIMAL_DIG__ 21 2025-05-07T19:44:46.6879211Z #define __STDC_IEC_559_COMPLEX__ 1 2025-05-07T19:44:46.6879599Z #define __FLT64_EPSILON__ 2.22044604925031308084726333618164062e-16F64 
2025-05-07T19:44:46.6879974Z #define __gnu_linux__ 1 2025-05-07T19:44:46.6880334Z #define __FLT128_IS_IEC_60559__ 2 2025-05-07T19:44:46.6880658Z #define __FLT64X_MIN_10_EXP__ (-4931) 2025-05-07T19:44:46.6880956Z #define __LDBL_HAS_QUIET_NAN__ 1 2025-05-07T19:44:46.6881273Z #define __FLT64_MANT_DIG__ 53 2025-05-07T19:44:46.6881541Z #define __FLT64X_MANT_DIG__ 64 2025-05-07T19:44:46.6881831Z #define __GNUC__ 11 2025-05-07T19:44:46.6882871Z #define __pie__ 2 2025-05-07T19:44:46.6883141Z #define __MMX__ 1 2025-05-07T19:44:46.6883461Z #define __FLT_HAS_DENORM__ 1 2025-05-07T19:44:46.6883787Z #define __SIZEOF_LONG_DOUBLE__ 16 2025-05-07T19:44:46.6884378Z #define __BIGGEST_ALIGNMENT__ 16 2025-05-07T19:44:46.6884709Z #define __FLT64_MAX_10_EXP__ 308 2025-05-07T19:44:46.6885121Z #define __DBL_MAX__ ((double)1.79769313486231570814527423731704357e+308L) 2025-05-07T19:44:46.6885567Z #define __INT_FAST32_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:46.6885946Z #define __DBL_HAS_INFINITY__ 1 2025-05-07T19:44:46.6886238Z #define __SIZEOF_FLOAT__ 4 2025-05-07T19:44:46.6886569Z #define __HAVE_SPECULATION_SAFE_VALUE 1 2025-05-07T19:44:46.6886906Z #define __DEC32_MIN_EXP__ (-94) 2025-05-07T19:44:46.6887231Z #define __INTPTR_WIDTH__ 64 2025-05-07T19:44:46.6887525Z #define __FLT64X_HAS_INFINITY__ 1 2025-05-07T19:44:46.6887874Z #define __UINT_LEAST32_MAX__ 0xffffffffU 2025-05-07T19:44:46.6888206Z #define __FLT32X_HAS_DENORM__ 1 2025-05-07T19:44:46.6888545Z #define __INT_FAST16_TYPE__ long int 2025-05-07T19:44:46.6888897Z #define __MMX_WITH_SSE__ 1 2025-05-07T19:44:46.6889181Z #define __LDBL_HAS_DENORM__ 1 2025-05-07T19:44:46.6889505Z #define __FLT128_HAS_INFINITY__ 1 2025-05-07T19:44:46.6889810Z #define __DEC32_MIN__ 1E-95DF 2025-05-07T19:44:46.6890129Z #define __DBL_MAX_EXP__ 1024 2025-05-07T19:44:46.6890418Z #define __WCHAR_WIDTH__ 32 2025-05-07T19:44:46.6890797Z #define __FLT32_MAX__ 3.40282346638528859811704183484516925e+38F32 2025-05-07T19:44:46.6891200Z #define __DEC128_EPSILON__ 
1E-33DL 2025-05-07T19:44:46.6891531Z #define __SSE2_MATH__ 1 2025-05-07T19:44:46.6891846Z #define __ATOMIC_HLE_RELEASE 131072 2025-05-07T19:44:46.6892184Z #define __PTRDIFF_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:46.6892537Z #define __amd64 1 2025-05-07T19:44:46.6892791Z #define __STDC_NO_THREADS__ 1 2025-05-07T19:44:46.6893117Z #define __ATOMIC_HLE_ACQUIRE 65536 2025-05-07T19:44:46.6893452Z #define __LONG_LONG_MAX__ 0x7fffffffffffffffLL 2025-05-07T19:44:46.6893827Z #define __SIZEOF_SIZE_T__ 8 2025-05-07T19:44:46.6894119Z #define __FLT64X_MIN_EXP__ (-16381) 2025-05-07T19:44:46.6894448Z #define __SIZEOF_WINT_T__ 4 2025-05-07T19:44:46.6894730Z #define __LONG_LONG_WIDTH__ 64 2025-05-07T19:44:46.6895052Z #define __FLT32_MAX_EXP__ 128 2025-05-07T19:44:46.6895585Z #define __GXX_ABI_VERSION 1016 2025-05-07T19:44:46.6895856Z #define __FLT_MIN_EXP__ (-125) 2025-05-07T19:44:46.6896160Z #define __GCC_HAVE_DWARF2_CFI_ASM 1 2025-05-07T19:44:46.6896446Z #define __INT16_MAX__ 0x7fff 2025-05-07T19:44:46.6896729Z #define __x86_64 1 2025-05-07T19:44:46.6896970Z #define __INT_FAST64_TYPE__ long int 2025-05-07T19:44:46.6897384Z #define __FLT64_DENORM_MIN__ 4.94065645841246544176568792868221372e-324F64 2025-05-07T19:44:46.6897857Z #define __DBL_MIN__ ((double)2.22507385850720138309023271733240406e-308L) 2025-05-07T19:44:46.6898352Z #define __FLT128_EPSILON__ 1.92592994438723585305597794258492732e-34F128 2025-05-07T19:44:46.6898864Z #define __FLT64X_NORM_MAX__ 1.18973149535723176502126385303097021e+4932F64x 2025-05-07T19:44:46.6899265Z #define __SIZEOF_POINTER__ 8 2025-05-07T19:44:46.6899613Z #define __LP64__ 1 2025-05-07T19:44:46.6899862Z #define __DBL_HAS_QUIET_NAN__ 1 2025-05-07T19:44:46.6900445Z #define __FLT32X_EPSILON__ 2.22044604925031308084726333618164062e-16F32x 2025-05-07T19:44:46.6900878Z #define __DECIMAL_BID_FORMAT__ 1 2025-05-07T19:44:46.6901209Z #define __FLT64_MIN_EXP__ (-1021) 2025-05-07T19:44:46.6901511Z #define __FLT64_MIN_10_EXP__ (-307) 
2025-05-07T19:44:46.6901854Z #define __FLT64X_DECIMAL_DIG__ 21 2025-05-07T19:44:46.6902160Z #define __DEC128_MIN__ 1E-6143DL 2025-05-07T19:44:46.6902584Z #define __REGISTER_PREFIX__ 2025-05-07T19:44:46.6902903Z #define __UINT16_MAX__ 0xffff 2025-05-07T19:44:46.6903194Z #define __DBL_HAS_DENORM__ 1 2025-05-07T19:44:46.6903513Z #define __LDBL_HAS_INFINITY__ 1 2025-05-07T19:44:46.6903879Z #define __FLT32_MIN__ 1.17549435082228750796873653722224568e-38F32 2025-05-07T19:44:46.6904307Z #define __UINT8_TYPE__ unsigned char 2025-05-07T19:44:46.6904617Z #define __FLT_DIG__ 6 2025-05-07T19:44:46.6904911Z #define __NO_INLINE__ 1 2025-05-07T19:44:46.6905183Z #define __DEC_EVAL_METHOD__ 2 2025-05-07T19:44:46.6905573Z #define __DEC128_MAX__ 9.999999999999999999999999999999999E6144DL 2025-05-07T19:44:46.6906066Z #define __FLT_MANT_DIG__ 24 2025-05-07T19:44:46.6906352Z #define __LDBL_DECIMAL_DIG__ 21 2025-05-07T19:44:46.6906680Z #define __VERSION__ "11.4.0" 2025-05-07T19:44:46.6906963Z #define __UINT64_C(c) c ## UL 2025-05-07T19:44:46.6907275Z #define _STDC_PREDEF_H 1 2025-05-07T19:44:46.6907561Z #define __INT_LEAST32_MAX__ 0x7fffffff 2025-05-07T19:44:46.6907931Z #define __GCC_ATOMIC_INT_LOCK_FREE 2 2025-05-07T19:44:46.6908253Z #define __FLT128_MAX_EXP__ 16384 2025-05-07T19:44:46.6908577Z #define __FLT32_MANT_DIG__ 24 2025-05-07T19:44:46.6908915Z #define __FLOAT_WORD_ORDER__ __ORDER_LITTLE_ENDIAN__ 2025-05-07T19:44:46.6909309Z #define __FLT128_HAS_DENORM__ 1 2025-05-07T19:44:46.6909638Z #define __FLT32_DECIMAL_DIG__ 9 2025-05-07T19:44:46.6909925Z #define __FLT128_DIG__ 33 2025-05-07T19:44:46.6910221Z #define __INT32_C(c) c 2025-05-07T19:44:46.6910487Z #define __DEC64_EPSILON__ 1E-15DD 2025-05-07T19:44:46.6910821Z #define __ORDER_PDP_ENDIAN__ 3412 2025-05-07T19:44:46.6911132Z #define __DEC128_MIN_EXP__ (-6142) 2025-05-07T19:44:46.6911473Z #define __INT_FAST32_TYPE__ long int 2025-05-07T19:44:46.6911821Z #define __UINT_LEAST16_TYPE__ short unsigned int 2025-05-07T19:44:46.6912296Z 
#define unix 1 2025-05-07T19:44:46.6912534Z #define __SIZE_TYPE__ long unsigned int 2025-05-07T19:44:46.6912884Z #define __UINT64_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:46.6913229Z #define __FLT_IS_IEC_60559__ 2 2025-05-07T19:44:46.6913554Z #define __GNUC_WIDE_EXECUTION_CHARSET_NAME "UTF-32LE" 2025-05-07T19:44:46.6913926Z #define __FLT64X_DIG__ 18 2025-05-07T19:44:46.6914188Z #define __INT8_TYPE__ signed char 2025-05-07T19:44:46.6914495Z #define __ELF__ 1 2025-05-07T19:44:46.6914734Z #define __GCC_ASM_FLAG_OUTPUTS__ 1 2025-05-07T19:44:46.6915058Z #define __UINT32_TYPE__ unsigned int 2025-05-07T19:44:46.6915343Z #define __FLT_RADIX__ 2 2025-05-07T19:44:46.6915627Z #define __INT_LEAST16_TYPE__ short int 2025-05-07T19:44:46.6915994Z #define __LDBL_EPSILON__ 1.08420217248550443400745280086994171e-19L 2025-05-07T19:44:46.6916405Z #define __UINTMAX_C(c) c ## UL 2025-05-07T19:44:46.6916696Z #define __SSE_MATH__ 1 2025-05-07T19:44:46.6916928Z #define __k8 1 2025-05-07T19:44:46.6917258Z #define __FLT32X_MIN__ 2.22507385850720138309023271733240406e-308F32x 2025-05-07T19:44:46.6917644Z #define __SIG_ATOMIC_MAX__ 0x7fffffff 2025-05-07T19:44:46.6917977Z #define __GCC_ATOMIC_WCHAR_T_LOCK_FREE 2 2025-05-07T19:44:46.6918295Z #define __SIZEOF_PTRDIFF_T__ 8 2025-05-07T19:44:46.6918589Z #define __LDBL_DIG__ 18 2025-05-07T19:44:46.6918841Z #define __FLT64_IS_IEC_60559__ 2 2025-05-07T19:44:46.6919132Z #define __x86_64__ 1 2025-05-07T19:44:46.6919374Z #define __FLT32X_MIN_EXP__ (-1021) 2025-05-07T19:44:46.6919712Z #define __DEC32_SUBNORMAL_MIN__ 0.000001E-95DF 2025-05-07T19:44:46.6920088Z #define __INT_FAST16_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:46.6920447Z #define __FLT64_DIG__ 15 2025-05-07T19:44:46.6920740Z #define __UINT_FAST32_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:46.6921137Z #define __UINT_LEAST64_TYPE__ long unsigned int 2025-05-07T19:44:46.6921478Z #define __FLT_HAS_QUIET_NAN__ 1 2025-05-07T19:44:46.6921796Z #define __FLT_MAX_10_EXP__ 38 
2025-05-07T19:44:46.6922094Z #define __LONG_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:46.6922444Z #define __FLT64X_HAS_DENORM__ 1 2025-05-07T19:44:46.6922825Z #define __DEC128_SUBNORMAL_MIN__ 0.000000000000000000000000000000001E-6143DL 2025-05-07T19:44:46.6923285Z #define __FLT_HAS_INFINITY__ 1 2025-05-07T19:44:46.6923655Z #define __GNUC_EXECUTION_CHARSET_NAME "UTF-8" 2025-05-07T19:44:46.6924043Z #define __UINT_FAST16_TYPE__ long unsigned int 2025-05-07T19:44:46.6924418Z #define __DEC64_MAX__ 9.999999999999999E384DD 2025-05-07T19:44:46.6924737Z #define __INT_FAST32_WIDTH__ 64 2025-05-07T19:44:46.6925071Z #define __CHAR16_TYPE__ short unsigned int 2025-05-07T19:44:46.6925401Z #define __PRAGMA_REDEFINE_EXTNAME 1 2025-05-07T19:44:46.6925731Z #define __SIZE_WIDTH__ 64 2025-05-07T19:44:46.6925993Z #define __SEG_FS 1 2025-05-07T19:44:46.6926270Z #define __INT_LEAST16_MAX__ 0x7fff 2025-05-07T19:44:46.6926629Z #define __DEC64_MANT_DIG__ 16 2025-05-07T19:44:46.6926954Z #define __INT64_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:46.6927290Z #define __SEG_GS 1 2025-05-07T19:44:46.6927616Z #define __FLT32_DENORM_MIN__ 1.40129846432481707092372958328991613e-45F32 2025-05-07T19:44:46.6928045Z #define __SIG_ATOMIC_WIDTH__ 32 2025-05-07T19:44:46.6928331Z #define __INT_LEAST64_TYPE__ long int 2025-05-07T19:44:46.6928658Z #define __INT16_TYPE__ short int 2025-05-07T19:44:46.6928947Z #define __INT_LEAST8_TYPE__ signed char 2025-05-07T19:44:46.6929275Z #define __STDC_VERSION__ 201710L 2025-05-07T19:44:46.6929547Z #define __SIZEOF_INT__ 4 2025-05-07T19:44:46.6929828Z #define __DEC32_MAX_EXP__ 97 2025-05-07T19:44:46.6930099Z #define __INT_FAST8_MAX__ 0x7f 2025-05-07T19:44:46.6930476Z #define __FLT128_MAX__ 1.18973149535723176508575932662800702e+4932F128 2025-05-07T19:44:46.6930899Z #define __INTPTR_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:46.6931198Z #define linux 1 2025-05-07T19:44:46.6931459Z #define __FLT64_HAS_QUIET_NAN__ 1 2025-05-07T19:44:46.6931746Z #define __FLT32_MIN_10_EXP__ 
(-37) 2025-05-07T19:44:46.6932050Z #define __FLT32X_DIG__ 15 2025-05-07T19:44:46.6932312Z #define __PTRDIFF_WIDTH__ 64 2025-05-07T19:44:46.6932607Z #define __LDBL_MANT_DIG__ 64 2025-05-07T19:44:46.6932880Z #define __FLT64_HAS_INFINITY__ 1 2025-05-07T19:44:46.6933266Z #define __FLT64X_MAX__ 1.18973149535723176502126385303097021e+4932F64x 2025-05-07T19:44:46.6933733Z #define __SIG_ATOMIC_MIN__ (-__SIG_ATOMIC_MAX__ - 1) 2025-05-07T19:44:46.6934084Z #define __code_model_small__ 1 2025-05-07T19:44:46.6934404Z #define __GCC_ATOMIC_LONG_LOCK_FREE 2 2025-05-07T19:44:46.6934702Z #define __DEC32_MANT_DIG__ 7 2025-05-07T19:44:46.6934990Z #define __k8__ 1 2025-05-07T19:44:46.6935227Z #define __INTPTR_TYPE__ long int 2025-05-07T19:44:46.6935559Z #define __UINT16_TYPE__ short unsigned int 2025-05-07T19:44:46.6935868Z #define __WCHAR_TYPE__ int 2025-05-07T19:44:46.6936146Z #define __pic__ 2 2025-05-07T19:44:46.6936417Z #define __UINTPTR_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:46.6936765Z #define __INT_FAST64_WIDTH__ 64 2025-05-07T19:44:46.6937095Z #define __INT_FAST64_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:46.6937439Z #define __GCC_ATOMIC_TEST_AND_SET_TRUEVAL 1 2025-05-07T19:44:46.6937850Z #define __FLT_NORM_MAX__ 3.40282346638528859811704183484516925e+38F 2025-05-07T19:44:46.6938229Z #define __FLT32_HAS_INFINITY__ 1 2025-05-07T19:44:46.6938543Z #define __FLT64X_MAX_EXP__ 16384 2025-05-07T19:44:46.6938851Z #define __UINT_FAST64_TYPE__ long unsigned int 2025-05-07T19:44:46.6939202Z #define __INT_MAX__ 0x7fffffff 2025-05-07T19:44:46.6939466Z #define __linux__ 1 2025-05-07T19:44:46.6939807Z #define __INT64_TYPE__ long int 2025-05-07T19:44:46.6940260Z #define __FLT_MAX_EXP__ 128 2025-05-07T19:44:46.6940587Z #define __ORDER_BIG_ENDIAN__ 4321 2025-05-07T19:44:46.6940971Z #define __DBL_MANT_DIG__ 53 2025-05-07T19:44:46.6941257Z #define __SIZEOF_FLOAT128__ 16 2025-05-07T19:44:46.6941619Z #define __INT_LEAST64_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:46.6941996Z #define 
__GCC_ATOMIC_CHAR16_T_LOCK_FREE 2 2025-05-07T19:44:46.6942356Z #define __DEC64_MIN__ 1E-383DD 2025-05-07T19:44:46.6942653Z #define __WINT_TYPE__ unsigned int 2025-05-07T19:44:46.6943009Z #define __UINT_LEAST32_TYPE__ unsigned int 2025-05-07T19:44:46.6943340Z #define __SIZEOF_SHORT__ 2 2025-05-07T19:44:46.6943731Z #define __FLT32_NORM_MAX__ 3.40282346638528859811704183484516925e+38F32 2025-05-07T19:44:46.6944233Z #define __SSE__ 1 2025-05-07T19:44:46.6944492Z #define __LDBL_MIN_EXP__ (-16381) 2025-05-07T19:44:46.6944898Z #define __FLT64_MAX__ 1.79769313486231570814527423731704357e+308F64 2025-05-07T19:44:46.6945286Z #define __amd64__ 1 2025-05-07T19:44:46.6945566Z #define __WINT_WIDTH__ 32 2025-05-07T19:44:46.6945842Z #define __INT_LEAST8_MAX__ 0x7f 2025-05-07T19:44:46.6946170Z #define __INT_LEAST64_WIDTH__ 64 2025-05-07T19:44:46.6946469Z #define __LDBL_MAX_EXP__ 16384 2025-05-07T19:44:46.6946794Z #define __FLT32X_MAX_10_EXP__ 308 2025-05-07T19:44:46.6947154Z #define __SIZEOF_INT128__ 16 2025-05-07T19:44:46.6947464Z #define __FLT64X_IS_IEC_60559__ 2 2025-05-07T19:44:46.6947790Z #define __LDBL_MAX_10_EXP__ 4932 2025-05-07T19:44:46.6948085Z #define __ATOMIC_RELAXED 0 2025-05-07T19:44:46.6948501Z #define __DBL_EPSILON__ ((double)2.22044604925031308084726333618164062e-16L) 2025-05-07T19:44:46.6949023Z #define __FLT128_MIN__ 3.36210314311209350626267781732175260e-4932F128 2025-05-07T19:44:46.6949457Z #define _LP64 1 2025-05-07T19:44:46.6949697Z #define __UINT8_C(c) c 2025-05-07T19:44:46.6949992Z #define __FLT64_MAX_EXP__ 1024 2025-05-07T19:44:46.6950293Z #define __INT_LEAST32_TYPE__ int 2025-05-07T19:44:46.6950676Z #define __SIZEOF_WCHAR_T__ 4 2025-05-07T19:44:46.6950981Z #define __UINT64_TYPE__ long unsigned int 2025-05-07T19:44:46.6951354Z #define __GNUC_PATCHLEVEL__ 0 2025-05-07T19:44:46.6951781Z #define __FLT128_NORM_MAX__ 1.18973149535723176508575932662800702e+4932F128 2025-05-07T19:44:46.6952399Z #define __FLT64_NORM_MAX__ 
1.79769313486231570814527423731704357e+308F64 2025-05-07T19:44:46.6952781Z #define __FLT128_HAS_QUIET_NAN__ 1 2025-05-07T19:44:46.6953062Z #define __INTMAX_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:46.6953377Z #define __INT_FAST8_TYPE__ signed char 2025-05-07T19:44:46.6953729Z #define __FLT64X_MIN__ 3.36210314311209350626267781732175260e-4932F64x 2025-05-07T19:44:46.6954104Z #define __GNUC_STDC_INLINE__ 1 2025-05-07T19:44:46.6954371Z #define __FLT64_HAS_DENORM__ 1 2025-05-07T19:44:46.6954699Z #define __FLT32_EPSILON__ 1.19209289550781250000000000000000000e-7F32 2025-05-07T19:44:46.6955074Z #define __DBL_DECIMAL_DIG__ 17 2025-05-07T19:44:46.6955319Z #define __STDC_UTF_32__ 1 2025-05-07T19:44:46.6955572Z #define __INT_FAST8_WIDTH__ 8 2025-05-07T19:44:46.6955812Z #define __FXSR__ 1 2025-05-07T19:44:46.6956118Z #define __FLT32X_MAX__ 1.79769313486231570814527423731704357e+308F32x 2025-05-07T19:44:46.6956567Z #define __DBL_NORM_MAX__ ((double)1.79769313486231570814527423731704357e+308L) 2025-05-07T19:44:46.6956995Z #define __BYTE_ORDER__ __ORDER_LITTLE_ENDIAN__ 2025-05-07T19:44:46.6957316Z #define __INTMAX_WIDTH__ 64 2025-05-07T19:44:46.6957562Z #define __UINT32_C(c) c ## U 2025-05-07T19:44:46.6957900Z #define __FLT_DENORM_MIN__ 1.40129846432481707092372958328991613e-45F 2025-05-07T19:44:46.6958254Z #define __INT8_MAX__ 0x7f 2025-05-07T19:44:46.6958506Z #define __LONG_WIDTH__ 64 2025-05-07T19:44:46.6958736Z #define __PIC__ 2 2025-05-07T19:44:46.6959015Z #define __UINT_FAST32_TYPE__ long unsigned int 2025-05-07T19:44:46.6959447Z #define __FLT32X_NORM_MAX__ 1.79769313486231570814527423731704357e+308F32x 2025-05-07T19:44:46.6959896Z #define __CHAR32_TYPE__ unsigned int 2025-05-07T19:44:46.6960228Z #define __FLT_MAX__ 3.40282346638528859811704183484516925e+38F 2025-05-07T19:44:46.6960568Z #define __SSE2__ 1 2025-05-07T19:44:46.6960832Z #define __INT32_TYPE__ int 2025-05-07T19:44:46.6961099Z #define __SIZEOF_DOUBLE__ 8 2025-05-07T19:44:46.6961407Z #define __FLT_MIN_10_EXP__ 
(-37) 2025-05-07T19:44:46.6961751Z #define __FLT64_MIN__ 2.22507385850720138309023271733240406e-308F64 2025-05-07T19:44:46.6962157Z #define __INT_LEAST32_WIDTH__ 32 2025-05-07T19:44:46.6962436Z #define __INTMAX_TYPE__ long int 2025-05-07T19:44:46.6962738Z #define __DEC128_MAX_EXP__ 6145 2025-05-07T19:44:46.6963016Z #define __FLT32X_HAS_QUIET_NAN__ 1 2025-05-07T19:44:46.6963327Z #define __ATOMIC_CONSUME 1 2025-05-07T19:44:46.6963586Z #define __GNUC_MINOR__ 4 2025-05-07T19:44:46.6963869Z #define __INT_FAST16_WIDTH__ 64 2025-05-07T19:44:46.6964249Z #define __UINTMAX_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:46.6964536Z #define __PIE__ 2 2025-05-07T19:44:46.6964881Z #define __FLT32X_DENORM_MIN__ 4.94065645841246544176568792868221372e-324F32x 2025-05-07T19:44:46.6965262Z #define __DBL_MAX_10_EXP__ 308 2025-05-07T19:44:46.6965612Z #define __LDBL_DENORM_MIN__ 3.64519953188247460252840593361941982e-4951L 2025-05-07T19:44:46.6965970Z #define __INT16_C(c) c 2025-05-07T19:44:46.6966200Z #define __STDC__ 1 2025-05-07T19:44:46.6966418Z #define __PTRDIFF_TYPE__ long int 2025-05-07T19:44:46.6966694Z #define __ATOMIC_SEQ_CST 5 2025-05-07T19:44:46.6967013Z #define __FLT32X_MIN_10_EXP__ (-307) 2025-05-07T19:44:46.6967303Z #define __UINTPTR_TYPE__ long unsigned int 2025-05-07T19:44:46.6967659Z #define __DEC64_SUBNORMAL_MIN__ 0.000000000000001E-383DD 2025-05-07T19:44:46.6967979Z #define __DEC128_MANT_DIG__ 34 2025-05-07T19:44:46.6968271Z #define __LDBL_MIN_10_EXP__ (-4931) 2025-05-07T19:44:46.6968567Z #define __SIZEOF_LONG_LONG__ 8 2025-05-07T19:44:46.6968880Z #define __FLT128_DECIMAL_DIG__ 36 2025-05-07T19:44:46.6969179Z #define __GCC_ATOMIC_LLONG_LOCK_FREE 2 2025-05-07T19:44:46.6969513Z #define __FLT32_HAS_QUIET_NAN__ 1 2025-05-07T19:44:46.6969799Z #define __FLT_DECIMAL_DIG__ 9 2025-05-07T19:44:46.6970134Z #define __UINT_FAST16_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:46.6970573Z #define __LDBL_NORM_MAX__ 1.18973149535723176502126385303097021e+4932L 2025-05-07T19:44:46.6970967Z 
#define __GCC_ATOMIC_SHORT_LOCK_FREE 2 2025-05-07T19:44:46.6971316Z #define __UINT_FAST8_TYPE__ unsigned char 2025-05-07T19:44:46.6971633Z #define __ATOMIC_ACQ_REL 4 2025-05-07T19:44:46.6971928Z #define __ATOMIC_RELEASE 3 2025-05-07T19:44:46.6972096Z 2025-05-07T19:44:46.7601287Z 2025-05-07T19:44:46.7602495Z [INFO] Printing out all preprocessor defines in the C++ compiler ... 2025-05-07T19:44:46.7604001Z + conda run -n build_binary c++ -dM -E -x c++ - 2025-05-07T19:44:46.7604730Z 2025-05-07T19:44:48.5727453Z #define __DBL_MIN_EXP__ (-1021) 2025-05-07T19:44:48.5727904Z #define __cpp_attributes 200809L 2025-05-07T19:44:48.5728490Z #define __cpp_nontype_template_parameter_auto 201606L 2025-05-07T19:44:48.5728899Z #define __UINT_LEAST16_MAX__ 0xffff 2025-05-07T19:44:48.5729255Z #define __ATOMIC_ACQUIRE 2 2025-05-07T19:44:48.5729539Z #define __FLT128_MAX_10_EXP__ 4932 2025-05-07T19:44:48.5729935Z #define __FLT_MIN__ 1.17549435082228750796873653722224568e-38F 2025-05-07T19:44:48.5730329Z #define __GCC_IEC_559_COMPLEX 2 2025-05-07T19:44:48.5730661Z #define __cpp_aggregate_nsdmi 201304L 2025-05-07T19:44:48.5731148Z #define __UINT_LEAST8_TYPE__ unsigned char 2025-05-07T19:44:48.5731514Z #define __SIZEOF_FLOAT80__ 16 2025-05-07T19:44:48.5731830Z #define __INTMAX_C(c) c ## L 2025-05-07T19:44:48.5732114Z #define __CHAR_BIT__ 8 2025-05-07T19:44:48.5732399Z #define __UINT8_MAX__ 0xff 2025-05-07T19:44:48.5732674Z #define __SCHAR_WIDTH__ 8 2025-05-07T19:44:48.5732988Z #define __WINT_MAX__ 0xffffffffU 2025-05-07T19:44:48.5733281Z #define __FLT32_MIN_EXP__ (-125) 2025-05-07T19:44:48.5733616Z #define __cpp_static_assert 201411L 2025-05-07T19:44:48.5733958Z #define __ORDER_LITTLE_ENDIAN__ 1234 2025-05-07T19:44:48.5734290Z #define __SIZE_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:48.5734650Z #define __WCHAR_MAX__ 0x7fffffff 2025-05-07T19:44:48.5734974Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_1 1 2025-05-07T19:44:48.5735365Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_2 1 
2025-05-07T19:44:48.5735724Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_4 1 2025-05-07T19:44:48.5736192Z #define __DBL_DENORM_MIN__ double(4.94065645841246544176568792868221372e-324L) 2025-05-07T19:44:48.5736671Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_8 1 2025-05-07T19:44:48.5737051Z #define __GCC_ATOMIC_CHAR_LOCK_FREE 2 2025-05-07T19:44:48.5737389Z #define __GCC_IEC_559 2 2025-05-07T19:44:48.5737663Z #define __FLT32X_DECIMAL_DIG__ 17 2025-05-07T19:44:48.5737989Z #define __FLT_EVAL_METHOD__ 0 2025-05-07T19:44:48.5738321Z #define __cpp_binary_literals 201304L 2025-05-07T19:44:48.5738673Z #define __FLT64_DECIMAL_DIG__ 17 2025-05-07T19:44:48.5739269Z #define __cpp_noexcept_function_type 201510L 2025-05-07T19:44:48.5739806Z #define __GCC_ATOMIC_CHAR32_T_LOCK_FREE 2 2025-05-07T19:44:48.5740159Z #define __cpp_variadic_templates 200704L 2025-05-07T19:44:48.5740566Z #define __UINT_FAST64_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:48.5740937Z #define __SIG_ATOMIC_TYPE__ int 2025-05-07T19:44:48.5741277Z #define __DBL_MIN_10_EXP__ (-307) 2025-05-07T19:44:48.5741627Z #define __FINITE_MATH_ONLY__ 0 2025-05-07T19:44:48.5741941Z #define __cpp_variable_templates 201304L 2025-05-07T19:44:48.5742315Z #define __FLT32X_MAX_EXP__ 1024 2025-05-07T19:44:48.5742742Z #define __FLT32_HAS_DENORM__ 1 2025-05-07T19:44:48.5743069Z #define __UINT_FAST8_MAX__ 0xff 2025-05-07T19:44:48.5743379Z #define __cpp_rvalue_reference 200610L 2025-05-07T19:44:48.5743782Z #define __cpp_nested_namespace_definitions 201411L 2025-05-07T19:44:48.5744157Z #define __DEC64_MAX_EXP__ 385 2025-05-07T19:44:48.5744474Z #define __INT8_C(c) c 2025-05-07T19:44:48.5744742Z #define __INT_LEAST8_WIDTH__ 8 2025-05-07T19:44:48.5745074Z #define __cpp_variadic_using 201611L 2025-05-07T19:44:48.5745453Z #define __UINT_LEAST64_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:48.5745820Z #define __INT_LEAST8_MAX__ 0x7f 2025-05-07T19:44:48.5746255Z #define __cpp_capture_star_this 201603L 2025-05-07T19:44:48.5746565Z #define __SHRT_MAX__ 
0x7fff 2025-05-07T19:44:48.5746918Z #define __LDBL_MAX__ 1.18973149535723176502126385303097021e+4932L 2025-05-07T19:44:48.5747288Z #define __FLT64X_MAX_10_EXP__ 4932 2025-05-07T19:44:48.5747604Z #define __cpp_if_constexpr 201606L 2025-05-07T19:44:48.5747895Z #define __LDBL_IS_IEC_60559__ 2 2025-05-07T19:44:48.5748189Z #define __FLT64X_HAS_QUIET_NAN__ 1 2025-05-07T19:44:48.5748503Z #define __UINT_LEAST8_MAX__ 0xff 2025-05-07T19:44:48.5748787Z #define __GCC_ATOMIC_BOOL_LOCK_FREE 2 2025-05-07T19:44:48.5749213Z #define __FLT128_DENORM_MIN__ 6.47517511943802511092443895822764655e-4966F128 2025-05-07T19:44:48.5749643Z #define __UINTMAX_TYPE__ long unsigned int 2025-05-07T19:44:48.5749971Z #define __linux 1 2025-05-07T19:44:48.5750207Z #define __DEC32_EPSILON__ 1E-6DF 2025-05-07T19:44:48.5750524Z #define __FLT_EVAL_METHOD_TS_18661_3__ 0 2025-05-07T19:44:48.5750814Z #define __unix 1 2025-05-07T19:44:48.5751075Z #define __UINT32_MAX__ 0xffffffffU 2025-05-07T19:44:48.5751370Z #define __GXX_EXPERIMENTAL_CXX0X__ 1 2025-05-07T19:44:48.5751690Z #define __FLT128_MIN_EXP__ (-16381) 2025-05-07T19:44:48.5752001Z #define __WINT_MIN__ 0U 2025-05-07T19:44:48.5752263Z #define __FLT128_MIN_10_EXP__ (-4931) 2025-05-07T19:44:48.5752584Z #define __FLT32X_IS_IEC_60559__ 2 2025-05-07T19:44:48.5752873Z #define __INT_LEAST16_WIDTH__ 16 2025-05-07T19:44:48.5753176Z #define __SCHAR_MAX__ 0x7f 2025-05-07T19:44:48.5753436Z #define __FLT128_MANT_DIG__ 113 2025-05-07T19:44:48.5753754Z #define __WCHAR_MIN__ (-__WCHAR_MAX__ - 1) 2025-05-07T19:44:48.5754061Z #define __INT64_C(c) c ## L 2025-05-07T19:44:48.5754364Z #define __GCC_ATOMIC_POINTER_LOCK_FREE 2 2025-05-07T19:44:48.5754671Z #define __FLT32X_MANT_DIG__ 53 2025-05-07T19:44:48.5754983Z #define __GCC_ATOMIC_CHAR16_T_LOCK_FREE 2 2025-05-07T19:44:48.5755318Z #define __cpp_aligned_new 201606L 2025-05-07T19:44:48.5755605Z #define __USER_LABEL_PREFIX__ 2025-05-07T19:44:48.5755899Z #define __FLT32_MAX_10_EXP__ 38 2025-05-07T19:44:48.5756250Z #define 
__FLT64X_EPSILON__ 1.08420217248550443400745280086994171e-19F64x 2025-05-07T19:44:48.5756636Z #define __STDC_HOSTED__ 1 2025-05-07T19:44:48.5756877Z #define __DEC64_MIN_EXP__ (-382) 2025-05-07T19:44:48.5757158Z #define __cpp_decltype_auto 201304L 2025-05-07T19:44:48.5757423Z #define __DBL_DIG__ 15 2025-05-07T19:44:48.5757669Z #define __FLT32_DIG__ 6 2025-05-07T19:44:48.5758010Z #define __FLT_EPSILON__ 1.19209289550781250000000000000000000e-7F 2025-05-07T19:44:48.5758366Z #define __GXX_WEAK__ 1 2025-05-07T19:44:48.5758629Z #define __SHRT_WIDTH__ 16 2025-05-07T19:44:48.5758890Z #define __FLT32_IS_IEC_60559__ 2 2025-05-07T19:44:48.5759246Z #define __LDBL_MIN__ 3.36210314311209350626267781732175260e-4932L 2025-05-07T19:44:48.5762383Z #define __DBL_IS_IEC_60559__ 2 2025-05-07T19:44:48.5762768Z #define __DEC32_MAX__ 9.999999E96DF 2025-05-07T19:44:48.5763085Z #define __cpp_threadsafe_static_init 200806L 2025-05-07T19:44:48.5763453Z #define __cpp_enumerator_attributes 201411L 2025-05-07T19:44:48.5763878Z #define __FLT64X_DENORM_MIN__ 3.64519953188247460252840593361941982e-4951F64x 2025-05-07T19:44:48.5764321Z #define __FLT32X_HAS_INFINITY__ 1 2025-05-07T19:44:48.5764635Z #define __INT32_MAX__ 0x7fffffff 2025-05-07T19:44:48.5764903Z #define __unix__ 1 2025-05-07T19:44:48.5765159Z #define __INT_WIDTH__ 32 2025-05-07T19:44:48.5765493Z #define __SIZEOF_LONG__ 8 2025-05-07T19:44:48.5765774Z #define __STDC_IEC_559__ 1 2025-05-07T19:44:48.5766036Z #define __STDC_ISO_10646__ 201103L 2025-05-07T19:44:48.5766339Z #define __UINT16_C(c) c 2025-05-07T19:44:48.5766582Z #define __DECIMAL_DIG__ 21 2025-05-07T19:44:48.5766866Z #define __STDC_IEC_559_COMPLEX__ 1 2025-05-07T19:44:48.5767228Z #define __FLT64_EPSILON__ 2.22044604925031308084726333618164062e-16F64 2025-05-07T19:44:48.5767636Z #define __gnu_linux__ 1 2025-05-07T19:44:48.5767910Z #define __INT16_MAX__ 0x7fff 2025-05-07T19:44:48.5768177Z #define __FLT64_MIN_EXP__ (-1021) 2025-05-07T19:44:48.5768485Z #define __FLT64X_MIN_10_EXP__ 
(-4931) 2025-05-07T19:44:48.5768781Z #define __LDBL_HAS_QUIET_NAN__ 1 2025-05-07T19:44:48.5769082Z #define __FLT64_MANT_DIG__ 53 2025-05-07T19:44:48.5769351Z #define __FLT64X_MANT_DIG__ 64 2025-05-07T19:44:48.5769637Z #define __GNUC__ 11 2025-05-07T19:44:48.5769868Z #define __GXX_RTTI 1 2025-05-07T19:44:48.5770135Z #define __pie__ 2 2025-05-07T19:44:48.5770363Z #define __MMX__ 1 2025-05-07T19:44:48.5770625Z #define __FLT_HAS_DENORM__ 1 2025-05-07T19:44:48.5770932Z #define __SIZEOF_LONG_DOUBLE__ 16 2025-05-07T19:44:48.5771220Z #define __BIGGEST_ALIGNMENT__ 16 2025-05-07T19:44:48.5771518Z #define __STDC_UTF_16__ 1 2025-05-07T19:44:48.5771774Z #define __FLT64_MAX_10_EXP__ 308 2025-05-07T19:44:48.5772103Z #define __cpp_delegating_constructors 200604L 2025-05-07T19:44:48.5772441Z #define __FLT32_HAS_INFINITY__ 1 2025-05-07T19:44:48.5772817Z #define __DBL_MAX__ double(1.79769313486231570814527423731704357e+308L) 2025-05-07T19:44:48.5773196Z #define __cpp_raw_strings 200710L 2025-05-07T19:44:48.5773536Z #define __INT_FAST32_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:48.5773892Z #define __DBL_HAS_INFINITY__ 1 2025-05-07T19:44:48.5774172Z #define __SIZEOF_FLOAT__ 4 2025-05-07T19:44:48.5774479Z #define __HAVE_SPECULATION_SAFE_VALUE 1 2025-05-07T19:44:48.5774793Z #define __cpp_fold_expressions 201603L 2025-05-07T19:44:48.5775130Z #define __DEC32_MIN_EXP__ (-94) 2025-05-07T19:44:48.5775416Z #define __INTPTR_WIDTH__ 64 2025-05-07T19:44:48.5775722Z #define __FLT64X_HAS_INFINITY__ 1 2025-05-07T19:44:48.5776019Z #define __UINT_LEAST32_MAX__ 0xffffffffU 2025-05-07T19:44:48.5776359Z #define __FLT32X_HAS_DENORM__ 1 2025-05-07T19:44:48.5776646Z #define __INT_FAST16_TYPE__ long int 2025-05-07T19:44:48.5776974Z #define __MMX_WITH_SSE__ 1 2025-05-07T19:44:48.5777269Z #define __LDBL_HAS_DENORM__ 1 2025-05-07T19:44:48.5777554Z #define __cplusplus 201703L 2025-05-07T19:44:48.5777864Z #define __cpp_ref_qualifiers 200710L 2025-05-07T19:44:48.5778167Z #define __DEC32_MIN__ 1E-95DF 
2025-05-07T19:44:48.5778472Z #define __DEPRECATED 1 2025-05-07T19:44:48.5778739Z #define __cpp_rvalue_references 200610L 2025-05-07T19:44:48.5779082Z #define __DBL_MAX_EXP__ 1024 2025-05-07T19:44:48.5779356Z #define __WCHAR_WIDTH__ 32 2025-05-07T19:44:48.5779811Z #define __FLT32_MAX__ 3.40282346638528859811704183484516925e+38F32 2025-05-07T19:44:48.5780376Z #define __DEC128_EPSILON__ 1E-33DL 2025-05-07T19:44:48.5780717Z #define __SSE2_MATH__ 1 2025-05-07T19:44:48.5781062Z #define __ATOMIC_HLE_RELEASE 131072 2025-05-07T19:44:48.5781393Z #define __PTRDIFF_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:48.5781751Z #define __amd64 1 2025-05-07T19:44:48.5782508Z #define __STDC_NO_THREADS__ 1 2025-05-07T19:44:48.5782822Z #define __ATOMIC_HLE_ACQUIRE 65536 2025-05-07T19:44:48.5783116Z #define __GNUG__ 11 2025-05-07T19:44:48.5783567Z #define __LONG_LONG_MAX__ 0x7fffffffffffffffLL 2025-05-07T19:44:48.5783910Z #define __SIZEOF_SIZE_T__ 8 2025-05-07T19:44:48.5784216Z #define __cpp_nsdmi 200809L 2025-05-07T19:44:48.5784500Z #define __FLT64X_MIN_EXP__ (-16381) 2025-05-07T19:44:48.5784830Z #define __SIZEOF_WINT_T__ 4 2025-05-07T19:44:48.5785145Z #define __LONG_LONG_WIDTH__ 64 2025-05-07T19:44:48.5785448Z #define __cpp_initializer_lists 200806L 2025-05-07T19:44:48.5785797Z #define __FLT32_MAX_EXP__ 128 2025-05-07T19:44:48.5786092Z #define __cpp_hex_float 201603L 2025-05-07T19:44:48.5786421Z #define __GXX_ABI_VERSION 1016 2025-05-07T19:44:48.5786805Z #define __FLT128_HAS_INFINITY__ 1 2025-05-07T19:44:48.5787137Z #define __FLT_MIN_EXP__ (-125) 2025-05-07T19:44:48.5787434Z #define __GCC_HAVE_DWARF2_CFI_ASM 1 2025-05-07T19:44:48.5787763Z #define __x86_64 1 2025-05-07T19:44:48.5788019Z #define __cpp_lambdas 200907L 2025-05-07T19:44:48.5788346Z #define __INT_FAST64_TYPE__ long int 2025-05-07T19:44:48.5788778Z #define __FLT64_DENORM_MIN__ 4.94065645841246544176568792868221372e-324F64 2025-05-07T19:44:48.5789216Z #define __cpp_template_auto 201606L 2025-05-07T19:44:48.5789632Z #define 
__DBL_MIN__ double(2.22507385850720138309023271733240406e-308L) 2025-05-07T19:44:48.5790134Z #define __FLT128_EPSILON__ 1.92592994438723585305597794258492732e-34F128 2025-05-07T19:44:48.5790669Z #define __FLT64X_NORM_MAX__ 1.18973149535723176502126385303097021e+4932F64x 2025-05-07T19:44:48.5791102Z #define __SIZEOF_POINTER__ 8 2025-05-07T19:44:48.5791408Z #define __LP64__ 1 2025-05-07T19:44:48.5791660Z #define __DBL_HAS_QUIET_NAN__ 1 2025-05-07T19:44:48.5792076Z #define __FLT32X_EPSILON__ 2.22044604925031308084726333618164062e-16F32x 2025-05-07T19:44:48.5792527Z #define __DECIMAL_BID_FORMAT__ 1 2025-05-07T19:44:48.5792830Z #define __FLT64_MIN_10_EXP__ (-307) 2025-05-07T19:44:48.5793165Z #define __FLT64X_DECIMAL_DIG__ 21 2025-05-07T19:44:48.5793468Z #define __DEC128_MIN__ 1E-6143DL 2025-05-07T19:44:48.5793786Z #define __REGISTER_PREFIX__ 2025-05-07T19:44:48.5794077Z #define __UINT16_MAX__ 0xffff 2025-05-07T19:44:48.5794498Z #define __LDBL_HAS_INFINITY__ 1 2025-05-07T19:44:48.5794831Z #define __FLT32_MIN__ 1.17549435082228750796873653722224568e-38F32 2025-05-07T19:44:48.5795228Z #define __UINT8_TYPE__ unsigned char 2025-05-07T19:44:48.5795535Z #define __FLT_DIG__ 6 2025-05-07T19:44:48.5795770Z #define __NO_INLINE__ 1 2025-05-07T19:44:48.5796042Z #define __DEC_EVAL_METHOD__ 2 2025-05-07T19:44:48.5796358Z #define __DEC128_MAX__ 9.999999999999999999999999999999999E6144DL 2025-05-07T19:44:48.5796913Z #define __FLT_MANT_DIG__ 24 2025-05-07T19:44:48.5797246Z #define __LDBL_DECIMAL_DIG__ 21 2025-05-07T19:44:48.5797563Z #define __VERSION__ "11.4.0" 2025-05-07T19:44:48.5797837Z #define __UINT64_C(c) c ## UL 2025-05-07T19:44:48.5798158Z #define __cpp_unicode_characters 201411L 2025-05-07T19:44:48.5798476Z #define _STDC_PREDEF_H 1 2025-05-07T19:44:48.5798772Z #define __INT_LEAST32_MAX__ 0x7fffffff 2025-05-07T19:44:48.5799113Z #define __GCC_ATOMIC_INT_LOCK_FREE 2 2025-05-07T19:44:48.5799411Z #define __FLT128_MAX_EXP__ 16384 2025-05-07T19:44:48.5799723Z #define __FLT32_MANT_DIG__ 
24 2025-05-07T19:44:48.5800042Z #define __FLOAT_WORD_ORDER__ __ORDER_LITTLE_ENDIAN__ 2025-05-07T19:44:48.5800429Z #define __cpp_aggregate_bases 201603L 2025-05-07T19:44:48.5800735Z #define __FLT128_HAS_DENORM__ 1 2025-05-07T19:44:48.5801041Z #define __FLT32_DECIMAL_DIG__ 9 2025-05-07T19:44:48.5801316Z #define __FLT128_DIG__ 33 2025-05-07T19:44:48.5801591Z #define __INT32_C(c) c 2025-05-07T19:44:48.5801849Z #define __DEC64_EPSILON__ 1E-15DD 2025-05-07T19:44:48.5802348Z #define __ORDER_PDP_ENDIAN__ 3412 2025-05-07T19:44:48.5802681Z #define __DEC128_MIN_EXP__ (-6142) 2025-05-07T19:44:48.5802988Z #define __INT_FAST32_TYPE__ long int 2025-05-07T19:44:48.5803463Z #define __UINT_LEAST16_TYPE__ short unsigned int 2025-05-07T19:44:48.5803799Z #define unix 1 2025-05-07T19:44:48.5804065Z #define __DBL_HAS_DENORM__ 1 2025-05-07T19:44:48.5804350Z #define __cpp_rtti 199711L 2025-05-07T19:44:48.5804660Z #define __SIZE_TYPE__ long unsigned int 2025-05-07T19:44:48.5805113Z #define __UINT64_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:48.5805481Z #define __FLT_IS_IEC_60559__ 2 2025-05-07T19:44:48.5805813Z #define __GNUC_WIDE_EXECUTION_CHARSET_NAME "UTF-32LE" 2025-05-07T19:44:48.5806197Z #define __FLT64X_DIG__ 18 2025-05-07T19:44:48.5806496Z #define __INT8_TYPE__ signed char 2025-05-07T19:44:48.5806806Z #define __cpp_digit_separators 201309L 2025-05-07T19:44:48.5807138Z #define __ELF__ 1 2025-05-07T19:44:48.5807384Z #define __GCC_ASM_FLAG_OUTPUTS__ 1 2025-05-07T19:44:48.5807715Z #define __UINT32_TYPE__ unsigned int 2025-05-07T19:44:48.5808196Z #define __FLT_RADIX__ 2 2025-05-07T19:44:48.5808490Z #define __INT_LEAST16_TYPE__ short int 2025-05-07T19:44:48.5808862Z #define __LDBL_EPSILON__ 1.08420217248550443400745280086994171e-19L 2025-05-07T19:44:48.5809277Z #define __UINTMAX_C(c) c ## UL 2025-05-07T19:44:48.5809598Z #define __GLIBCXX_BITSIZE_INT_N_0 128 2025-05-07T19:44:48.5809888Z #define __k8 1 2025-05-07T19:44:48.5810229Z #define __FLT32X_MIN__ 
2.22507385850720138309023271733240406e-308F32x 2025-05-07T19:44:48.5810623Z #define __SIG_ATOMIC_MAX__ 0x7fffffff 2025-05-07T19:44:48.5810966Z #define __GCC_ATOMIC_WCHAR_T_LOCK_FREE 2 2025-05-07T19:44:48.5811284Z #define __SIZEOF_PTRDIFF_T__ 8 2025-05-07T19:44:48.5811578Z #define __LDBL_DIG__ 18 2025-05-07T19:44:48.5811837Z #define __FLT64_IS_IEC_60559__ 2 2025-05-07T19:44:48.5812140Z #define __x86_64__ 1 2025-05-07T19:44:48.5812448Z #define __FLT32X_MIN_EXP__ (-1021) 2025-05-07T19:44:48.5812788Z #define __DEC32_SUBNORMAL_MIN__ 0.000001E-95DF 2025-05-07T19:44:48.5813171Z #define __INT_FAST16_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:48.5813502Z #define __FLT64_DIG__ 15 2025-05-07T19:44:48.5813827Z #define __UINT_FAST32_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:48.5814185Z #define __UINT_LEAST64_TYPE__ long unsigned int 2025-05-07T19:44:48.5814544Z #define __FLT_HAS_QUIET_NAN__ 1 2025-05-07T19:44:48.5814813Z #define __FLT_MAX_10_EXP__ 38 2025-05-07T19:44:48.5815123Z #define __LONG_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:48.5815438Z #define __FLT64X_HAS_DENORM__ 1 2025-05-07T19:44:48.5815833Z #define __DEC128_SUBNORMAL_MIN__ 0.000000000000000000000000000000001E-6143DL 2025-05-07T19:44:48.5816263Z #define __FLT_HAS_INFINITY__ 1 2025-05-07T19:44:48.5816560Z #define __GNUC_EXECUTION_CHARSET_NAME "UTF-8" 2025-05-07T19:44:48.5816911Z #define __cpp_unicode_literals 200710L 2025-05-07T19:44:48.5817231Z #define __UINT_FAST16_TYPE__ long unsigned int 2025-05-07T19:44:48.5817583Z #define __DEC64_MAX__ 9.999999999999999E384DD 2025-05-07T19:44:48.5817888Z #define __INT_FAST32_WIDTH__ 64 2025-05-07T19:44:48.5818198Z #define __CHAR16_TYPE__ short unsigned int 2025-05-07T19:44:48.5818694Z #define __PRAGMA_REDEFINE_EXTNAME 1 2025-05-07T19:44:48.5819014Z #define __SIZE_WIDTH__ 64 2025-05-07T19:44:48.5819271Z #define __SEG_FS 1 2025-05-07T19:44:48.5819619Z #define __INT_LEAST16_MAX__ 0x7fff 2025-05-07T19:44:48.5819958Z #define __DEC64_MANT_DIG__ 16 2025-05-07T19:44:48.5820434Z 
#define __INT64_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:48.5820783Z #define __SEG_GS 1 2025-05-07T19:44:48.5821203Z #define __FLT32_DENORM_MIN__ 1.40129846432481707092372958328991613e-45F32 2025-05-07T19:44:48.5821665Z #define __SIG_ATOMIC_WIDTH__ 32 2025-05-07T19:44:48.5822151Z #define __INT_LEAST64_TYPE__ long int 2025-05-07T19:44:48.5822497Z #define __INT16_TYPE__ short int 2025-05-07T19:44:48.5822799Z #define __INT_LEAST8_TYPE__ signed char 2025-05-07T19:44:48.5823166Z #define __cpp_structured_bindings 201606L 2025-05-07T19:44:48.5823519Z #define __SIZEOF_INT__ 4 2025-05-07T19:44:48.5823790Z #define __DEC32_MAX_EXP__ 97 2025-05-07T19:44:48.5824106Z #define __INT_FAST8_MAX__ 0x7f 2025-05-07T19:44:48.5824479Z #define __FLT128_MAX__ 1.18973149535723176508575932662800702e+4932F128 2025-05-07T19:44:48.5824933Z #define __INTPTR_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:48.5825283Z #define __cpp_sized_deallocation 201309L 2025-05-07T19:44:48.5825715Z #define __cpp_guaranteed_copy_elision 201606L 2025-05-07T19:44:48.5826080Z #define linux 1 2025-05-07T19:44:48.5826403Z #define __FLT64_HAS_QUIET_NAN__ 1 2025-05-07T19:44:48.5826742Z #define __FLT32_MIN_10_EXP__ (-37) 2025-05-07T19:44:48.5827075Z #define __EXCEPTIONS 1 2025-05-07T19:44:48.5827345Z #define __PTRDIFF_WIDTH__ 64 2025-05-07T19:44:48.5827666Z #define __LDBL_MANT_DIG__ 64 2025-05-07T19:44:48.5827960Z #define __cpp_range_based_for 201603L 2025-05-07T19:44:48.5828308Z #define __FLT64_HAS_INFINITY__ 1 2025-05-07T19:44:48.5828688Z #define __FLT64X_MAX__ 1.18973149535723176502126385303097021e+4932F64x 2025-05-07T19:44:48.5829141Z #define __STDCPP_DEFAULT_NEW_ALIGNMENT__ 16 2025-05-07T19:44:48.5829585Z #define __SIG_ATOMIC_MIN__ (-__SIG_ATOMIC_MAX__ - 1) 2025-05-07T19:44:48.5829970Z #define __code_model_small__ 1 2025-05-07T19:44:48.5830294Z #define __GCC_ATOMIC_LONG_LOCK_FREE 2 2025-05-07T19:44:48.5830632Z #define __cpp_nontype_template_args 201411L 2025-05-07T19:44:48.5830992Z #define __DEC32_MANT_DIG__ 7 
2025-05-07T19:44:48.5831298Z #define __cpp_return_type_deduction 201304L 2025-05-07T19:44:48.5831641Z #define __k8__ 1 2025-05-07T19:44:48.5831891Z #define __INTPTR_TYPE__ long int 2025-05-07T19:44:48.5832231Z #define __UINT16_TYPE__ short unsigned int 2025-05-07T19:44:48.5832661Z #define __WCHAR_TYPE__ int 2025-05-07T19:44:48.5832926Z #define __pic__ 2 2025-05-07T19:44:48.5833179Z #define __UINTPTR_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:48.5833521Z #define __INT_FAST64_WIDTH__ 64 2025-05-07T19:44:48.5833821Z #define __cpp_decltype 200707L 2025-05-07T19:44:48.5834114Z #define __INT_FAST64_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:48.5834472Z #define __GCC_ATOMIC_TEST_AND_SET_TRUEVAL 1 2025-05-07T19:44:48.5834848Z #define __FLT_NORM_MAX__ 3.40282346638528859811704183484516925e+38F 2025-05-07T19:44:48.5835240Z #define __FLT64X_MAX_EXP__ 16384 2025-05-07T19:44:48.5835539Z #define __UINT_FAST64_TYPE__ long unsigned int 2025-05-07T19:44:48.5835891Z #define __cpp_inline_variables 201606L 2025-05-07T19:44:48.5836186Z #define __INT_MAX__ 0x7fffffff 2025-05-07T19:44:48.5836468Z #define __linux__ 1 2025-05-07T19:44:48.5836705Z #define __INT64_TYPE__ long int 2025-05-07T19:44:48.5836996Z #define __FLT_MAX_EXP__ 128 2025-05-07T19:44:48.5837284Z #define __ORDER_BIG_ENDIAN__ 4321 2025-05-07T19:44:48.5837568Z #define __DBL_MANT_DIG__ 53 2025-05-07T19:44:48.5837886Z #define __cpp_inheriting_constructors 201511L 2025-05-07T19:44:48.5838210Z #define __SIZEOF_FLOAT128__ 16 2025-05-07T19:44:48.5838537Z #define __INT_LEAST64_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:48.5838871Z #define __DEC64_MIN__ 1E-383DD 2025-05-07T19:44:48.5839170Z #define __WINT_TYPE__ unsigned int 2025-05-07T19:44:48.5839470Z #define __UINT_LEAST32_TYPE__ unsigned int 2025-05-07T19:44:48.5839807Z #define __SIZEOF_SHORT__ 2 2025-05-07T19:44:48.5840164Z #define __FLT32_NORM_MAX__ 3.40282346638528859811704183484516925e+38F32 2025-05-07T19:44:48.5840533Z #define __SSE__ 1 2025-05-07T19:44:48.5840793Z #define 
__LDBL_MIN_EXP__ (-16381) 2025-05-07T19:44:48.5841139Z #define __FLT64_MAX__ 1.79769313486231570814527423731704357e+308F64 2025-05-07T19:44:48.5841521Z #define __amd64__ 1 2025-05-07T19:44:48.5841757Z #define __WINT_WIDTH__ 32 2025-05-07T19:44:48.5842039Z #define __INT_LEAST64_WIDTH__ 64 2025-05-07T19:44:48.5842320Z #define __LDBL_MAX_EXP__ 16384 2025-05-07T19:44:48.5842627Z #define __FLT32X_MAX_10_EXP__ 308 2025-05-07T19:44:48.5842911Z #define __SIZEOF_INT128__ 16 2025-05-07T19:44:48.5843211Z #define __FLT64X_IS_IEC_60559__ 2 2025-05-07T19:44:48.5843523Z #define __LDBL_MAX_10_EXP__ 4932 2025-05-07T19:44:48.5843803Z #define __ATOMIC_RELAXED 0 2025-05-07T19:44:48.5844192Z #define __DBL_EPSILON__ double(2.22044604925031308084726333618164062e-16L) 2025-05-07T19:44:48.5844684Z #define __FLT128_MIN__ 3.36210314311209350626267781732175260e-4932F128 2025-05-07T19:44:48.5845086Z #define _LP64 1 2025-05-07T19:44:48.5845319Z #define __UINT8_C(c) c 2025-05-07T19:44:48.5845596Z #define __FLT64_MAX_EXP__ 1024 2025-05-07T19:44:48.5845879Z #define __INT_LEAST32_TYPE__ int 2025-05-07T19:44:48.5846194Z #define __SIZEOF_WCHAR_T__ 4 2025-05-07T19:44:48.5846502Z #define __GNUC_PATCHLEVEL__ 0 2025-05-07T19:44:48.5846935Z #define __FLT128_NORM_MAX__ 1.18973149535723176508575932662800702e+4932F128 2025-05-07T19:44:48.5847450Z #define __FLT64_NORM_MAX__ 1.79769313486231570814527423731704357e+308F64 2025-05-07T19:44:48.5847843Z #define __FLT128_HAS_QUIET_NAN__ 1 2025-05-07T19:44:48.5848175Z #define __INTMAX_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:48.5848506Z #define __INT_FAST8_TYPE__ signed char 2025-05-07T19:44:48.5848857Z #define __cpp_namespace_attributes 201411L 2025-05-07T19:44:48.5849245Z #define __FLT64X_MIN__ 3.36210314311209350626267781732175260e-4932F64x 2025-05-07T19:44:48.5849708Z #define __STDCPP_THREADS__ 1 2025-05-07T19:44:48.5849999Z #define __GNUC_STDC_INLINE__ 1 2025-05-07T19:44:48.5850263Z #define __FLT64_HAS_DENORM__ 1 2025-05-07T19:44:48.5850630Z #define 
__FLT32_EPSILON__ 1.19209289550781250000000000000000000e-7F32 2025-05-07T19:44:48.5851006Z #define __DBL_DECIMAL_DIG__ 17 2025-05-07T19:44:48.5851292Z #define __STDC_UTF_32__ 1 2025-05-07T19:44:48.5851545Z #define __INT_FAST8_WIDTH__ 8 2025-05-07T19:44:48.5851827Z #define __FXSR__ 1 2025-05-07T19:44:48.5852135Z #define __FLT32X_MAX__ 1.79769313486231570814527423731704357e+308F32x 2025-05-07T19:44:48.5852620Z #define __DBL_NORM_MAX__ double(1.79769313486231570814527423731704357e+308L) 2025-05-07T19:44:48.5853065Z #define __BYTE_ORDER__ __ORDER_LITTLE_ENDIAN__ 2025-05-07T19:44:48.5853381Z #define __INTMAX_WIDTH__ 64 2025-05-07T19:44:48.5853675Z #define __cpp_runtime_arrays 198712L 2025-05-07T19:44:48.5853981Z #define __UINT64_TYPE__ long unsigned int 2025-05-07T19:44:48.5854309Z #define __UINT32_C(c) c ## U 2025-05-07T19:44:48.5854589Z #define __cpp_alias_templates 200704L 2025-05-07T19:44:48.5854978Z #define __FLT_DENORM_MIN__ 1.40129846432481707092372958328991613e-45F 2025-05-07T19:44:48.5855360Z #define __FLT128_IS_IEC_60559__ 2 2025-05-07T19:44:48.5855664Z #define __INT8_MAX__ 0x7f 2025-05-07T19:44:48.5855919Z #define __LONG_WIDTH__ 64 2025-05-07T19:44:48.5856185Z #define __PIC__ 2 2025-05-07T19:44:48.5856466Z #define __UINT_FAST32_TYPE__ long unsigned int 2025-05-07T19:44:48.5856881Z #define __FLT32X_NORM_MAX__ 1.79769313486231570814527423731704357e+308F32x 2025-05-07T19:44:48.5857303Z #define __CHAR32_TYPE__ unsigned int 2025-05-07T19:44:48.5877934Z #define __FLT_MAX__ 3.40282346638528859811704183484516925e+38F 2025-05-07T19:44:48.5878491Z #define __cpp_constexpr 201603L 2025-05-07T19:44:48.5878806Z #define __SSE2__ 1 2025-05-07T19:44:48.5879065Z #define __cpp_deduction_guides 201703L 2025-05-07T19:44:48.5879410Z #define __INT32_TYPE__ int 2025-05-07T19:44:48.5879679Z #define __SIZEOF_DOUBLE__ 8 2025-05-07T19:44:48.5879999Z #define __cpp_exceptions 199711L 2025-05-07T19:44:48.5880282Z #define __FLT_MIN_10_EXP__ (-37) 2025-05-07T19:44:48.5880654Z #define 
__FLT64_MIN__ 2.22507385850720138309023271733240406e-308F64 2025-05-07T19:44:48.5881029Z #define __INT_LEAST32_WIDTH__ 32 2025-05-07T19:44:48.5881339Z #define __INTMAX_TYPE__ long int 2025-05-07T19:44:48.5881642Z #define __DEC128_MAX_EXP__ 6145 2025-05-07T19:44:48.5882274Z #define __FLT32X_HAS_QUIET_NAN__ 1 2025-05-07T19:44:48.5882786Z #define __ATOMIC_CONSUME 1 2025-05-07T19:44:48.5883127Z #define __GNUC_MINOR__ 4 2025-05-07T19:44:48.5883443Z #define __GLIBCXX_TYPE_INT_N_0 __int128 2025-05-07T19:44:48.5883768Z #define __INT_FAST16_WIDTH__ 64 2025-05-07T19:44:48.5884120Z #define __UINTMAX_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:48.5884451Z #define __PIE__ 2 2025-05-07T19:44:48.5884844Z #define __FLT32X_DENORM_MIN__ 4.94065645841246544176568792868221372e-324F32x 2025-05-07T19:44:48.5885295Z #define __cpp_template_template_args 201611L 2025-05-07T19:44:48.5885644Z #define __DBL_MAX_10_EXP__ 308 2025-05-07T19:44:48.5886019Z #define __LDBL_DENORM_MIN__ 3.64519953188247460252840593361941982e-4951L 2025-05-07T19:44:48.5886439Z #define __INT16_C(c) c 2025-05-07T19:44:48.5886678Z #define __STDC__ 1 2025-05-07T19:44:48.5886943Z #define __FLT32X_DIG__ 15 2025-05-07T19:44:48.5887215Z #define __PTRDIFF_TYPE__ long int 2025-05-07T19:44:48.5887532Z #define __ATOMIC_SEQ_CST 5 2025-05-07T19:44:48.5887992Z #define __FLT32X_MIN_10_EXP__ (-307) 2025-05-07T19:44:48.5888319Z #define __UINTPTR_TYPE__ long unsigned int 2025-05-07T19:44:48.5888707Z #define __DEC64_SUBNORMAL_MIN__ 0.000000000000001E-383DD 2025-05-07T19:44:48.5889063Z #define __DEC128_MANT_DIG__ 34 2025-05-07T19:44:48.5889352Z #define __LDBL_MIN_10_EXP__ (-4931) 2025-05-07T19:44:48.5889655Z #define __cpp_generic_lambdas 201304L 2025-05-07T19:44:48.5889966Z #define __SSE_MATH__ 1 2025-05-07T19:44:48.5890211Z #define __SIZEOF_LONG_LONG__ 8 2025-05-07T19:44:48.5890529Z #define __cpp_user_defined_literals 200809L 2025-05-07T19:44:48.5890980Z #define __FLT128_DECIMAL_DIG__ 36 2025-05-07T19:44:48.5891312Z #define 
__GCC_ATOMIC_LLONG_LOCK_FREE 2 2025-05-07T19:44:48.5891644Z #define __FLT32_HAS_QUIET_NAN__ 1 2025-05-07T19:44:48.5891934Z #define __FLT_DECIMAL_DIG__ 9 2025-05-07T19:44:48.5892265Z #define __UINT_FAST16_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:48.5892666Z #define __LDBL_NORM_MAX__ 1.18973149535723176502126385303097021e+4932L 2025-05-07T19:44:48.5893084Z #define __GCC_ATOMIC_SHORT_LOCK_FREE 2 2025-05-07T19:44:48.5893407Z #define __UINT_FAST8_TYPE__ unsigned char 2025-05-07T19:44:48.5893735Z #define _GNU_SOURCE 1 2025-05-07T19:44:48.5893998Z #define __cpp_init_captures 201304L 2025-05-07T19:44:48.5894324Z #define __ATOMIC_ACQ_REL 4 2025-05-07T19:44:48.5894719Z #define __ATOMIC_RELEASE 3 2025-05-07T19:44:48.5894879Z 2025-05-07T19:44:48.6333627Z 2025-05-07T19:44:48.6334247Z + conda run -n build_binary c++ --version 2025-05-07T19:44:48.6334952Z 2025-05-07T19:44:50.4101802Z c++ (conda-forge gcc 11.4.0-13) 11.4.0 2025-05-07T19:44:50.4102929Z Copyright (C) 2021 Free Software Foundation, Inc. 2025-05-07T19:44:50.4104311Z This is free software; see the source for copying conditions. There is NO 2025-05-07T19:44:50.4105998Z warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. 2025-05-07T19:44:50.4107033Z 2025-05-07T19:44:50.4107047Z 2025-05-07T19:44:50.4683993Z 2025-05-07T19:44:50.4685211Z [INFO] Printing the default version of the C standard used by the compiler ... 2025-05-07T19:44:50.4686284Z + conda run -n build_binary cc -dM -E - < /dev/null | grep __STDC_VERSION__ 2025-05-07T19:44:50.4686623Z 2025-05-07T19:44:52.3510463Z #define __STDC_VERSION__ 201710L 2025-05-07T19:44:52.3510808Z 2025-05-07T19:44:52.3511116Z [INFO] Printing the default version of the C++ standard used by the compiler ... 
2025-05-07T19:44:52.3511761Z + conda run -n build_binary c++ -dM -E -x c++ - < /dev/null | grep __cplusplus 2025-05-07T19:44:52.3512138Z 2025-05-07T19:44:54.1968666Z #define __cplusplus 201703L 2025-05-07T19:44:54.1968967Z 2025-05-07T19:44:54.1969623Z [INSTALL] Successfully installed C/C++ compilers 2025-05-07T19:44:54.2058559Z ##[group]Run . $PRELUDE; install_build_tools $BUILD_ENV 2025-05-07T19:44:54.2059083Z . $PRELUDE; install_build_tools $BUILD_ENV 2025-05-07T19:44:54.2060186Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:44:54.2060664Z env: 2025-05-07T19:44:54.2060951Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:44:54.2061294Z BUILD_ENV: build_binary 2025-05-07T19:44:54.2061603Z BUILD_TARGET: default 2025-05-07T19:44:54.2061870Z BUILD_VARIANT: cuda 2025-05-07T19:44:54.2062170Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:44:54.2062452Z ##[endgroup] 2025-05-07T19:44:54.6079689Z ################################################################################ 2025-05-07T19:44:54.6080751Z # Install Build Tools 2025-05-07T19:44:54.6081110Z # 2025-05-07T19:44:54.6092491Z # [2025-05-07T19:44:54.608Z] + install_build_tools build_binary 2025-05-07T19:44:54.6093766Z ################################################################################ 2025-05-07T19:44:54.6094572Z 2025-05-07T19:44:54.6106907Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:44:54.6981448Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:44:54.6984797Z [INSTALL] Installing build tools ... 
2025-05-07T19:44:54.7009683Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y auditwheel bazel cmake>=3.30 hypothesis jinja2 make ncurses ninja openblas patchelf rhash scikit-build wheel pyyaml 2025-05-07T19:44:55.4175478Z Channels: 2025-05-07T19:44:55.4176166Z - conda-forge 2025-05-07T19:44:55.4176849Z Platform: linux-64 2025-05-07T19:44:58.5065019Z Collecting package metadata (repodata.json): - \ | / done 2025-05-07T19:45:01.7960326Z Solving environment: \ | / done 2025-05-07T19:45:01.8465780Z 2025-05-07T19:45:01.8466775Z ## Package Plan ## 2025-05-07T19:45:01.8467321Z 2025-05-07T19:45:01.8468145Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:45:01.8469570Z 2025-05-07T19:45:01.8469888Z added / updated specs: 2025-05-07T19:45:01.8470640Z - auditwheel 2025-05-07T19:45:01.8471248Z - bazel 2025-05-07T19:45:01.8471884Z - cmake[version='>=3.30'] 2025-05-07T19:45:01.8472633Z - hypothesis 2025-05-07T19:45:01.8473238Z - jinja2 2025-05-07T19:45:01.8473455Z - make 2025-05-07T19:45:01.8473690Z - ncurses 2025-05-07T19:45:01.8473935Z - ninja 2025-05-07T19:45:01.8474147Z - openblas 2025-05-07T19:45:01.8474398Z - patchelf 2025-05-07T19:45:01.8474625Z - pyyaml 2025-05-07T19:45:01.8474869Z - rhash 2025-05-07T19:45:01.8475088Z - scikit-build 2025-05-07T19:45:01.8475582Z - wheel 2025-05-07T19:45:01.8475703Z 2025-05-07T19:45:01.8475707Z 2025-05-07T19:45:01.8475832Z The following packages will be downloaded: 2025-05-07T19:45:01.8476081Z 2025-05-07T19:45:01.8476207Z package | build 2025-05-07T19:45:01.8476543Z ---------------------------|----------------- 2025-05-07T19:45:01.8476958Z alsa-lib-1.2.14 | hb9d3cd8_0 553 KB conda-forge 2025-05-07T19:45:01.8477426Z attrs-25.3.0 | pyh71513ae_0 56 KB conda-forge 2025-05-07T19:45:01.8477871Z auditwheel-6.2.0 | pyha804496_1 40 KB conda-forge 2025-05-07T19:45:01.8478328Z bazel-7.5.0 | h96810dc_2 47.4 MB conda-forge 2025-05-07T19:45:01.8478761Z bzip2-1.0.8 | h4bc722e_7 247 KB 
conda-forge 2025-05-07T19:45:01.8479169Z c-ares-1.34.5 | hb9d3cd8_0 202 KB conda-forge 2025-05-07T19:45:01.8479595Z cairo-1.18.4 | h3394656_0 955 KB conda-forge 2025-05-07T19:45:01.8480024Z click-8.1.8 | pyh707e725_0 83 KB conda-forge 2025-05-07T19:45:01.8480427Z cmake-4.0.2 | h74e3db0_0 19.4 MB conda-forge 2025-05-07T19:45:01.8481043Z distro-1.9.0 | pyhd8ed1ab_1 41 KB conda-forge 2025-05-07T19:45:01.8481515Z exceptiongroup-1.2.2 | pyhd8ed1ab_1 20 KB conda-forge 2025-05-07T19:45:01.8482490Z font-ttf-dejavu-sans-mono-2.37| hab24e00_0 388 KB conda-forge 2025-05-07T19:45:01.8483079Z font-ttf-inconsolata-3.000 | h77eed37_0 94 KB conda-forge 2025-05-07T19:45:01.8483684Z font-ttf-source-code-pro-2.038| h77eed37_0 684 KB conda-forge 2025-05-07T19:45:01.8484265Z font-ttf-ubuntu-0.83 | h77eed37_3 1.5 MB conda-forge 2025-05-07T19:45:01.8484764Z fontconfig-2.15.0 | h7e30c49_1 259 KB conda-forge 2025-05-07T19:45:01.8485307Z fonts-conda-ecosystem-1 | 0 4 KB conda-forge 2025-05-07T19:45:01.8485827Z fonts-conda-forge-1 | 0 4 KB conda-forge 2025-05-07T19:45:01.8486341Z freetype-2.13.3 | ha770c72_1 168 KB conda-forge 2025-05-07T19:45:01.8486800Z giflib-5.2.2 | hd590300_0 75 KB conda-forge 2025-05-07T19:45:01.8487276Z graphite2-1.3.13 | h59595ed_1003 95 KB conda-forge 2025-05-07T19:45:01.8487768Z harfbuzz-11.0.0 | h76408a6_0 1.6 MB conda-forge 2025-05-07T19:45:01.8488374Z hypothesis-6.131.14 | pyha770c72_0 348 KB conda-forge 2025-05-07T19:45:01.8488952Z icu-75.1 | he02047a_0 11.6 MB conda-forge 2025-05-07T19:45:01.8489338Z ijar-7.5.0 | h5888daf_0 114 KB conda-forge 2025-05-07T19:45:01.8489764Z jinja2-3.1.6 | pyhd8ed1ab_0 110 KB conda-forge 2025-05-07T19:45:01.8490213Z keyutils-1.6.1 | h166bdaf_0 115 KB conda-forge 2025-05-07T19:45:01.8490623Z krb5-1.21.3 | h659f571_0 1.3 MB conda-forge 2025-05-07T19:45:01.8491157Z lcms2-2.17 | h717163a_0 242 KB conda-forge 2025-05-07T19:45:01.8491550Z lerc-4.0.0 | h0aef613_1 258 KB conda-forge 2025-05-07T19:45:01.8492019Z libabseil-20250127.1 | 
cxx17_hbbce691_0 1.3 MB conda-forge 2025-05-07T19:45:01.8492473Z libcups-2.3.3 | h4637d8d_4 4.3 MB conda-forge 2025-05-07T19:45:01.8492920Z libcurl-8.13.0 | h332b0f4_0 428 KB conda-forge 2025-05-07T19:45:01.8493377Z libdeflate-1.23 | h86f0d12_0 71 KB conda-forge 2025-05-07T19:45:01.8493847Z libedit-3.1.20250104 | pl5321h7949ede_0 132 KB conda-forge 2025-05-07T19:45:01.8494325Z libev-4.33 | hd590300_2 110 KB conda-forge 2025-05-07T19:45:01.8494743Z libexpat-2.7.0 | h5888daf_0 73 KB conda-forge 2025-05-07T19:45:01.8495212Z libfreetype-2.13.3 | ha770c72_1 8 KB conda-forge 2025-05-07T19:45:01.8495714Z libfreetype6-2.13.3 | h48d6fc4_1 371 KB conda-forge 2025-05-07T19:45:01.8496178Z libgfortran-15.1.0 | h69a702a_2 34 KB conda-forge 2025-05-07T19:45:01.8496657Z libgfortran5-15.1.0 | hcea5267_2 1.5 MB conda-forge 2025-05-07T19:45:01.8497109Z libglib-2.84.0 | h2ff4ddf_0 3.8 MB conda-forge 2025-05-07T19:45:01.8497552Z libgrpc-1.71.0 | h8e591d7_1 7.6 MB conda-forge 2025-05-07T19:45:01.8497980Z libiconv-1.18 | h4ce23a2_1 696 KB conda-forge 2025-05-07T19:45:01.8498453Z libjpeg-turbo-3.1.0 | hb9d3cd8_0 614 KB conda-forge 2025-05-07T19:45:01.8498919Z liblzma-5.8.1 | hb9d3cd8_1 110 KB conda-forge 2025-05-07T19:45:01.8499361Z liblzma-devel-5.8.1 | hb9d3cd8_1 431 KB conda-forge 2025-05-07T19:45:01.8499953Z libnghttp2-1.64.0 | h161d5f1_0 632 KB conda-forge 2025-05-07T19:45:01.8501586Z libnsl-2.0.1 | hd590300_0 33 KB conda-forge 2025-05-07T19:45:01.8502117Z libopenblas-0.3.29 |pthreads_h94d23a6_0 5.6 MB conda-forge 2025-05-07T19:45:01.8502617Z libpng-1.6.47 | h943b412_0 282 KB conda-forge 2025-05-07T19:45:01.8503122Z libprotobuf-5.29.3 | h501fc15_1 3.2 MB conda-forge 2025-05-07T19:45:01.8503630Z libre2-11-2024.07.02 | hba17884_3 205 KB conda-forge 2025-05-07T19:45:01.8504106Z libsqlite-3.49.2 | hee588c1_0 895 KB conda-forge 2025-05-07T19:45:01.8504587Z libssh2-1.11.1 | hcf80075_0 298 KB conda-forge 2025-05-07T19:45:01.8505036Z libtiff-4.7.0 | hd9ff511_4 419 KB conda-forge 
2025-05-07T19:45:01.8505506Z libuuid-2.38.1 | h0b41bf4_0 33 KB conda-forge 2025-05-07T19:45:01.8505980Z libuv-1.50.0 | hb9d3cd8_0 870 KB conda-forge 2025-05-07T19:45:01.8506550Z libwebp-base-1.5.0 | h851e524_0 420 KB conda-forge 2025-05-07T19:45:01.8507010Z libxcb-1.17.0 | h8a09558_0 387 KB conda-forge 2025-05-07T19:45:01.8507420Z libzlib-1.3.1 | hb9d3cd8_2 60 KB conda-forge 2025-05-07T19:45:01.8507851Z make-4.4.1 | hb9d3cd8_2 501 KB conda-forge 2025-05-07T19:45:01.8508274Z markupsafe-3.0.2 | py39h9399b63_1 22 KB conda-forge 2025-05-07T19:45:01.8508734Z ncurses-6.5 | h2d0b736_3 871 KB conda-forge 2025-05-07T19:45:01.8509173Z ninja-1.12.1 | hff21bea_1 158 KB conda-forge 2025-05-07T19:45:01.8509614Z openblas-0.3.29 |pthreads_h6ec200e_0 5.8 MB conda-forge 2025-05-07T19:45:01.8510090Z openjdk-23.0.2 | h53dfc1b_2 181.4 MB conda-forge 2025-05-07T19:45:01.8510597Z packaging-25.0 | pyh29332c3_1 61 KB conda-forge 2025-05-07T19:45:01.8511059Z patchelf-0.18.0 | h3f2d84a_2 133 KB conda-forge 2025-05-07T19:45:01.8511475Z pcre2-10.44 | hc749103_2 934 KB conda-forge 2025-05-07T19:45:01.8511905Z pixman-0.46.0 | h29eaf8c_0 389 KB conda-forge 2025-05-07T19:45:01.8512374Z pthread-stubs-0.4 | hb9d3cd8_1002 8 KB conda-forge 2025-05-07T19:45:01.8512833Z pyelftools-0.32 | pyh707e725_1 146 KB conda-forge 2025-05-07T19:45:01.8513306Z python-3.9.22 |h85ef794_1_cpython 22.5 MB conda-forge 2025-05-07T19:45:01.8513739Z pyyaml-6.0.2 | py39h9399b63_2 178 KB conda-forge 2025-05-07T19:45:01.8514181Z re2-2024.07.02 | h9925aae_3 26 KB conda-forge 2025-05-07T19:45:01.8514616Z rhash-1.4.5 | hb9d3cd8_0 183 KB conda-forge 2025-05-07T19:45:01.8515052Z scikit-build-0.18.1 | pyhae55e72_2 114 KB conda-forge 2025-05-07T19:45:01.8515531Z singlejar-7.5.0 | h0e684df_1 122 KB conda-forge 2025-05-07T19:45:01.8516004Z sortedcontainers-2.4.0 | pyhd8ed1ab_1 28 KB conda-forge 2025-05-07T19:45:01.8516488Z sqlite-3.49.2 | h9eae976_0 840 KB conda-forge 2025-05-07T19:45:01.8516897Z tk-8.6.13 |noxft_h4845f30_101 3.2 MB 
conda-forge 2025-05-07T19:45:01.8517331Z tomli-2.2.1 | pyhd8ed1ab_1 19 KB conda-forge 2025-05-07T19:45:01.8517774Z wheel-0.45.1 | pyhd8ed1ab_1 61 KB conda-forge 2025-05-07T19:45:01.8518207Z xorg-libice-1.1.2 | hb9d3cd8_0 57 KB conda-forge 2025-05-07T19:45:01.8518679Z xorg-libsm-1.2.6 | he73a12e_0 27 KB conda-forge 2025-05-07T19:45:01.8519194Z xorg-libx11-1.8.12 | h4f16b4b_0 816 KB conda-forge 2025-05-07T19:45:01.8519674Z xorg-libxau-1.0.12 | hb9d3cd8_0 14 KB conda-forge 2025-05-07T19:45:01.8520136Z xorg-libxdmcp-1.1.5 | hb9d3cd8_0 19 KB conda-forge 2025-05-07T19:45:01.8520625Z xorg-libxext-1.3.6 | hb9d3cd8_0 49 KB conda-forge 2025-05-07T19:45:01.8521120Z xorg-libxfixes-6.0.1 | hb9d3cd8_0 19 KB conda-forge 2025-05-07T19:45:01.8521585Z xorg-libxi-1.8.2 | hb9d3cd8_0 46 KB conda-forge 2025-05-07T19:45:01.8522073Z xorg-libxrandr-1.5.4 | hb9d3cd8_0 29 KB conda-forge 2025-05-07T19:45:01.8522556Z xorg-libxrender-0.9.12 | hb9d3cd8_0 32 KB conda-forge 2025-05-07T19:45:01.8523046Z xorg-libxt-1.3.1 | hb9d3cd8_0 371 KB conda-forge 2025-05-07T19:45:01.8523536Z xorg-libxtst-1.2.5 | hb9d3cd8_3 32 KB conda-forge 2025-05-07T19:45:01.8523955Z xz-5.8.1 | hbcc6ac9_1 23 KB conda-forge 2025-05-07T19:45:01.8524394Z xz-gpl-tools-5.8.1 | hbcc6ac9_1 33 KB conda-forge 2025-05-07T19:45:01.8524833Z xz-tools-5.8.1 | hb9d3cd8_1 94 KB conda-forge 2025-05-07T19:45:01.8525268Z yaml-0.2.5 | h7f98852_2 87 KB conda-forge 2025-05-07T19:45:01.8525664Z zlib-1.3.1 | hb9d3cd8_2 90 KB conda-forge 2025-05-07T19:45:01.8526079Z zstd-1.5.7 | hb8e6e7a_2 554 KB conda-forge 2025-05-07T19:45:01.8526487Z ------------------------------------------------------------ 2025-05-07T19:45:01.8526840Z Total: 342.5 MB 2025-05-07T19:45:01.8527058Z 2025-05-07T19:45:01.8527225Z The following NEW packages will be INSTALLED: 2025-05-07T19:45:01.8527515Z 2025-05-07T19:45:01.8527733Z alsa-lib conda-forge/linux-64::alsa-lib-1.2.14-hb9d3cd8_0 2025-05-07T19:45:01.8528212Z attrs conda-forge/noarch::attrs-25.3.0-pyh71513ae_0 
2025-05-07T19:45:01.8528707Z auditwheel conda-forge/noarch::auditwheel-6.2.0-pyha804496_1 2025-05-07T19:45:01.8529163Z bazel conda-forge/linux-64::bazel-7.5.0-h96810dc_2 2025-05-07T19:45:01.8529613Z bzip2 conda-forge/linux-64::bzip2-1.0.8-h4bc722e_7 2025-05-07T19:45:01.8530041Z c-ares conda-forge/linux-64::c-ares-1.34.5-hb9d3cd8_0 2025-05-07T19:45:01.8530496Z cairo conda-forge/linux-64::cairo-1.18.4-h3394656_0 2025-05-07T19:45:01.8530939Z click conda-forge/noarch::click-8.1.8-pyh707e725_0 2025-05-07T19:45:01.8531357Z cmake conda-forge/linux-64::cmake-4.0.2-h74e3db0_0 2025-05-07T19:45:01.8531807Z distro conda-forge/noarch::distro-1.9.0-pyhd8ed1ab_1 2025-05-07T19:45:01.8532315Z exceptiongroup conda-forge/noarch::exceptiongroup-1.2.2-pyhd8ed1ab_1 2025-05-07T19:45:01.8532948Z font-ttf-dejavu-s~ conda-forge/noarch::font-ttf-dejavu-sans-mono-2.37-hab24e00_0 2025-05-07T19:45:01.8533599Z font-ttf-inconsol~ conda-forge/noarch::font-ttf-inconsolata-3.000-h77eed37_0 2025-05-07T19:45:01.8534215Z font-ttf-source-c~ conda-forge/noarch::font-ttf-source-code-pro-2.038-h77eed37_0 2025-05-07T19:45:01.8536987Z font-ttf-ubuntu conda-forge/noarch::font-ttf-ubuntu-0.83-h77eed37_3 2025-05-07T19:45:01.8537514Z fontconfig conda-forge/linux-64::fontconfig-2.15.0-h7e30c49_1 2025-05-07T19:45:01.8538053Z fonts-conda-ecosy~ conda-forge/noarch::fonts-conda-ecosystem-1-0 2025-05-07T19:45:01.8538556Z fonts-conda-forge conda-forge/noarch::fonts-conda-forge-1-0 2025-05-07T19:45:01.8539052Z freetype conda-forge/linux-64::freetype-2.13.3-ha770c72_1 2025-05-07T19:45:01.8539526Z giflib conda-forge/linux-64::giflib-5.2.2-hd590300_0 2025-05-07T19:45:01.8540461Z graphite2 conda-forge/linux-64::graphite2-1.3.13-h59595ed_1003 2025-05-07T19:45:01.8541045Z harfbuzz conda-forge/linux-64::harfbuzz-11.0.0-h76408a6_0 2025-05-07T19:45:01.8541564Z hypothesis conda-forge/noarch::hypothesis-6.131.14-pyha770c72_0 2025-05-07T19:45:01.8542076Z icu conda-forge/linux-64::icu-75.1-he02047a_0 2025-05-07T19:45:01.8542525Z 
ijar conda-forge/linux-64::ijar-7.5.0-h5888daf_0 2025-05-07T19:45:01.8542973Z jinja2 conda-forge/noarch::jinja2-3.1.6-pyhd8ed1ab_0 2025-05-07T19:45:01.8543477Z keyutils conda-forge/linux-64::keyutils-1.6.1-h166bdaf_0 2025-05-07T19:45:01.8543944Z krb5 conda-forge/linux-64::krb5-1.21.3-h659f571_0 2025-05-07T19:45:01.8544409Z lcms2 conda-forge/linux-64::lcms2-2.17-h717163a_0 2025-05-07T19:45:01.8544876Z lerc conda-forge/linux-64::lerc-4.0.0-h0aef613_1 2025-05-07T19:45:01.8545391Z libabseil conda-forge/linux-64::libabseil-20250127.1-cxx17_hbbce691_0 2025-05-07T19:45:01.8545979Z libcups conda-forge/linux-64::libcups-2.3.3-h4637d8d_4 2025-05-07T19:45:01.8546460Z libcurl conda-forge/linux-64::libcurl-8.13.0-h332b0f4_0 2025-05-07T19:45:01.8546979Z libdeflate conda-forge/linux-64::libdeflate-1.23-h86f0d12_0 2025-05-07T19:45:01.8547541Z libedit conda-forge/linux-64::libedit-3.1.20250104-pl5321h7949ede_0 2025-05-07T19:45:01.8548049Z libev conda-forge/linux-64::libev-4.33-hd590300_2 2025-05-07T19:45:01.8548539Z libexpat conda-forge/linux-64::libexpat-2.7.0-h5888daf_0 2025-05-07T19:45:01.8549062Z libfreetype conda-forge/linux-64::libfreetype-2.13.3-ha770c72_1 2025-05-07T19:45:01.8549641Z libfreetype6 conda-forge/linux-64::libfreetype6-2.13.3-h48d6fc4_1 2025-05-07T19:45:01.8550221Z libgfortran conda-forge/linux-64::libgfortran-15.1.0-h69a702a_2 2025-05-07T19:45:01.8550773Z libgfortran5 conda-forge/linux-64::libgfortran5-15.1.0-hcea5267_2 2025-05-07T19:45:01.8551411Z libglib conda-forge/linux-64::libglib-2.84.0-h2ff4ddf_0 2025-05-07T19:45:01.8551897Z libgrpc conda-forge/linux-64::libgrpc-1.71.0-h8e591d7_1 2025-05-07T19:45:01.8552523Z libiconv conda-forge/linux-64::libiconv-1.18-h4ce23a2_1 2025-05-07T19:45:01.8553050Z libjpeg-turbo conda-forge/linux-64::libjpeg-turbo-3.1.0-hb9d3cd8_0 2025-05-07T19:45:01.8553537Z liblzma conda-forge/linux-64::liblzma-5.8.1-hb9d3cd8_1 2025-05-07T19:45:01.8554044Z liblzma-devel conda-forge/linux-64::liblzma-devel-5.8.1-hb9d3cd8_1 
2025-05-07T19:45:01.8554555Z libnghttp2 conda-forge/linux-64::libnghttp2-1.64.0-h161d5f1_0 2025-05-07T19:45:01.8555045Z libnsl conda-forge/linux-64::libnsl-2.0.1-hd590300_0 2025-05-07T19:45:01.8555579Z libopenblas conda-forge/linux-64::libopenblas-0.3.29-pthreads_h94d23a6_0 2025-05-07T19:45:01.8556090Z libpng conda-forge/linux-64::libpng-1.6.47-h943b412_0 2025-05-07T19:45:01.8556593Z libprotobuf conda-forge/linux-64::libprotobuf-5.29.3-h501fc15_1 2025-05-07T19:45:01.8557090Z libre2-11 conda-forge/linux-64::libre2-11-2024.07.02-hba17884_3 2025-05-07T19:45:01.8557599Z libsqlite conda-forge/linux-64::libsqlite-3.49.2-hee588c1_0 2025-05-07T19:45:01.8558126Z libssh2 conda-forge/linux-64::libssh2-1.11.1-hcf80075_0 2025-05-07T19:45:01.8558577Z libtiff conda-forge/linux-64::libtiff-4.7.0-hd9ff511_4 2025-05-07T19:45:01.8559055Z libuuid conda-forge/linux-64::libuuid-2.38.1-h0b41bf4_0 2025-05-07T19:45:01.8559488Z libuv conda-forge/linux-64::libuv-1.50.0-hb9d3cd8_0 2025-05-07T19:45:01.8560003Z libwebp-base conda-forge/linux-64::libwebp-base-1.5.0-h851e524_0 2025-05-07T19:45:01.8560692Z libxcb conda-forge/linux-64::libxcb-1.17.0-h8a09558_0 2025-05-07T19:45:01.8561285Z libzlib conda-forge/linux-64::libzlib-1.3.1-hb9d3cd8_2 2025-05-07T19:45:01.8561869Z make conda-forge/linux-64::make-4.4.1-hb9d3cd8_2 2025-05-07T19:45:01.8562409Z markupsafe conda-forge/linux-64::markupsafe-3.0.2-py39h9399b63_1 2025-05-07T19:45:01.8563109Z ninja conda-forge/linux-64::ninja-1.12.1-hff21bea_1 2025-05-07T19:45:01.8563651Z openblas conda-forge/linux-64::openblas-0.3.29-pthreads_h6ec200e_0 2025-05-07T19:45:01.8564235Z openjdk conda-forge/linux-64::openjdk-23.0.2-h53dfc1b_2 2025-05-07T19:45:01.8564817Z packaging conda-forge/noarch::packaging-25.0-pyh29332c3_1 2025-05-07T19:45:01.8565324Z patchelf conda-forge/linux-64::patchelf-0.18.0-h3f2d84a_2 2025-05-07T19:45:01.8565822Z pcre2 conda-forge/linux-64::pcre2-10.44-hc749103_2 2025-05-07T19:45:01.8566307Z pixman conda-forge/linux-64::pixman-0.46.0-h29eaf8c_0 
2025-05-07T19:45:01.8566835Z pthread-stubs conda-forge/linux-64::pthread-stubs-0.4-hb9d3cd8_1002 2025-05-07T19:45:01.8567424Z pyelftools conda-forge/noarch::pyelftools-0.32-pyh707e725_1 2025-05-07T19:45:01.8567931Z pyyaml conda-forge/linux-64::pyyaml-6.0.2-py39h9399b63_2 2025-05-07T19:45:01.8568414Z re2 conda-forge/linux-64::re2-2024.07.02-h9925aae_3 2025-05-07T19:45:01.8568877Z rhash conda-forge/linux-64::rhash-1.4.5-hb9d3cd8_0 2025-05-07T19:45:01.8569395Z scikit-build conda-forge/noarch::scikit-build-0.18.1-pyhae55e72_2 2025-05-07T19:45:01.8570020Z singlejar conda-forge/linux-64::singlejar-7.5.0-h0e684df_1 2025-05-07T19:45:01.8570687Z sortedcontainers conda-forge/noarch::sortedcontainers-2.4.0-pyhd8ed1ab_1 2025-05-07T19:45:01.8571245Z tomli conda-forge/noarch::tomli-2.2.1-pyhd8ed1ab_1 2025-05-07T19:45:01.8571779Z xorg-libice conda-forge/linux-64::xorg-libice-1.1.2-hb9d3cd8_0 2025-05-07T19:45:01.8572312Z xorg-libsm conda-forge/linux-64::xorg-libsm-1.2.6-he73a12e_0 2025-05-07T19:45:01.8572962Z xorg-libx11 conda-forge/linux-64::xorg-libx11-1.8.12-h4f16b4b_0 2025-05-07T19:45:01.8573540Z xorg-libxau conda-forge/linux-64::xorg-libxau-1.0.12-hb9d3cd8_0 2025-05-07T19:45:01.8574107Z xorg-libxdmcp conda-forge/linux-64::xorg-libxdmcp-1.1.5-hb9d3cd8_0 2025-05-07T19:45:01.8574717Z xorg-libxext conda-forge/linux-64::xorg-libxext-1.3.6-hb9d3cd8_0 2025-05-07T19:45:01.8575299Z xorg-libxfixes conda-forge/linux-64::xorg-libxfixes-6.0.1-hb9d3cd8_0 2025-05-07T19:45:01.8575897Z xorg-libxi conda-forge/linux-64::xorg-libxi-1.8.2-hb9d3cd8_0 2025-05-07T19:45:01.8576488Z xorg-libxrandr conda-forge/linux-64::xorg-libxrandr-1.5.4-hb9d3cd8_0 2025-05-07T19:45:01.8577095Z xorg-libxrender conda-forge/linux-64::xorg-libxrender-0.9.12-hb9d3cd8_0 2025-05-07T19:45:01.8577704Z xorg-libxt conda-forge/linux-64::xorg-libxt-1.3.1-hb9d3cd8_0 2025-05-07T19:45:01.8578246Z xorg-libxtst conda-forge/linux-64::xorg-libxtst-1.2.5-hb9d3cd8_3 2025-05-07T19:45:01.8578837Z xz-gpl-tools 
conda-forge/linux-64::xz-gpl-tools-5.8.1-hbcc6ac9_1 2025-05-07T19:45:01.8579397Z xz-tools conda-forge/linux-64::xz-tools-5.8.1-hb9d3cd8_1 2025-05-07T19:45:01.8579942Z yaml conda-forge/linux-64::yaml-0.2.5-h7f98852_2 2025-05-07T19:45:01.8580429Z zstd conda-forge/linux-64::zstd-1.5.7-hb8e6e7a_2 2025-05-07T19:45:01.8580748Z 2025-05-07T19:45:01.8580883Z The following packages will be UPDATED: 2025-05-07T19:45:01.8581147Z 2025-05-07T19:45:01.8581498Z ncurses pkgs/main::ncurses-6.4-h6a678d5_0 --> conda-forge::ncurses-6.5-h2d0b736_3 2025-05-07T19:45:01.8582401Z python pkgs/main::python-3.9.21-he870216_1 --> conda-forge::python-3.9.22-h85ef794_1_cpython 2025-05-07T19:45:01.8583106Z sqlite pkgs/main::sqlite-3.45.3-h5eee18b_0 --> conda-forge::sqlite-3.49.2-h9eae976_0 2025-05-07T19:45:01.8583854Z wheel pkgs/main/linux-64::wheel-0.45.1-py39~ --> conda-forge/noarch::wheel-0.45.1-pyhd8ed1ab_1 2025-05-07T19:45:01.8584700Z xz pkgs/main::xz-5.6.4-h5eee18b_1 --> conda-forge::xz-5.8.1-hbcc6ac9_1 2025-05-07T19:45:01.8585296Z zlib pkgs/main::zlib-1.2.13-h5eee18b_1 --> conda-forge::zlib-1.3.1-hb9d3cd8_2 2025-05-07T19:45:01.8585694Z 2025-05-07T19:45:01.8585938Z The following packages will be SUPERSEDED by a higher-priority channel: 2025-05-07T19:45:01.8586290Z 2025-05-07T19:45:01.8586541Z tk pkgs/main::tk-8.6.14-h39e8969_0 --> conda-forge::tk-8.6.13-noxft_h4845f30_101 2025-05-07T19:45:01.8586932Z 2025-05-07T19:45:01.8586961Z 2025-05-07T19:45:01.8586965Z 2025-05-07T19:45:01.8587123Z Downloading and Extracting Packages: ...working... 
2025-05-07T19:45:01.8587557Z openjdk-23.0.2 | 181.4 MB | | 0% 2025-05-07T19:45:01.8587809Z 2025-05-07T19:45:01.8588125Z bazel-7.5.0 | 47.4 MB | | 0%  2025-05-07T19:45:01.8588400Z 2025-05-07T19:45:01.8588405Z 2025-05-07T19:45:01.8588627Z python-3.9.22 | 22.5 MB | | 0%  2025-05-07T19:45:01.8588895Z 2025-05-07T19:45:01.8588899Z 2025-05-07T19:45:01.8588903Z 2025-05-07T19:45:01.8592884Z cmake-4.0.2 | 19.4 MB | | 0%  2025-05-07T19:45:01.8593200Z 2025-05-07T19:45:01.8593218Z 2025-05-07T19:45:01.8593223Z 2025-05-07T19:45:01.8593229Z 2025-05-07T19:45:01.8605089Z icu-75.1 | 11.6 MB | | 0%  2025-05-07T19:45:01.8605974Z 2025-05-07T19:45:01.8605990Z 2025-05-07T19:45:01.8606007Z 2025-05-07T19:45:01.8606026Z 2025-05-07T19:45:01.8607453Z 2025-05-07T19:45:01.8608289Z libgrpc-1.71.0 | 7.6 MB | | 0%  2025-05-07T19:45:01.8608628Z 2025-05-07T19:45:01.8608631Z 2025-05-07T19:45:01.8608635Z 2025-05-07T19:45:01.8608638Z 2025-05-07T19:45:01.8608642Z 2025-05-07T19:45:01.8608645Z 2025-05-07T19:45:01.8608905Z openblas-0.3.29 | 5.8 MB | | 0%  2025-05-07T19:45:01.8609410Z 2025-05-07T19:45:01.8609413Z 2025-05-07T19:45:01.8609416Z 2025-05-07T19:45:01.8609435Z 2025-05-07T19:45:01.8609439Z 2025-05-07T19:45:01.8609443Z 2025-05-07T19:45:01.8609446Z 2025-05-07T19:45:01.8609713Z libopenblas-0.3.29 | 5.6 MB | | 0%  2025-05-07T19:45:01.8610050Z 2025-05-07T19:45:01.8610053Z 2025-05-07T19:45:01.8610057Z 2025-05-07T19:45:01.8610060Z 2025-05-07T19:45:01.8610064Z 2025-05-07T19:45:01.8610067Z 2025-05-07T19:45:01.8610071Z 2025-05-07T19:45:01.8610074Z 2025-05-07T19:45:01.8610337Z libcups-2.3.3 | 4.3 MB | | 0%  2025-05-07T19:45:01.8610660Z 2025-05-07T19:45:01.8610664Z 2025-05-07T19:45:01.8610667Z 2025-05-07T19:45:01.8610671Z 2025-05-07T19:45:01.8610685Z 2025-05-07T19:45:01.8610688Z 2025-05-07T19:45:01.8610692Z 2025-05-07T19:45:01.8610695Z 2025-05-07T19:45:01.8610698Z 2025-05-07T19:45:01.8610958Z libglib-2.84.0 | 3.8 MB | | 0%  2025-05-07T19:45:01.8611287Z 2025-05-07T19:45:01.8611291Z 2025-05-07T19:45:01.8611294Z 
2025-05-07T19:45:01.8619606Z 2025-05-07T19:45:01.8619610Z 2025-05-07T19:45:01.8619635Z 2025-05-07T19:45:01.8619982Z pcre2-10.44 | 934 KB | | 0%  2025-05-07T19:45:01.8620308Z 2025-05-07T19:45:01.8620311Z 2025-05-07T19:45:01.8620315Z 2025-05-07T19:45:01.8620318Z 2025-05-07T19:45:01.8620322Z 2025-05-07T19:45:01.8620326Z 2025-05-07T19:45:01.8620329Z 2025-05-07T19:45:01.8620338Z 2025-05-07T19:45:01.8620341Z 2025-05-07T19:45:01.8620344Z 2025-05-07T19:45:01.8620582Z 2025-05-07T19:45:01.8620587Z 2025-05-07T19:45:01.8620619Z 2025-05-07T19:45:01.8620622Z 2025-05-07T19:45:01.8620626Z 2025-05-07T19:45:01.8620636Z 2025-05-07T19:45:01.8620640Z 2025-05-07T19:45:01.8620644Z 2025-05-07T19:45:01.8620647Z 2025-05-07T19:45:02.0659100Z ... (more hidden) ... 2025-05-07T19:45:02.0659472Z 2025-05-07T19:45:02.0659477Z 2025-05-07T19:45:02.0659482Z 2025-05-07T19:45:02.0939497Z cmake-4.0.2 | 19.4 MB | | 0%  2025-05-07T19:45:02.0939937Z 2025-05-07T19:45:02.0940085Z 2025-05-07T19:45:02.0940095Z 2025-05-07T19:45:02.0940099Z 2025-05-07T19:45:02.1654997Z icu-75.1 | 11.6 MB | | 0%  2025-05-07T19:45:02.1660010Z openjdk-23.0.2 | 181.4 MB | | 0% 2025-05-07T19:45:02.1660311Z 2025-05-07T19:45:02.1660319Z 2025-05-07T19:45:02.1660358Z 2025-05-07T19:45:02.1665256Z cmake-4.0.2 | 19.4 MB | ###3 | 34%  2025-05-07T19:45:02.1665555Z 2025-05-07T19:45:02.1666065Z 2025-05-07T19:45:02.1676318Z python-3.9.22 | 22.5 MB | | 0%  2025-05-07T19:45:02.1676998Z 2025-05-07T19:45:02.2198220Z bazel-7.5.0 | 47.4 MB | | 0%  2025-05-07T19:45:02.2198562Z 2025-05-07T19:45:02.2198568Z 2025-05-07T19:45:02.2198573Z 2025-05-07T19:45:02.2198577Z 2025-05-07T19:45:02.2656294Z icu-75.1 | 11.6 MB | #####5 | 55%  2025-05-07T19:45:02.2657883Z openjdk-23.0.2 | 181.4 MB | 2 | 2% 2025-05-07T19:45:02.2658166Z 2025-05-07T19:45:02.2658176Z 2025-05-07T19:45:02.2658639Z 2025-05-07T19:45:02.2666465Z cmake-4.0.2 | 19.4 MB | ######8 | 68%  2025-05-07T19:45:02.2666756Z 2025-05-07T19:45:02.2666854Z 2025-05-07T19:45:02.2676239Z python-3.9.22 | 22.5 MB | 
##5 | 25%  2025-05-07T19:45:02.2677105Z 2025-05-07T19:45:02.3197885Z bazel-7.5.0 | 47.4 MB | #3 | 14%  2025-05-07T19:45:02.3198218Z 2025-05-07T19:45:02.3198222Z 2025-05-07T19:45:02.3198226Z 2025-05-07T19:45:02.3198231Z 2025-05-07T19:45:02.3682287Z icu-75.1 | 11.6 MB | ########7 | 87%  2025-05-07T19:45:02.3682600Z 2025-05-07T19:45:02.3682606Z 2025-05-07T19:45:02.3682611Z 2025-05-07T19:45:02.3689405Z cmake-4.0.2 | 19.4 MB | #########9 | 100%  2025-05-07T19:45:02.3716294Z openjdk-23.0.2 | 181.4 MB | 4 | 4% 2025-05-07T19:45:02.3716635Z 2025-05-07T19:45:02.3716641Z 2025-05-07T19:45:02.3721646Z python-3.9.22 | 22.5 MB | ####5 | 45%  2025-05-07T19:45:02.3721941Z 2025-05-07T19:45:02.3938694Z bazel-7.5.0 | 47.4 MB | ###6 | 36%  2025-05-07T19:45:02.3938988Z 2025-05-07T19:45:02.3938998Z 2025-05-07T19:45:02.3939004Z 2025-05-07T19:45:02.3939009Z 2025-05-07T19:45:02.4462865Z icu-75.1 | 11.6 MB | ########## | 100%  2025-05-07T19:45:02.4463195Z 2025-05-07T19:45:02.4463208Z 2025-05-07T19:45:02.4463232Z 2025-05-07T19:45:02.4463236Z 2025-05-07T19:45:02.4463239Z 2025-05-07T19:45:02.4616261Z libgrpc-1.71.0 | 7.6 MB | | 0%  2025-05-07T19:45:02.4616599Z 2025-05-07T19:45:02.4616604Z 2025-05-07T19:45:02.4616607Z 2025-05-07T19:45:02.4740396Z cmake-4.0.2 | 19.4 MB | ########## | 100%  2025-05-07T19:45:02.4774386Z openjdk-23.0.2 | 181.4 MB | 6 | 6% 2025-05-07T19:45:02.4775021Z 2025-05-07T19:45:02.5066889Z bazel-7.5.0 | 47.4 MB | #####2 | 53%  2025-05-07T19:45:02.5067212Z 2025-05-07T19:45:02.5067217Z 2025-05-07T19:45:02.5067220Z 2025-05-07T19:45:02.5067224Z 2025-05-07T19:45:02.5067227Z 2025-05-07T19:45:02.5067231Z 2025-05-07T19:45:02.5362547Z openblas-0.3.29 | 5.8 MB | | 0%  2025-05-07T19:45:02.5362898Z 2025-05-07T19:45:02.5362904Z 2025-05-07T19:45:02.5465512Z python-3.9.22 | 22.5 MB | ######2 | 62%  2025-05-07T19:45:02.5465820Z 2025-05-07T19:45:02.5466056Z 2025-05-07T19:45:02.5466062Z 2025-05-07T19:45:02.5466066Z 2025-05-07T19:45:02.5466094Z 2025-05-07T19:45:02.5742046Z libgrpc-1.71.0 | 7.6 
MB | #########8 | 99%  2025-05-07T19:45:02.6069110Z openjdk-23.0.2 | 181.4 MB | 8 | 8% 2025-05-07T19:45:02.6069419Z 2025-05-07T19:45:02.6069430Z 2025-05-07T19:45:02.6069434Z 2025-05-07T19:45:02.6069438Z 2025-05-07T19:45:02.6069441Z 2025-05-07T19:45:02.6069882Z 2025-05-07T19:45:02.6248023Z openblas-0.3.29 | 5.8 MB | #######9 | 79%  2025-05-07T19:45:02.6248821Z 2025-05-07T19:45:02.6365189Z bazel-7.5.0 | 47.4 MB | ######7 | 68%  2025-05-07T19:45:02.6365497Z 2025-05-07T19:45:02.6368037Z 2025-05-07T19:45:02.6742228Z python-3.9.22 | 22.5 MB | ######## | 81%  2025-05-07T19:45:02.7023118Z openjdk-23.0.2 | 181.4 MB | # | 11% 2025-05-07T19:45:02.7023952Z 2025-05-07T19:45:02.7023966Z 2025-05-07T19:45:02.7023977Z 2025-05-07T19:45:02.7024007Z 2025-05-07T19:45:02.7024018Z 2025-05-07T19:45:02.7440757Z libgrpc-1.71.0 | 7.6 MB | ########## | 100%  2025-05-07T19:45:02.7441901Z 2025-05-07T19:45:02.7441961Z 2025-05-07T19:45:02.7441979Z 2025-05-07T19:45:02.7441992Z 2025-05-07T19:45:02.7442002Z 2025-05-07T19:45:02.7442013Z 2025-05-07T19:45:02.7442024Z 2025-05-07T19:45:02.7526367Z libopenblas-0.3.29 | 5.6 MB | | 0%  2025-05-07T19:45:02.7527481Z 2025-05-07T19:45:02.7590565Z bazel-7.5.0 | 47.4 MB | ########1 | 81%  2025-05-07T19:45:02.7590860Z 2025-05-07T19:45:02.7590886Z 2025-05-07T19:45:02.7590890Z 2025-05-07T19:45:02.7590894Z 2025-05-07T19:45:02.7590897Z 2025-05-07T19:45:02.7591729Z 2025-05-07T19:45:02.7742527Z openblas-0.3.29 | 5.8 MB | ########## | 100%  2025-05-07T19:45:02.8002874Z openjdk-23.0.2 | 181.4 MB | #3 | 14% 2025-05-07T19:45:02.8003396Z 2025-05-07T19:45:02.8003401Z 2025-05-07T19:45:02.8003417Z 2025-05-07T19:45:02.8003432Z 2025-05-07T19:45:02.8003436Z 2025-05-07T19:45:02.8003439Z 2025-05-07T19:45:02.8003442Z 2025-05-07T19:45:02.8003445Z 2025-05-07T19:45:02.8450067Z libcups-2.3.3 | 4.3 MB | | 0%  2025-05-07T19:45:02.8450419Z 2025-05-07T19:45:02.8450424Z 2025-05-07T19:45:02.8450428Z 2025-05-07T19:45:02.8450431Z 2025-05-07T19:45:02.8450436Z 2025-05-07T19:45:02.8450439Z 
2025-05-07T19:45:02.8450443Z 2025-05-07T19:45:02.8743775Z libopenblas-0.3.29 | 5.6 MB | #########4 | 94%  2025-05-07T19:45:02.8962850Z openjdk-23.0.2 | 181.4 MB | #6 | 16% 2025-05-07T19:45:02.9004052Z 2025-05-07T19:45:02.9004432Z bazel-7.5.0 | 47.4 MB | #########3 | 93%  2025-05-07T19:45:02.9004950Z 2025-05-07T19:45:02.9004972Z 2025-05-07T19:45:02.9004978Z 2025-05-07T19:45:02.9004982Z 2025-05-07T19:45:02.9005010Z 2025-05-07T19:45:02.9005016Z 2025-05-07T19:45:02.9005023Z 2025-05-07T19:45:02.9005031Z 2025-05-07T19:45:02.9718267Z libcups-2.3.3 | 4.3 MB | #########6 | 96%  2025-05-07T19:45:02.9718594Z 2025-05-07T19:45:02.9718599Z 2025-05-07T19:45:02.9718602Z 2025-05-07T19:45:02.9718606Z 2025-05-07T19:45:02.9718610Z 2025-05-07T19:45:02.9718634Z 2025-05-07T19:45:02.9718637Z 2025-05-07T19:45:02.9745131Z libopenblas-0.3.29 | 5.6 MB | ########## | 100%  2025-05-07T19:45:02.9818481Z openjdk-23.0.2 | 181.4 MB | #9 | 19% 2025-05-07T19:45:02.9818878Z 2025-05-07T19:45:02.9818924Z 2025-05-07T19:45:02.9818928Z 2025-05-07T19:45:02.9818931Z 2025-05-07T19:45:02.9818948Z 2025-05-07T19:45:02.9819083Z 2025-05-07T19:45:02.9819096Z 2025-05-07T19:45:02.9819108Z 2025-05-07T19:45:03.0135499Z libcups-2.3.3 | 4.3 MB | ########## | 100%  2025-05-07T19:45:03.0136123Z 2025-05-07T19:45:03.0136128Z 2025-05-07T19:45:03.0136153Z 2025-05-07T19:45:03.0136157Z 2025-05-07T19:45:03.0136160Z 2025-05-07T19:45:03.0136165Z 2025-05-07T19:45:03.0136409Z 2025-05-07T19:45:03.0136414Z 2025-05-07T19:45:03.0136418Z 2025-05-07T19:45:03.0136421Z 2025-05-07T19:45:03.0312166Z libprotobuf-5.29.3 | 3.2 MB | | 0%  2025-05-07T19:45:03.0313131Z 2025-05-07T19:45:03.0313146Z 2025-05-07T19:45:03.0313158Z 2025-05-07T19:45:03.0313168Z 2025-05-07T19:45:03.0313179Z 2025-05-07T19:45:03.0313189Z 2025-05-07T19:45:03.0313200Z 2025-05-07T19:45:03.0313210Z 2025-05-07T19:45:03.0313221Z 2025-05-07T19:45:03.0767182Z libglib-2.84.0 | 3.8 MB | | 0%  2025-05-07T19:45:03.1300969Z openjdk-23.0.2 | 181.4 MB | ##2 | 23% 
2025-05-07T19:45:03.1301315Z 2025-05-07T19:45:03.1301321Z 2025-05-07T19:45:03.1301325Z 2025-05-07T19:45:03.1301330Z 2025-05-07T19:45:03.1301335Z 2025-05-07T19:45:03.1301340Z 2025-05-07T19:45:03.1301344Z 2025-05-07T19:45:03.1301366Z 2025-05-07T19:45:03.1301370Z 2025-05-07T19:45:03.1301373Z 2025-05-07T19:45:03.1301736Z libprotobuf-5.29.3 | 3.2 MB | ########## | 100%  2025-05-07T19:45:03.1302081Z 2025-05-07T19:45:03.1302085Z 2025-05-07T19:45:03.1302088Z 2025-05-07T19:45:03.1302092Z 2025-05-07T19:45:03.1302096Z 2025-05-07T19:45:03.1302099Z 2025-05-07T19:45:03.1302103Z 2025-05-07T19:45:03.1302106Z 2025-05-07T19:45:03.1302109Z 2025-05-07T19:45:03.1302113Z 2025-05-07T19:45:03.1531008Z libprotobuf-5.29.3 | 3.2 MB | ########## | 100%  2025-05-07T19:45:03.1531391Z 2025-05-07T19:45:03.1531395Z 2025-05-07T19:45:03.1531399Z 2025-05-07T19:45:03.1531402Z 2025-05-07T19:45:03.1531406Z 2025-05-07T19:45:03.1531410Z 2025-05-07T19:45:03.1531413Z 2025-05-07T19:45:03.1531417Z 2025-05-07T19:45:03.1531420Z 2025-05-07T19:45:03.1531683Z libglib-2.84.0 | 3.8 MB | ########## | 100%  2025-05-07T19:45:03.1531997Z 2025-05-07T19:45:03.1532218Z 2025-05-07T19:45:03.1532221Z 2025-05-07T19:45:03.1532225Z 2025-05-07T19:45:03.1532228Z 2025-05-07T19:45:03.1532241Z 2025-05-07T19:45:03.1532245Z 2025-05-07T19:45:03.1532248Z 2025-05-07T19:45:03.1532252Z 2025-05-07T19:45:03.1670245Z libglib-2.84.0 | 3.8 MB | ########## | 100%  2025-05-07T19:45:03.1670594Z 2025-05-07T19:45:03.1670599Z 2025-05-07T19:45:03.1671705Z python-3.9.22 | 22.5 MB | ########## | 100%  2025-05-07T19:45:03.1671973Z 2025-05-07T19:45:03.1672110Z 2025-05-07T19:45:03.1741589Z python-3.9.22 | 22.5 MB | ########## | 100%  2025-05-07T19:45:03.1741882Z 2025-05-07T19:45:03.1741887Z 2025-05-07T19:45:03.1741890Z 2025-05-07T19:45:03.1741894Z 2025-05-07T19:45:03.1741898Z 2025-05-07T19:45:03.1741901Z 2025-05-07T19:45:03.1741905Z 2025-05-07T19:45:03.1741908Z 2025-05-07T19:45:03.1741911Z 2025-05-07T19:45:03.1741938Z 2025-05-07T19:45:03.1741941Z 
2025-05-07T19:45:03.2163752Z tk-8.6.13 | 3.2 MB | | 0%  2025-05-07T19:45:03.2164086Z 2025-05-07T19:45:03.2164100Z 2025-05-07T19:45:03.2164104Z 2025-05-07T19:45:03.2164108Z 2025-05-07T19:45:03.2164111Z 2025-05-07T19:45:03.2164115Z 2025-05-07T19:45:03.2164140Z 2025-05-07T19:45:03.2164143Z 2025-05-07T19:45:03.2164147Z 2025-05-07T19:45:03.2164150Z 2025-05-07T19:45:03.2164153Z 2025-05-07T19:45:03.2164157Z 2025-05-07T19:45:03.2242179Z harfbuzz-11.0.0 | 1.6 MB | | 1%  2025-05-07T19:45:03.2242549Z 2025-05-07T19:45:03.2242579Z 2025-05-07T19:45:03.2242582Z 2025-05-07T19:45:03.2242586Z 2025-05-07T19:45:03.2242590Z 2025-05-07T19:45:03.2242593Z 2025-05-07T19:45:03.2242597Z 2025-05-07T19:45:03.2242600Z 2025-05-07T19:45:03.2242604Z 2025-05-07T19:45:03.2242607Z 2025-05-07T19:45:03.2242611Z 2025-05-07T19:45:03.2242614Z 2025-05-07T19:45:03.2242618Z 2025-05-07T19:45:03.2330850Z font-ttf-ubuntu-0.83 | 1.5 MB | 1 | 1%  2025-05-07T19:45:03.2331396Z openjdk-23.0.2 | 181.4 MB | ##5 | 26% 2025-05-07T19:45:03.2331838Z 2025-05-07T19:45:03.2331845Z 2025-05-07T19:45:03.2331848Z 2025-05-07T19:45:03.2331852Z 2025-05-07T19:45:03.2701603Z icu-75.1 | 11.6 MB | ########## | 100%  2025-05-07T19:45:03.2701900Z 2025-05-07T19:45:03.2701905Z 2025-05-07T19:45:03.2701908Z 2025-05-07T19:45:03.2701912Z 2025-05-07T19:45:03.2701915Z 2025-05-07T19:45:03.2701919Z 2025-05-07T19:45:03.2701922Z 2025-05-07T19:45:03.2701926Z 2025-05-07T19:45:03.2701929Z 2025-05-07T19:45:03.2701933Z 2025-05-07T19:45:03.2701957Z 2025-05-07T19:45:03.2724241Z tk-8.6.13 | 3.2 MB | ########## | 100%  2025-05-07T19:45:03.2724563Z 2025-05-07T19:45:03.2724567Z 2025-05-07T19:45:03.2724571Z 2025-05-07T19:45:03.2724574Z 2025-05-07T19:45:03.2724578Z 2025-05-07T19:45:03.2724581Z 2025-05-07T19:45:03.2724610Z 2025-05-07T19:45:03.2724613Z 2025-05-07T19:45:03.2724629Z 2025-05-07T19:45:03.2724632Z 2025-05-07T19:45:03.2724636Z 2025-05-07T19:45:03.2724639Z 2025-05-07T19:45:03.2764799Z harfbuzz-11.0.0 | 1.6 MB | ########## | 100%  
2025-05-07T19:45:03.2765143Z 2025-05-07T19:45:03.2765250Z 2025-05-07T19:45:03.2765258Z 2025-05-07T19:45:03.2765266Z 2025-05-07T19:45:03.2765273Z 2025-05-07T19:45:03.2765277Z 2025-05-07T19:45:03.2765282Z 2025-05-07T19:45:03.2765287Z 2025-05-07T19:45:03.2765292Z 2025-05-07T19:45:03.2765296Z 2025-05-07T19:45:03.2765301Z 2025-05-07T19:45:03.2765305Z 2025-05-07T19:45:03.2765310Z 2025-05-07T19:45:03.3043434Z font-ttf-ubuntu-0.83 | 1.5 MB | ########## | 100%  2025-05-07T19:45:03.3043797Z 2025-05-07T19:45:03.3043990Z 2025-05-07T19:45:03.3043997Z 2025-05-07T19:45:03.3044002Z 2025-05-07T19:45:03.3044007Z 2025-05-07T19:45:03.3044011Z 2025-05-07T19:45:03.3044016Z 2025-05-07T19:45:03.3044020Z 2025-05-07T19:45:03.3044024Z 2025-05-07T19:45:03.3044842Z 2025-05-07T19:45:03.3044846Z 2025-05-07T19:45:03.3044849Z 2025-05-07T19:45:03.3044852Z 2025-05-07T19:45:03.3044864Z 2025-05-07T19:45:03.3353320Z libgfortran5-15.1.0 | 1.5 MB | 1 | 1%  2025-05-07T19:45:03.3353718Z 2025-05-07T19:45:03.3353723Z 2025-05-07T19:45:03.3353726Z 2025-05-07T19:45:03.3353730Z 2025-05-07T19:45:03.3353733Z 2025-05-07T19:45:03.3353737Z 2025-05-07T19:45:03.3353740Z 2025-05-07T19:45:03.3353744Z 2025-05-07T19:45:03.3353747Z 2025-05-07T19:45:03.3353751Z 2025-05-07T19:45:03.3353754Z 2025-05-07T19:45:03.3353758Z 2025-05-07T19:45:03.3353761Z 2025-05-07T19:45:03.3353765Z 2025-05-07T19:45:03.3353768Z 2025-05-07T19:45:03.3492416Z krb5-1.21.3 | 1.3 MB | 1 | 1%  2025-05-07T19:45:03.3645036Z openjdk-23.0.2 | 181.4 MB | ##8 | 28% 2025-05-07T19:45:03.3645477Z 2025-05-07T19:45:03.3645497Z 2025-05-07T19:45:03.3645501Z 2025-05-07T19:45:03.3645521Z 2025-05-07T19:45:03.3645526Z 2025-05-07T19:45:03.3645531Z 2025-05-07T19:45:03.3645535Z 2025-05-07T19:45:03.3645550Z 2025-05-07T19:45:03.3645555Z 2025-05-07T19:45:03.3645559Z 2025-05-07T19:45:03.3645603Z 2025-05-07T19:45:03.3645607Z 2025-05-07T19:45:03.3645611Z 2025-05-07T19:45:03.3645615Z 2025-05-07T19:45:03.3670399Z libgfortran5-15.1.0 | 1.5 MB | ########## | 100%  
2025-05-07T19:45:03.3670756Z 2025-05-07T19:45:03.3670761Z 2025-05-07T19:45:03.3670765Z 2025-05-07T19:45:03.3670768Z 2025-05-07T19:45:03.3670795Z 2025-05-07T19:45:03.3670799Z 2025-05-07T19:45:03.3670802Z 2025-05-07T19:45:03.3670806Z 2025-05-07T19:45:03.3670809Z 2025-05-07T19:45:03.3670813Z 2025-05-07T19:45:03.3670816Z 2025-05-07T19:45:03.3670820Z 2025-05-07T19:45:03.3670823Z 2025-05-07T19:45:03.3670826Z 2025-05-07T19:45:03.3670830Z 2025-05-07T19:45:03.3670839Z 2025-05-07T19:45:03.3777407Z libabseil-20250127.1 | 1.3 MB | 1 | 1%  2025-05-07T19:45:03.3777829Z 2025-05-07T19:45:03.3777834Z 2025-05-07T19:45:03.3778030Z 2025-05-07T19:45:03.3778036Z 2025-05-07T19:45:03.3778039Z 2025-05-07T19:45:03.3778043Z 2025-05-07T19:45:03.3778047Z 2025-05-07T19:45:03.3778050Z 2025-05-07T19:45:03.3778054Z 2025-05-07T19:45:03.3778057Z 2025-05-07T19:45:03.3778061Z 2025-05-07T19:45:03.3778064Z 2025-05-07T19:45:03.3778068Z 2025-05-07T19:45:03.3778071Z 2025-05-07T19:45:03.3778074Z 2025-05-07T19:45:03.4046542Z krb5-1.21.3 | 1.3 MB | ########## | 100%  2025-05-07T19:45:03.4047009Z 2025-05-07T19:45:03.4047027Z 2025-05-07T19:45:03.4047032Z 2025-05-07T19:45:03.4047037Z 2025-05-07T19:45:03.4047042Z 2025-05-07T19:45:03.4047047Z 2025-05-07T19:45:03.4047051Z 2025-05-07T19:45:03.4047055Z 2025-05-07T19:45:03.4047060Z 2025-05-07T19:45:03.4047064Z 2025-05-07T19:45:03.4047069Z 2025-05-07T19:45:03.4047073Z 2025-05-07T19:45:03.4047081Z 2025-05-07T19:45:03.4047085Z 2025-05-07T19:45:03.4047106Z 2025-05-07T19:45:03.4047111Z 2025-05-07T19:45:03.4047114Z 2025-05-07T19:45:03.4048187Z cairo-1.18.4 | 955 KB | 1 | 2%  2025-05-07T19:45:03.4048527Z 2025-05-07T19:45:03.4048538Z 2025-05-07T19:45:03.4048542Z 2025-05-07T19:45:03.4048545Z 2025-05-07T19:45:03.4048548Z 2025-05-07T19:45:03.4048552Z 2025-05-07T19:45:03.4048555Z 2025-05-07T19:45:03.4048585Z 2025-05-07T19:45:03.4048588Z 2025-05-07T19:45:03.4048592Z 2025-05-07T19:45:03.4048595Z 2025-05-07T19:45:03.4048599Z 2025-05-07T19:45:03.4048602Z 2025-05-07T19:45:03.4048606Z 
2025-05-07T19:45:03.4048609Z 2025-05-07T19:45:03.4049468Z 2025-05-07T19:45:03.4283005Z libabseil-20250127.1 | 1.3 MB | ########## | 100%  2025-05-07T19:45:03.4283425Z 2025-05-07T19:45:03.4283429Z 2025-05-07T19:45:03.4283433Z 2025-05-07T19:45:03.4283436Z 2025-05-07T19:45:03.4283440Z 2025-05-07T19:45:03.4283443Z 2025-05-07T19:45:03.4283625Z 2025-05-07T19:45:03.4283628Z 2025-05-07T19:45:03.4283632Z 2025-05-07T19:45:03.4283642Z 2025-05-07T19:45:03.4283646Z 2025-05-07T19:45:03.4283650Z 2025-05-07T19:45:03.4283653Z 2025-05-07T19:45:03.4283656Z 2025-05-07T19:45:03.4283660Z 2025-05-07T19:45:03.4283663Z 2025-05-07T19:45:03.4283666Z 2025-05-07T19:45:03.4283670Z 2025-05-07T19:45:03.4410748Z pcre2-10.44 | 934 KB | 1 | 2%  2025-05-07T19:45:03.4411161Z 2025-05-07T19:45:03.4411204Z 2025-05-07T19:45:03.4411208Z 2025-05-07T19:45:03.4411211Z 2025-05-07T19:45:03.4411215Z 2025-05-07T19:45:03.4411219Z 2025-05-07T19:45:03.4411250Z 2025-05-07T19:45:03.4411253Z 2025-05-07T19:45:03.4411256Z 2025-05-07T19:45:03.4411260Z 2025-05-07T19:45:03.4411263Z 2025-05-07T19:45:03.4411266Z 2025-05-07T19:45:03.4411275Z 2025-05-07T19:45:03.4411279Z 2025-05-07T19:45:03.4411282Z 2025-05-07T19:45:03.4411286Z 2025-05-07T19:45:03.4411289Z 2025-05-07T19:45:03.4490533Z cairo-1.18.4 | 955 KB | ########## | 100%  2025-05-07T19:45:03.4490935Z 2025-05-07T19:45:03.4490941Z 2025-05-07T19:45:03.4490945Z 2025-05-07T19:45:03.4490948Z 2025-05-07T19:45:03.4490952Z 2025-05-07T19:45:03.4490956Z 2025-05-07T19:45:03.4490959Z 2025-05-07T19:45:03.4490963Z 2025-05-07T19:45:03.4490966Z 2025-05-07T19:45:03.4490970Z 2025-05-07T19:45:03.4490973Z 2025-05-07T19:45:03.4490977Z 2025-05-07T19:45:03.4490980Z 2025-05-07T19:45:03.4490983Z 2025-05-07T19:45:03.4490987Z 2025-05-07T19:45:03.4490990Z 2025-05-07T19:45:03.4490993Z 2025-05-07T19:45:03.4493566Z 2025-05-07T19:45:03.4577017Z pcre2-10.44 | 934 KB | ########## | 100%  2025-05-07T19:45:03.4577409Z 2025-05-07T19:45:03.4577414Z 2025-05-07T19:45:03.4577418Z 2025-05-07T19:45:03.4577421Z 
2025-05-07T19:45:03.4577425Z 2025-05-07T19:45:03.4577428Z 2025-05-07T19:45:03.4577432Z 2025-05-07T19:45:03.4577435Z 2025-05-07T19:45:03.4577450Z 2025-05-07T19:45:03.4577454Z 2025-05-07T19:45:03.4577457Z 2025-05-07T19:45:03.4577632Z 2025-05-07T19:45:03.4577638Z 2025-05-07T19:45:03.4577641Z 2025-05-07T19:45:03.4577645Z 2025-05-07T19:45:03.4577648Z 2025-05-07T19:45:03.4577652Z 2025-05-07T19:45:03.4577655Z 2025-05-07T19:45:03.4577659Z 2025-05-07T19:45:03.4577961Z ... (more hidden) ... 2025-05-07T19:45:03.4838137Z openjdk-23.0.2 | 181.4 MB | ###1 | 31% 2025-05-07T19:45:03.4838486Z 2025-05-07T19:45:03.4838637Z 2025-05-07T19:45:03.4838645Z 2025-05-07T19:45:03.4838650Z 2025-05-07T19:45:03.4838654Z 2025-05-07T19:45:03.4838658Z 2025-05-07T19:45:03.4838663Z 2025-05-07T19:45:03.4838668Z 2025-05-07T19:45:03.4838672Z 2025-05-07T19:45:03.4838677Z 2025-05-07T19:45:03.4838682Z 2025-05-07T19:45:03.4838686Z 2025-05-07T19:45:03.4838691Z 2025-05-07T19:45:03.4838695Z 2025-05-07T19:45:03.4838700Z 2025-05-07T19:45:03.4838710Z 2025-05-07T19:45:03.4838730Z 2025-05-07T19:45:03.4838734Z 2025-05-07T19:45:03.4838739Z 2025-05-07T19:45:03.5423614Z ... (more hidden) ... 
2025-05-07T19:45:03.5423951Z 2025-05-07T19:45:03.5423956Z 2025-05-07T19:45:03.5423959Z 2025-05-07T19:45:03.5423963Z 2025-05-07T19:45:03.5423967Z 2025-05-07T19:45:03.5582323Z libgrpc-1.71.0 | 7.6 MB | ########## | 100%  2025-05-07T19:45:03.6386494Z openjdk-23.0.2 | 181.4 MB | ###3 | 34% 2025-05-07T19:45:03.6386831Z 2025-05-07T19:45:03.6741840Z bazel-7.5.0 | 47.4 MB | ########## | 100%  2025-05-07T19:45:03.7717226Z openjdk-23.0.2 | 181.4 MB | ###7 | 38% 2025-05-07T19:45:03.7717623Z 2025-05-07T19:45:03.7717843Z 2025-05-07T19:45:03.7717854Z 2025-05-07T19:45:03.7717858Z 2025-05-07T19:45:03.7717865Z 2025-05-07T19:45:03.7717870Z 2025-05-07T19:45:03.8342103Z openblas-0.3.29 | 5.8 MB | ########## | 100%  2025-05-07T19:45:03.8342416Z 2025-05-07T19:45:03.8342633Z 2025-05-07T19:45:03.8342638Z 2025-05-07T19:45:03.8342653Z 2025-05-07T19:45:03.8342669Z 2025-05-07T19:45:03.8342672Z 2025-05-07T19:45:03.8342676Z 2025-05-07T19:45:03.8577430Z libopenblas-0.3.29 | 5.6 MB | ########## | 100%  2025-05-07T19:45:03.9044511Z openjdk-23.0.2 | 181.4 MB | #### | 40% 2025-05-07T19:45:03.9045351Z 2025-05-07T19:45:03.9045367Z 2025-05-07T19:45:03.9045379Z 2025-05-07T19:45:03.9045390Z 2025-05-07T19:45:03.9045400Z 2025-05-07T19:45:03.9045411Z 2025-05-07T19:45:03.9045421Z 2025-05-07T19:45:03.9045431Z 2025-05-07T19:45:03.9581533Z libcups-2.3.3 | 4.3 MB | ########## | 100%  2025-05-07T19:45:04.1217559Z openjdk-23.0.2 | 181.4 MB | ####3 | 44% 2025-05-07T19:45:04.1941358Z openjdk-23.0.2 | 181.4 MB | ####6 | 46% 2025-05-07T19:45:04.1941655Z 2025-05-07T19:45:04.1941660Z 2025-05-07T19:45:04.1941664Z 2025-05-07T19:45:04.1941667Z 2025-05-07T19:45:04.1941672Z 2025-05-07T19:45:04.1941694Z 2025-05-07T19:45:04.1941699Z 2025-05-07T19:45:04.1941704Z 2025-05-07T19:45:04.1941725Z 2025-05-07T19:45:04.2174727Z libglib-2.84.0 | 3.8 MB | ########## | 100%  2025-05-07T19:45:04.2175659Z 2025-05-07T19:45:04.2175673Z 2025-05-07T19:45:04.2175686Z 2025-05-07T19:45:04.2175696Z 2025-05-07T19:45:04.2175707Z 
2025-05-07T19:45:04.2175717Z 2025-05-07T19:45:04.2175727Z 2025-05-07T19:45:04.2175738Z 2025-05-07T19:45:04.2175748Z 2025-05-07T19:45:04.2175787Z 2025-05-07T19:45:04.5287157Z libprotobuf-5.29.3 | 3.2 MB | ########## | 100%  2025-05-07T19:45:04.6285783Z openjdk-23.0.2 | 181.4 MB | ####8 | 49% 2025-05-07T19:45:04.6486438Z openjdk-23.0.2 | 181.4 MB | #####1 | 52% 2025-05-07T19:45:04.6486743Z 2025-05-07T19:45:04.6486772Z 2025-05-07T19:45:04.6486777Z 2025-05-07T19:45:04.6486782Z 2025-05-07T19:45:04.6486785Z 2025-05-07T19:45:04.6486789Z 2025-05-07T19:45:04.6486794Z 2025-05-07T19:45:04.6486820Z 2025-05-07T19:45:04.6486824Z 2025-05-07T19:45:04.6486827Z 2025-05-07T19:45:04.6487029Z 2025-05-07T19:45:04.6491645Z tk-8.6.13 | 3.2 MB | ########## | 100%  2025-05-07T19:45:04.6491952Z 2025-05-07T19:45:04.6491956Z 2025-05-07T19:45:04.6491960Z 2025-05-07T19:45:04.6491963Z 2025-05-07T19:45:04.6491977Z 2025-05-07T19:45:04.6491980Z 2025-05-07T19:45:04.6491984Z 2025-05-07T19:45:04.6491987Z 2025-05-07T19:45:04.6491991Z 2025-05-07T19:45:04.6491994Z 2025-05-07T19:45:04.6491998Z 2025-05-07T19:45:04.7317206Z tk-8.6.13 | 3.2 MB | ########## | 100%  2025-05-07T19:45:04.7801201Z openjdk-23.0.2 | 181.4 MB | #####3 | 54% 2025-05-07T19:45:04.7801629Z 2025-05-07T19:45:04.7801676Z 2025-05-07T19:45:04.7801682Z 2025-05-07T19:45:04.7801778Z 2025-05-07T19:45:04.7801785Z 2025-05-07T19:45:04.7801802Z 2025-05-07T19:45:04.7801806Z 2025-05-07T19:45:04.7801813Z 2025-05-07T19:45:04.7801836Z 2025-05-07T19:45:04.7801843Z 2025-05-07T19:45:04.7801847Z 2025-05-07T19:45:04.7801852Z 2025-05-07T19:45:04.7803610Z harfbuzz-11.0.0 | 1.6 MB | ########## | 100%  2025-05-07T19:45:04.7803962Z 2025-05-07T19:45:04.7803967Z 2025-05-07T19:45:04.7803971Z 2025-05-07T19:45:04.7803975Z 2025-05-07T19:45:04.7803978Z 2025-05-07T19:45:04.7804003Z 2025-05-07T19:45:04.7804007Z 2025-05-07T19:45:04.7804011Z 2025-05-07T19:45:04.7804014Z 2025-05-07T19:45:04.7804018Z 2025-05-07T19:45:04.7804021Z 2025-05-07T19:45:04.7804025Z 
2025-05-07T19:45:04.8317058Z harfbuzz-11.0.0 | 1.6 MB | ########## | 100%  2025-05-07T19:45:04.8489407Z openjdk-23.0.2 | 181.4 MB | #####7 | 58% 2025-05-07T19:45:04.8489744Z 2025-05-07T19:45:04.8489749Z 2025-05-07T19:45:04.8489752Z 2025-05-07T19:45:04.8489756Z 2025-05-07T19:45:04.8489759Z 2025-05-07T19:45:04.8489763Z 2025-05-07T19:45:04.8489767Z 2025-05-07T19:45:04.8489771Z 2025-05-07T19:45:04.8489985Z 2025-05-07T19:45:04.8489989Z 2025-05-07T19:45:04.8489994Z 2025-05-07T19:45:04.8490007Z 2025-05-07T19:45:04.8490010Z 2025-05-07T19:45:04.8492657Z font-ttf-ubuntu-0.83 | 1.5 MB | ########## | 100%  2025-05-07T19:45:04.8493044Z 2025-05-07T19:45:04.8493049Z 2025-05-07T19:45:04.8493053Z 2025-05-07T19:45:04.8493056Z 2025-05-07T19:45:04.8493060Z 2025-05-07T19:45:04.8493063Z 2025-05-07T19:45:04.8493067Z 2025-05-07T19:45:04.8493070Z 2025-05-07T19:45:04.8493074Z 2025-05-07T19:45:04.8493078Z 2025-05-07T19:45:04.8493081Z 2025-05-07T19:45:04.8493085Z 2025-05-07T19:45:04.8493096Z 2025-05-07T19:45:04.9344166Z font-ttf-ubuntu-0.83 | 1.5 MB | ########## | 100%  2025-05-07T19:45:04.9344547Z 2025-05-07T19:45:04.9344552Z 2025-05-07T19:45:04.9344556Z 2025-05-07T19:45:04.9344559Z 2025-05-07T19:45:04.9344563Z 2025-05-07T19:45:04.9344566Z 2025-05-07T19:45:04.9344570Z 2025-05-07T19:45:04.9344573Z 2025-05-07T19:45:04.9344593Z 2025-05-07T19:45:04.9344596Z 2025-05-07T19:45:04.9344626Z 2025-05-07T19:45:04.9344644Z 2025-05-07T19:45:04.9344647Z 2025-05-07T19:45:04.9344651Z 2025-05-07T19:45:04.9344979Z libgfortran5-15.1.0 | 1.5 MB | ########## | 100%  2025-05-07T19:45:04.9345323Z 2025-05-07T19:45:04.9345327Z 2025-05-07T19:45:04.9345330Z 2025-05-07T19:45:04.9345333Z 2025-05-07T19:45:04.9345337Z 2025-05-07T19:45:04.9345365Z 2025-05-07T19:45:04.9345369Z 2025-05-07T19:45:04.9345372Z 2025-05-07T19:45:04.9345376Z 2025-05-07T19:45:04.9345379Z 2025-05-07T19:45:04.9345383Z 2025-05-07T19:45:04.9345386Z 2025-05-07T19:45:04.9345389Z 2025-05-07T19:45:04.9345506Z 2025-05-07T19:45:04.9399678Z libgfortran5-15.1.0 
| 1.5 MB | ########## | 100%  2025-05-07T19:45:05.0670532Z openjdk-23.0.2 | 181.4 MB | ######2 | 62% 2025-05-07T19:45:05.0670820Z 2025-05-07T19:45:05.0670915Z 2025-05-07T19:45:05.0670919Z 2025-05-07T19:45:05.0670949Z 2025-05-07T19:45:05.0670953Z 2025-05-07T19:45:05.0670977Z 2025-05-07T19:45:05.0671247Z 2025-05-07T19:45:05.0671265Z 2025-05-07T19:45:05.0671268Z 2025-05-07T19:45:05.0671272Z 2025-05-07T19:45:05.0671275Z 2025-05-07T19:45:05.0671279Z 2025-05-07T19:45:05.0671282Z 2025-05-07T19:45:05.0671285Z 2025-05-07T19:45:05.0671289Z 2025-05-07T19:45:05.0672727Z krb5-1.21.3 | 1.3 MB | ########## | 100%  2025-05-07T19:45:05.0673057Z 2025-05-07T19:45:05.0673086Z 2025-05-07T19:45:05.0673090Z 2025-05-07T19:45:05.0673094Z 2025-05-07T19:45:05.0673098Z 2025-05-07T19:45:05.0673102Z 2025-05-07T19:45:05.0673116Z 2025-05-07T19:45:05.0673120Z 2025-05-07T19:45:05.0673123Z 2025-05-07T19:45:05.0673127Z 2025-05-07T19:45:05.0673130Z 2025-05-07T19:45:05.0673134Z 2025-05-07T19:45:05.0673137Z 2025-05-07T19:45:05.0673141Z 2025-05-07T19:45:05.0673144Z 2025-05-07T19:45:05.1076291Z krb5-1.21.3 | 1.3 MB | ########## | 100%  2025-05-07T19:45:05.2080730Z openjdk-23.0.2 | 181.4 MB | ######5 | 65% 2025-05-07T19:45:05.4442015Z openjdk-23.0.2 | 181.4 MB | ######9 | 70% 2025-05-07T19:45:05.6014964Z openjdk-23.0.2 | 181.4 MB | #######2 | 73% 2025-05-07T19:45:05.6015798Z 2025-05-07T19:45:05.6015812Z 2025-05-07T19:45:05.6015825Z 2025-05-07T19:45:05.6015836Z 2025-05-07T19:45:05.6015878Z 2025-05-07T19:45:05.6015891Z 2025-05-07T19:45:05.6015903Z 2025-05-07T19:45:05.6015914Z 2025-05-07T19:45:05.6015924Z 2025-05-07T19:45:05.6015935Z 2025-05-07T19:45:05.6015945Z 2025-05-07T19:45:05.6015955Z 2025-05-07T19:45:05.6015965Z 2025-05-07T19:45:05.6015975Z 2025-05-07T19:45:05.6015986Z 2025-05-07T19:45:05.6015996Z 2025-05-07T19:45:05.6017078Z libabseil-20250127.1 | 1.3 MB | ########## | 100%  2025-05-07T19:45:05.6017465Z 2025-05-07T19:45:05.6017469Z 2025-05-07T19:45:05.6017473Z 2025-05-07T19:45:05.6017476Z 
2025-05-07T19:45:05.6017774Z 2025-05-07T19:45:05.6017778Z 2025-05-07T19:45:05.6017781Z 2025-05-07T19:45:05.6017797Z 2025-05-07T19:45:05.6017801Z 2025-05-07T19:45:05.6017804Z 2025-05-07T19:45:05.6017807Z 2025-05-07T19:45:05.6017810Z 2025-05-07T19:45:05.6017814Z 2025-05-07T19:45:05.6017817Z 2025-05-07T19:45:05.6017820Z 2025-05-07T19:45:05.6017982Z 2025-05-07T19:45:05.6717661Z libabseil-20250127.1 | 1.3 MB | ########## | 100%  2025-05-07T19:45:05.6718563Z 2025-05-07T19:45:05.6718568Z 2025-05-07T19:45:05.6718572Z 2025-05-07T19:45:05.6718575Z 2025-05-07T19:45:05.6718579Z 2025-05-07T19:45:05.6718583Z 2025-05-07T19:45:05.6718586Z 2025-05-07T19:45:05.6718590Z 2025-05-07T19:45:05.6718594Z 2025-05-07T19:45:05.6718623Z 2025-05-07T19:45:05.6718627Z 2025-05-07T19:45:05.6718631Z 2025-05-07T19:45:05.6718634Z 2025-05-07T19:45:05.6718638Z 2025-05-07T19:45:05.6718641Z 2025-05-07T19:45:05.6718645Z 2025-05-07T19:45:05.6718649Z 2025-05-07T19:45:05.6718973Z cairo-1.18.4 | 955 KB | ########## | 100%  2025-05-07T19:45:05.6719393Z 2025-05-07T19:45:05.6719422Z 2025-05-07T19:45:05.6719426Z 2025-05-07T19:45:05.6719429Z 2025-05-07T19:45:05.6719433Z 2025-05-07T19:45:05.6719436Z 2025-05-07T19:45:05.6719440Z 2025-05-07T19:45:05.6719443Z 2025-05-07T19:45:05.6719446Z 2025-05-07T19:45:05.6719450Z 2025-05-07T19:45:05.6719453Z 2025-05-07T19:45:05.6719457Z 2025-05-07T19:45:05.6719460Z 2025-05-07T19:45:05.6719463Z 2025-05-07T19:45:05.6719467Z 2025-05-07T19:45:05.6719471Z 2025-05-07T19:45:05.6719474Z 2025-05-07T19:45:05.7453489Z cairo-1.18.4 | 955 KB | ########## | 100%  2025-05-07T19:45:05.8561295Z openjdk-23.0.2 | 181.4 MB | #######5 | 75% 2025-05-07T19:45:05.9011799Z openjdk-23.0.2 | 181.4 MB | #######8 | 78% 2025-05-07T19:45:05.9012667Z 2025-05-07T19:45:05.9012682Z 2025-05-07T19:45:05.9012693Z 2025-05-07T19:45:05.9012738Z 2025-05-07T19:45:05.9012750Z 2025-05-07T19:45:05.9012760Z 2025-05-07T19:45:05.9012771Z 2025-05-07T19:45:05.9013200Z 2025-05-07T19:45:05.9013214Z 2025-05-07T19:45:05.9013224Z 
2025-05-07T19:45:05.9013234Z 2025-05-07T19:45:05.9013244Z 2025-05-07T19:45:05.9013255Z 2025-05-07T19:45:05.9013265Z 2025-05-07T19:45:05.9013275Z 2025-05-07T19:45:05.9013322Z 2025-05-07T19:45:05.9013333Z 2025-05-07T19:45:05.9013343Z 2025-05-07T19:45:05.9014408Z pcre2-10.44 | 934 KB | ########## | 100%  2025-05-07T19:45:05.9015351Z 2025-05-07T19:45:05.9015362Z 2025-05-07T19:45:05.9015372Z 2025-05-07T19:45:05.9016023Z cmake-4.0.2 | 19.4 MB | ########## | 100%  2025-05-07T19:45:05.9016775Z 2025-05-07T19:45:05.9016786Z 2025-05-07T19:45:05.9016796Z 2025-05-07T19:45:05.9016806Z 2025-05-07T19:45:05.9016816Z 2025-05-07T19:45:05.9016826Z 2025-05-07T19:45:05.9016836Z 2025-05-07T19:45:05.9016846Z 2025-05-07T19:45:05.9016857Z 2025-05-07T19:45:05.9016882Z 2025-05-07T19:45:05.9016892Z 2025-05-07T19:45:05.9016903Z 2025-05-07T19:45:05.9016922Z 2025-05-07T19:45:05.9016933Z 2025-05-07T19:45:05.9016972Z 2025-05-07T19:45:05.9016982Z 2025-05-07T19:45:05.9016992Z 2025-05-07T19:45:05.9017003Z 2025-05-07T19:45:05.9532614Z pcre2-10.44 | 934 KB | ########## | 100%  2025-05-07T19:45:05.9533649Z 2025-05-07T19:45:05.9533694Z 2025-05-07T19:45:05.9533707Z 2025-05-07T19:45:05.9533717Z 2025-05-07T19:45:05.9533728Z 2025-05-07T19:45:05.9533738Z 2025-05-07T19:45:05.9533749Z 2025-05-07T19:45:05.9533759Z 2025-05-07T19:45:05.9533770Z 2025-05-07T19:45:05.9533780Z 2025-05-07T19:45:05.9533791Z 2025-05-07T19:45:05.9533801Z 2025-05-07T19:45:05.9533811Z 2025-05-07T19:45:05.9533822Z 2025-05-07T19:45:05.9533832Z 2025-05-07T19:45:05.9533842Z 2025-05-07T19:45:05.9533852Z 2025-05-07T19:45:05.9533862Z 2025-05-07T19:45:05.9533872Z 2025-05-07T19:45:05.9534638Z ... (more hidden) ... 
2025-05-07T19:45:05.9535919Z 2025-05-07T19:45:05.9535946Z 2025-05-07T19:45:05.9535957Z 2025-05-07T19:45:05.9535968Z 2025-05-07T19:45:05.9535978Z 2025-05-07T19:45:05.9535988Z 2025-05-07T19:45:05.9535998Z 2025-05-07T19:45:05.9536008Z 2025-05-07T19:45:05.9536019Z 2025-05-07T19:45:05.9536029Z 2025-05-07T19:45:05.9536040Z 2025-05-07T19:45:05.9536050Z 2025-05-07T19:45:05.9536060Z 2025-05-07T19:45:05.9536070Z 2025-05-07T19:45:05.9536080Z 2025-05-07T19:45:05.9536091Z 2025-05-07T19:45:05.9536131Z 2025-05-07T19:45:05.9536142Z 2025-05-07T19:45:05.9536152Z 2025-05-07T19:45:06.0533659Z ... (more hidden) ... 2025-05-07T19:45:06.0533992Z 2025-05-07T19:45:06.0534273Z 2025-05-07T19:45:06.0980861Z python-3.9.22 | 22.5 MB | ########## | 100%  2025-05-07T19:45:06.2006470Z openjdk-23.0.2 | 181.4 MB | ######## | 81% 2025-05-07T19:45:06.3007312Z openjdk-23.0.2 | 181.4 MB | ########4 | 84% 2025-05-07T19:45:06.4007975Z openjdk-23.0.2 | 181.4 MB | ########8 | 88% 2025-05-07T19:45:06.5010424Z openjdk-23.0.2 | 181.4 MB | #########1 | 91% 2025-05-07T19:45:06.6048859Z openjdk-23.0.2 | 181.4 MB | #########6 | 96% 2025-05-07T19:45:07.5147256Z openjdk-23.0.2 | 181.4 MB | #########9 | 100% 2025-05-07T19:45:07.5147568Z 2025-05-07T19:45:08.2200276Z bazel-7.5.0 | 47.4 MB | ########## | 100%  2025-05-07T19:45:08.9736707Z openjdk-23.0.2 | 181.4 MB | ########## | 100% 2025-05-07T19:45:08.9742179Z openjdk-23.0.2 | 181.4 MB | ########## | 100% 2025-05-07T19:45:08.9742961Z 2025-05-07T19:45:08.9742976Z 2025-05-07T19:45:08.9742987Z 2025-05-07T19:45:08.9742997Z 2025-05-07T19:45:08.9743008Z 2025-05-07T19:45:08.9743018Z 2025-05-07T19:45:08.9743030Z 2025-05-07T19:45:08.9743040Z 2025-05-07T19:45:08.9743050Z 2025-05-07T19:45:08.9743061Z 2025-05-07T19:45:08.9743107Z 2025-05-07T19:45:08.9743118Z 2025-05-07T19:45:08.9743128Z 2025-05-07T19:45:08.9743185Z 2025-05-07T19:45:08.9743195Z 2025-05-07T19:45:08.9743205Z 2025-05-07T19:45:08.9743718Z 2025-05-07T19:45:08.9743732Z 2025-05-07T19:45:08.9743743Z 
2025-05-07T19:45:08.9790641Z done 2025-05-07T19:45:09.2978008Z Preparing transaction: done 2025-05-07T19:45:12.8850369Z Verifying transaction: done 2025-05-07T19:45:15.6024874Z Executing transaction: done 2025-05-07T19:45:16.0242624Z [INSTALL] Adding symlink librhash.so.0, which is needed by CMake ... 2025-05-07T19:45:17.8670248Z + ln -s /github/home/miniconda/envs/build_binary/lib/librhash.so /github/home/miniconda/envs/build_binary/lib/librhash.so.0 2025-05-07T19:45:17.8670965Z 2025-05-07T19:45:17.8684032Z 2025-05-07T19:45:17.8709419Z [EXEC] [ATTEMPT 0/3] + conda run -n build_binary pip install build 2025-05-07T19:45:20.2777778Z WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager, possibly rendering your system unusable. It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv. Use the --root-user-action option if you know what you are doing and want to suppress this warning.
2025-05-07T19:45:20.2779350Z 2025-05-07T19:45:20.2779500Z Collecting build 2025-05-07T19:45:20.2780207Z Downloading build-1.2.2.post1-py3-none-any.whl.metadata (6.5 kB) 2025-05-07T19:45:20.2781097Z Requirement already satisfied: packaging>=19.1 in /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages (from build) (25.0) 2025-05-07T19:45:20.2782530Z Collecting pyproject_hooks (from build) 2025-05-07T19:45:20.2783052Z Downloading pyproject_hooks-1.2.0-py3-none-any.whl.metadata (1.3 kB) 2025-05-07T19:45:20.2783571Z Collecting importlib-metadata>=4.6 (from build) 2025-05-07T19:45:20.2784077Z Downloading importlib_metadata-8.7.0-py3-none-any.whl.metadata (4.8 kB) 2025-05-07T19:45:20.2784980Z Requirement already satisfied: tomli>=1.1.0 in /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages (from build) (2.2.1) 2025-05-07T19:45:20.2785785Z Collecting zipp>=3.20 (from importlib-metadata>=4.6->build) 2025-05-07T19:45:20.2786307Z Downloading zipp-3.21.0-py3-none-any.whl.metadata (3.7 kB) 2025-05-07T19:45:20.2786781Z Downloading build-1.2.2.post1-py3-none-any.whl (22 kB) 2025-05-07T19:45:20.2787286Z Downloading importlib_metadata-8.7.0-py3-none-any.whl (27 kB) 2025-05-07T19:45:20.2787887Z Downloading zipp-3.21.0-py3-none-any.whl (9.6 kB) 2025-05-07T19:45:20.2788321Z Downloading pyproject_hooks-1.2.0-py3-none-any.whl (10 kB) 2025-05-07T19:45:20.2788917Z Installing collected packages: zipp, pyproject_hooks, importlib-metadata, build 2025-05-07T19:45:20.2789422Z 2025-05-07T19:45:20.2789760Z Successfully installed build-1.2.2.post1 importlib-metadata-8.7.0 pyproject_hooks-1.2.0 zipp-3.21.0 2025-05-07T19:45:20.2790234Z 2025-05-07T19:45:22.1357059Z /github/home/miniconda/envs/build_binary/bin/make 2025-05-07T19:45:22.1357429Z 2025-05-07T19:45:22.2012940Z [CHECK] Binary make found in PATH 2025-05-07T19:45:23.9898824Z /github/home/miniconda/envs/build_binary/bin/cmake 2025-05-07T19:45:23.9900023Z 2025-05-07T19:45:24.0660725Z [CHECK] Binary cmake found 
in PATH 2025-05-07T19:45:25.8621221Z /github/home/miniconda/envs/build_binary/bin/ninja 2025-05-07T19:45:25.8621636Z 2025-05-07T19:45:25.9396378Z [CHECK] Binary ninja found in PATH 2025-05-07T19:45:27.8443089Z [CHECK] Python (sub-)package 'click' found ... 2025-05-07T19:45:29.8573014Z [CHECK] Python (sub-)package 'hypothesis' found ... 2025-05-07T19:45:31.7629638Z [CHECK] Python (sub-)package 'jinja2' found ... 2025-05-07T19:45:33.7397118Z [CHECK] Python (sub-)package 'skbuild' found ... 2025-05-07T19:45:35.6001245Z [CHECK] Python (sub-)package 'wheel' found ... 2025-05-07T19:45:35.6002326Z [INSTALL] Successfully installed all the build tools 2025-05-07T19:45:35.6083493Z ##[group]Run . $PRELUDE; install_cuda $BUILD_ENV 12.6.3 2025-05-07T19:45:35.6083994Z . $PRELUDE; install_cuda $BUILD_ENV 12.6.3 2025-05-07T19:45:35.6084748Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:45:35.6085144Z env: 2025-05-07T19:45:35.6085382Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:45:35.6085819Z BUILD_ENV: build_binary 2025-05-07T19:45:35.6086081Z BUILD_TARGET: default 2025-05-07T19:45:35.6086350Z BUILD_VARIANT: cuda 2025-05-07T19:45:35.6086600Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:45:35.6086884Z ##[endgroup] 2025-05-07T19:45:36.0301748Z ################################################################################ 2025-05-07T19:45:36.0302178Z # Install CUDA 2025-05-07T19:45:36.0302436Z # 2025-05-07T19:45:36.0318284Z # [2025-05-07T19:45:36.031Z] + install_cuda build_binary 12.6.3 2025-05-07T19:45:36.0319126Z ################################################################################ 2025-05-07T19:45:36.0319521Z 2025-05-07T19:45:36.0333390Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:45:36.1207385Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:45:36.1211460Z [SETUP] Cleaning up Conda packages ... 
2025-05-07T19:45:36.1213712Z + conda clean --packages --tarball -y 2025-05-07T19:45:36.1213979Z 2025-05-07T19:45:36.6497283Z Will remove 134 (483.0 MB) tarball(s). 2025-05-07T19:45:36.6498257Z Will remove 16 (73.2 MB) package(s). 2025-05-07T19:45:36.7061654Z 2025-05-07T19:45:36.7076859Z + conda clean --all -y 2025-05-07T19:45:36.7077285Z 2025-05-07T19:45:37.3243434Z There are no unused tarball(s) to remove. 2025-05-07T19:45:37.3810830Z Will remove 1 index cache(s). 2025-05-07T19:45:37.3811662Z There are no unused package(s) to remove. 2025-05-07T19:45:37.3812018Z There are no tempfile(s) to remove. 2025-05-07T19:45:37.3812368Z There are no logfile(s) to remove. 2025-05-07T19:45:37.3812569Z 2025-05-07T19:45:37.3822167Z [INSTALL] Installing CUDA 12.6.3 ... 2025-05-07T19:45:37.3850614Z [EXEC] [ATTEMPT 0/3] + conda install --force-reinstall -n build_binary -c conda-forge --override-channels -y cuda=12.6.3 2025-05-07T19:45:38.2361420Z Channels: 2025-05-07T19:45:38.2362139Z - conda-forge 2025-05-07T19:45:38.2366924Z Platform: linux-64 2025-05-07T19:45:48.0234595Z Collecting package metadata (repodata.json): - \ | / - \ | / - \ | / - \ | / done 2025-05-07T19:45:49.4614566Z Solving environment: \ | / - done 2025-05-07T19:45:49.5880628Z 2025-05-07T19:45:49.5881205Z ## Package Plan ## 2025-05-07T19:45:49.5881706Z 2025-05-07T19:45:49.5882709Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:45:49.5883745Z 2025-05-07T19:45:49.5884030Z added / updated specs: 2025-05-07T19:45:49.5884921Z - cuda=12.6.3 2025-05-07T19:45:49.5885324Z 2025-05-07T19:45:49.5885338Z 2025-05-07T19:45:49.5885727Z The following packages will be downloaded: 2025-05-07T19:45:49.5886401Z 2025-05-07T19:45:49.5886774Z package | build 2025-05-07T19:45:49.5887777Z ---------------------------|----------------- 2025-05-07T19:45:49.5888851Z attr-2.5.1 | h166bdaf_1 69 KB conda-forge 2025-05-07T19:45:49.5890140Z binutils-2.40 | h4852527_7 31 KB conda-forge 2025-05-07T19:45:49.5891489Z 
c-compiler-1.5.2 | h0b41bf4_0 6 KB conda-forge 2025-05-07T19:45:49.5892165Z cuda-12.6.3 | ha804496_0 26 KB conda-forge 2025-05-07T19:45:49.5892662Z cuda-cccl_linux-64-12.6.77 | ha770c72_0 1.0 MB conda-forge 2025-05-07T19:45:49.5893216Z cuda-command-line-tools-12.6.3| ha770c72_0 20 KB conda-forge 2025-05-07T19:45:49.5893812Z cuda-compiler-12.6.3 | hbad6d8a_0 20 KB conda-forge 2025-05-07T19:45:49.5894367Z cuda-crt-dev_linux-64-12.6.85| ha770c72_0 87 KB conda-forge 2025-05-07T19:45:49.5895375Z cuda-crt-tools-12.6.85 | ha770c72_0 26 KB conda-forge 2025-05-07T19:45:49.5895912Z cuda-cudart-12.6.77 | h5888daf_0 22 KB conda-forge 2025-05-07T19:45:49.5896414Z cuda-cudart-dev-12.6.77 | h5888daf_0 22 KB conda-forge 2025-05-07T19:45:49.5896985Z cuda-cudart-dev_linux-64-12.6.77| h3f2d84a_0 357 KB conda-forge 2025-05-07T19:45:49.5897543Z cuda-cudart-static-12.6.77 | h5888daf_0 22 KB conda-forge 2025-05-07T19:45:49.5898242Z cuda-cudart-static_linux-64-12.6.77| h3f2d84a_0 744 KB conda-forge 2025-05-07T19:45:49.5898809Z cuda-cudart_linux-64-12.6.77| h3f2d84a_0 184 KB conda-forge 2025-05-07T19:45:49.5899313Z cuda-cuobjdump-12.6.77 | hbd13f7d_1 241 KB conda-forge 2025-05-07T19:45:49.5899927Z cuda-cupti-12.6.80 | hbd13f7d_0 1.9 MB conda-forge 2025-05-07T19:45:49.5900615Z cuda-cupti-dev-12.6.80 | h5888daf_0 3.4 MB conda-forge 2025-05-07T19:45:49.5901156Z cuda-cuxxfilt-12.6.77 | hbd13f7d_1 211 KB conda-forge 2025-05-07T19:45:49.5901678Z cuda-driver-dev-12.6.77 | h5888daf_0 22 KB conda-forge 2025-05-07T19:45:49.5902261Z cuda-driver-dev_linux-64-12.6.77| h3f2d84a_0 35 KB conda-forge 2025-05-07T19:45:49.5902814Z cuda-gdb-12.6.77 | h50b4baa_1 370 KB conda-forge 2025-05-07T19:45:49.5903304Z cuda-libraries-12.6.3 | ha770c72_0 20 KB conda-forge 2025-05-07T19:45:49.5903864Z cuda-libraries-dev-12.6.3 | ha770c72_0 20 KB conda-forge 2025-05-07T19:45:49.5904385Z cuda-nsight-12.6.77 | h7938cbb_0 113.2 MB conda-forge 2025-05-07T19:45:49.5905052Z cuda-nvcc-12.6.85 | hcdd1206_0 23 KB conda-forge 
2025-05-07T19:45:49.5905591Z cuda-nvcc-dev_linux-64-12.6.85| he91c749_0 10.8 MB conda-forge 2025-05-07T19:45:49.5906124Z cuda-nvcc-impl-12.6.85 | h85509e4_0 25 KB conda-forge 2025-05-07T19:45:49.5906764Z cuda-nvcc-tools-12.6.85 | he02047a_0 23.0 MB conda-forge 2025-05-07T19:45:49.5907245Z cuda-nvcc_linux-64-12.6.85 | h04802cd_0 25 KB conda-forge 2025-05-07T19:45:49.5907758Z cuda-nvdisasm-12.6.77 | hbd13f7d_1 47.6 MB conda-forge 2025-05-07T19:45:49.5908227Z cuda-nvml-dev-12.6.77 | hbd13f7d_1 159 KB conda-forge 2025-05-07T19:45:49.5908716Z cuda-nvprof-12.6.80 | hbd13f7d_0 2.6 MB conda-forge 2025-05-07T19:45:49.5909203Z cuda-nvprune-12.6.77 | hbd13f7d_1 66 KB conda-forge 2025-05-07T19:45:49.5909667Z cuda-nvrtc-12.6.85 | hbd13f7d_0 17.3 MB conda-forge 2025-05-07T19:45:49.5910152Z cuda-nvrtc-dev-12.6.85 | h5888daf_0 31 KB conda-forge 2025-05-07T19:45:49.5910615Z cuda-nvtx-12.6.77 | hbd13f7d_0 31 KB conda-forge 2025-05-07T19:45:49.5911121Z cuda-nvvm-dev_linux-64-12.6.85| ha770c72_0 25 KB conda-forge 2025-05-07T19:45:49.5911647Z cuda-nvvm-impl-12.6.85 | he02047a_0 7.7 MB conda-forge 2025-05-07T19:45:49.5912128Z cuda-nvvm-tools-12.6.85 | he02047a_0 10.4 MB conda-forge 2025-05-07T19:45:49.5912591Z cuda-nvvp-12.6.80 | hbd13f7d_1 109.3 MB conda-forge 2025-05-07T19:45:49.5913091Z cuda-opencl-12.6.77 | hbd13f7d_0 29 KB conda-forge 2025-05-07T19:45:49.5913565Z cuda-opencl-dev-12.6.77 | h5888daf_0 93 KB conda-forge 2025-05-07T19:45:49.5914091Z cuda-profiler-api-12.6.77 | h7938cbb_0 22 KB conda-forge 2025-05-07T19:45:49.5914614Z cuda-runtime-12.6.3 | ha804496_0 19 KB conda-forge 2025-05-07T19:45:49.5915102Z cuda-sanitizer-api-12.6.77 | hbd13f7d_1 8.9 MB conda-forge 2025-05-07T19:45:49.5915731Z cuda-toolkit-12.6.3 | ha804496_0 19 KB conda-forge 2025-05-07T19:45:49.5916186Z cuda-tools-12.6.3 | ha770c72_0 19 KB conda-forge 2025-05-07T19:45:49.5916661Z cuda-version-12.6 | h7480c83_3 20 KB conda-forge 2025-05-07T19:45:49.5917133Z cuda-visual-tools-12.6.3 | ha770c72_0 19 KB conda-forge 
2025-05-07T19:45:49.5917636Z cxx-compiler-1.5.2 | hf52228f_0 6 KB conda-forge 2025-05-07T19:45:49.5918091Z dbus-1.13.6 | h5008d03_3 604 KB conda-forge 2025-05-07T19:45:49.5918488Z expat-2.7.0 | h5888daf_0 137 KB conda-forge 2025-05-07T19:45:49.5918911Z gcc-11.4.0 | h602e360_13 49 KB conda-forge 2025-05-07T19:45:49.5919325Z gds-tools-1.11.1.6 | h5888daf_4 37.8 MB conda-forge 2025-05-07T19:45:49.5919759Z gmp-6.3.0 | hac33072_2 449 KB conda-forge 2025-05-07T19:45:49.5920172Z gxx-11.4.0 | h602e360_13 49 KB conda-forge 2025-05-07T19:45:49.5920572Z libcap-2.75 | h39aace5_0 118 KB conda-forge 2025-05-07T19:45:49.5921019Z libcublas-12.6.4.1 | h5888daf_1 256.2 MB conda-forge 2025-05-07T19:45:49.5921474Z libcublas-dev-12.6.4.1 | h5888daf_1 88 KB conda-forge 2025-05-07T19:45:49.5921953Z libcufft-11.3.0.4 | hbd13f7d_0 156.2 MB conda-forge 2025-05-07T19:45:49.5922401Z libcufft-dev-11.3.0.4 | h5888daf_0 33 KB conda-forge 2025-05-07T19:45:49.5922877Z libcufile-1.11.1.6 | h12f29b5_4 900 KB conda-forge 2025-05-07T19:45:49.5923425Z libcufile-dev-1.11.1.6 | h5888daf_4 35 KB conda-forge 2025-05-07T19:45:49.5923885Z libcurand-10.3.7.77 | hbd13f7d_0 39.9 MB conda-forge 2025-05-07T19:45:49.5924374Z libcurand-dev-10.3.7.77 | h5888daf_0 262 KB conda-forge 2025-05-07T19:45:49.5924842Z libcusolver-11.7.1.2 | h5888daf_1 95.8 MB conda-forge 2025-05-07T19:45:49.5925343Z libcusolver-dev-11.7.1.2 | h5888daf_1 59 KB conda-forge 2025-05-07T19:45:49.5925841Z libcusparse-12.5.4.2 | hbd13f7d_0 118.6 MB conda-forge 2025-05-07T19:45:49.5926314Z libcusparse-dev-12.5.4.2 | h5888daf_0 51 KB conda-forge 2025-05-07T19:45:49.5926818Z libgcrypt-lib-1.11.0 | hb9d3cd8_2 572 KB conda-forge 2025-05-07T19:45:49.5927282Z libgpg-error-1.55 | h3f2d84a_0 305 KB conda-forge 2025-05-07T19:45:49.5927741Z libnl-3.11.0 | hb9d3cd8_0 724 KB conda-forge 2025-05-07T19:45:49.5928167Z libnpp-12.3.1.54 | h5888daf_0 93.4 MB conda-forge 2025-05-07T19:45:49.5928640Z libnpp-dev-12.3.1.54 | h5888daf_0 441 KB conda-forge 
2025-05-07T19:45:49.5929120Z libnuma-2.0.18 | h4ab18f5_2 42 KB conda-forge 2025-05-07T19:45:49.5929562Z libnvfatbin-12.6.77 | hbd13f7d_0 783 KB conda-forge 2025-05-07T19:45:49.5930059Z libnvfatbin-dev-12.6.77 | h5888daf_0 26 KB conda-forge 2025-05-07T19:45:49.5930535Z libnvjitlink-12.6.85 | hbd13f7d_0 14.9 MB conda-forge 2025-05-07T19:45:49.5931038Z libnvjitlink-dev-12.6.85 | h5888daf_0 25 KB conda-forge 2025-05-07T19:45:49.5931514Z libnvjpeg-12.3.3.54 | h5888daf_0 2.4 MB conda-forge 2025-05-07T19:45:49.5932003Z libnvjpeg-dev-12.3.3.54 | ha770c72_0 31 KB conda-forge 2025-05-07T19:45:49.5932496Z libsystemd0-257.4 | h4e0b6ca_1 477 KB conda-forge 2025-05-07T19:45:49.5932945Z libudev1-257.4 | hbe16f8c_1 141 KB conda-forge 2025-05-07T19:45:49.5933494Z libxkbcommon-1.9.2 | h65c71a3_0 660 KB conda-forge 2025-05-07T19:45:49.5933951Z libxkbfile-1.1.0 | h166bdaf_1 111 KB conda-forge 2025-05-07T19:45:49.5934413Z libxml2-2.13.8 | h4bc477f_0 675 KB conda-forge 2025-05-07T19:45:49.5934851Z lz4-c-1.10.0 | h5888daf_1 163 KB conda-forge 2025-05-07T19:45:49.5935306Z nsight-compute-2024.3.2.3 | hb5ebaad_0 443.1 MB conda-forge 2025-05-07T19:45:49.5935782Z nspr-4.36 | h5888daf_0 225 KB conda-forge 2025-05-07T19:45:49.5936174Z nss-3.111 | h159eef7_0 1.9 MB conda-forge 2025-05-07T19:45:49.5936602Z ocl-icd-2.3.3 | hb9d3cd8_0 104 KB conda-forge 2025-05-07T19:45:49.5937064Z opencl-headers-2024.10.24 | h5888daf_0 53 KB conda-forge 2025-05-07T19:45:49.5937554Z rdma-core-57.0 | h5888daf_0 1.2 MB conda-forge 2025-05-07T19:45:49.5938006Z wayland-1.23.1 | h3e06ad9_0 314 KB conda-forge 2025-05-07T19:45:49.5938428Z xcb-util-0.4.1 | hb711507_2 19 KB conda-forge 2025-05-07T19:45:49.5938896Z xcb-util-cursor-0.1.5 | hb9d3cd8_0 20 KB conda-forge 2025-05-07T19:45:49.5939364Z xcb-util-image-0.4.0 | hb711507_2 24 KB conda-forge 2025-05-07T19:45:49.5939964Z xcb-util-keysyms-0.4.1 | hb711507_0 14 KB conda-forge 2025-05-07T19:45:49.5940675Z xcb-util-renderutil-0.3.10 | hb711507_0 17 KB conda-forge 
2025-05-07T19:45:49.5941215Z xcb-util-wm-0.4.2 | hb711507_0 50 KB conda-forge 2025-05-07T19:45:49.5941834Z xkeyboard-config-2.44 | hb9d3cd8_0 384 KB conda-forge 2025-05-07T19:45:49.5942368Z xorg-libxcomposite-0.4.6 | hb9d3cd8_2 13 KB conda-forge 2025-05-07T19:45:49.5942933Z xorg-libxdamage-1.1.6 | hb9d3cd8_0 13 KB conda-forge 2025-05-07T19:45:49.5943396Z ------------------------------------------------------------ 2025-05-07T19:45:49.5943800Z Total: 1.59 GB 2025-05-07T19:45:49.5944034Z 2025-05-07T19:45:49.5944206Z The following NEW packages will be INSTALLED: 2025-05-07T19:45:49.5944453Z 2025-05-07T19:45:49.5944656Z attr conda-forge/linux-64::attr-2.5.1-h166bdaf_1 2025-05-07T19:45:49.5945145Z binutils conda-forge/linux-64::binutils-2.40-h4852527_7 2025-05-07T19:45:49.5945651Z c-compiler conda-forge/linux-64::c-compiler-1.5.2-h0b41bf4_0 2025-05-07T19:45:49.5946157Z cuda conda-forge/noarch::cuda-12.6.3-ha804496_0 2025-05-07T19:45:49.5946794Z cuda-cccl_linux-64 conda-forge/noarch::cuda-cccl_linux-64-12.6.77-ha770c72_0 2025-05-07T19:45:49.5947404Z cuda-command-line~ conda-forge/linux-64::cuda-command-line-tools-12.6.3-ha770c72_0 2025-05-07T19:45:49.5948028Z cuda-compiler conda-forge/noarch::cuda-compiler-12.6.3-hbad6d8a_0 2025-05-07T19:45:49.5948590Z cuda-crt-dev_linu~ conda-forge/noarch::cuda-crt-dev_linux-64-12.6.85-ha770c72_0 2025-05-07T19:45:49.5949191Z cuda-crt-tools conda-forge/linux-64::cuda-crt-tools-12.6.85-ha770c72_0 2025-05-07T19:45:49.5949747Z cuda-cudart conda-forge/linux-64::cuda-cudart-12.6.77-h5888daf_0 2025-05-07T19:45:49.5950282Z cuda-cudart-dev conda-forge/linux-64::cuda-cudart-dev-12.6.77-h5888daf_0 2025-05-07T19:45:49.5950903Z cuda-cudart-dev_l~ conda-forge/noarch::cuda-cudart-dev_linux-64-12.6.77-h3f2d84a_0 2025-05-07T19:45:49.5951527Z cuda-cudart-static conda-forge/linux-64::cuda-cudart-static-12.6.77-h5888daf_0 2025-05-07T19:45:49.5952192Z cuda-cudart-stati~ conda-forge/noarch::cuda-cudart-static_linux-64-12.6.77-h3f2d84a_0 
2025-05-07T19:45:49.5952859Z cuda-cudart_linux~ conda-forge/noarch::cuda-cudart_linux-64-12.6.77-h3f2d84a_0 2025-05-07T19:45:49.5953441Z cuda-cuobjdump conda-forge/linux-64::cuda-cuobjdump-12.6.77-hbd13f7d_1 2025-05-07T19:45:49.5954092Z cuda-cupti conda-forge/linux-64::cuda-cupti-12.6.80-hbd13f7d_0 2025-05-07T19:45:49.5954625Z cuda-cupti-dev conda-forge/linux-64::cuda-cupti-dev-12.6.80-h5888daf_0 2025-05-07T19:45:49.5955199Z cuda-cuxxfilt conda-forge/linux-64::cuda-cuxxfilt-12.6.77-hbd13f7d_1 2025-05-07T19:45:49.5955783Z cuda-driver-dev conda-forge/linux-64::cuda-driver-dev-12.6.77-h5888daf_0 2025-05-07T19:45:49.5956380Z cuda-driver-dev_l~ conda-forge/noarch::cuda-driver-dev_linux-64-12.6.77-h3f2d84a_0 2025-05-07T19:45:49.5956961Z cuda-gdb conda-forge/linux-64::cuda-gdb-12.6.77-h50b4baa_1 2025-05-07T19:45:49.5957472Z cuda-libraries conda-forge/linux-64::cuda-libraries-12.6.3-ha770c72_0 2025-05-07T19:45:49.5958084Z cuda-libraries-dev conda-forge/linux-64::cuda-libraries-dev-12.6.3-ha770c72_0 2025-05-07T19:45:49.5958674Z cuda-nsight conda-forge/linux-64::cuda-nsight-12.6.77-h7938cbb_0 2025-05-07T19:45:49.5959176Z cuda-nvcc conda-forge/linux-64::cuda-nvcc-12.6.85-hcdd1206_0 2025-05-07T19:45:49.5959738Z cuda-nvcc-dev_lin~ conda-forge/noarch::cuda-nvcc-dev_linux-64-12.6.85-he91c749_0 2025-05-07T19:45:49.5960324Z cuda-nvcc-impl conda-forge/linux-64::cuda-nvcc-impl-12.6.85-h85509e4_0 2025-05-07T19:45:49.5960913Z cuda-nvcc-tools conda-forge/linux-64::cuda-nvcc-tools-12.6.85-he02047a_0 2025-05-07T19:45:49.5961522Z cuda-nvcc_linux-64 conda-forge/linux-64::cuda-nvcc_linux-64-12.6.85-h04802cd_0 2025-05-07T19:45:49.5962088Z cuda-nvdisasm conda-forge/linux-64::cuda-nvdisasm-12.6.77-hbd13f7d_1 2025-05-07T19:45:49.5962658Z cuda-nvml-dev conda-forge/linux-64::cuda-nvml-dev-12.6.77-hbd13f7d_1 2025-05-07T19:45:49.5963177Z cuda-nvprof conda-forge/linux-64::cuda-nvprof-12.6.80-hbd13f7d_0 2025-05-07T19:45:49.5965050Z cuda-nvprune conda-forge/linux-64::cuda-nvprune-12.6.77-hbd13f7d_1 
2025-05-07T19:45:49.5965606Z cuda-nvrtc conda-forge/linux-64::cuda-nvrtc-12.6.85-hbd13f7d_0 2025-05-07T19:45:49.5966147Z cuda-nvrtc-dev conda-forge/linux-64::cuda-nvrtc-dev-12.6.85-h5888daf_0 2025-05-07T19:45:49.5966699Z cuda-nvtx conda-forge/linux-64::cuda-nvtx-12.6.77-hbd13f7d_0 2025-05-07T19:45:49.5967240Z cuda-nvvm-dev_lin~ conda-forge/noarch::cuda-nvvm-dev_linux-64-12.6.85-ha770c72_0 2025-05-07T19:45:49.5967842Z cuda-nvvm-impl conda-forge/linux-64::cuda-nvvm-impl-12.6.85-he02047a_0 2025-05-07T19:45:49.5968434Z cuda-nvvm-tools conda-forge/linux-64::cuda-nvvm-tools-12.6.85-he02047a_0 2025-05-07T19:45:49.5968957Z cuda-nvvp conda-forge/linux-64::cuda-nvvp-12.6.80-hbd13f7d_1 2025-05-07T19:45:49.5969485Z cuda-opencl conda-forge/linux-64::cuda-opencl-12.6.77-hbd13f7d_0 2025-05-07T19:45:49.5970026Z cuda-opencl-dev conda-forge/linux-64::cuda-opencl-dev-12.6.77-h5888daf_0 2025-05-07T19:45:49.5970644Z cuda-profiler-api conda-forge/linux-64::cuda-profiler-api-12.6.77-h7938cbb_0 2025-05-07T19:45:49.5971235Z cuda-runtime conda-forge/noarch::cuda-runtime-12.6.3-ha804496_0 2025-05-07T19:45:49.5971798Z cuda-sanitizer-api conda-forge/linux-64::cuda-sanitizer-api-12.6.77-hbd13f7d_1 2025-05-07T19:45:49.5972391Z cuda-toolkit conda-forge/noarch::cuda-toolkit-12.6.3-ha804496_0 2025-05-07T19:45:49.5972881Z cuda-tools conda-forge/linux-64::cuda-tools-12.6.3-ha770c72_0 2025-05-07T19:45:49.5973396Z cuda-version conda-forge/noarch::cuda-version-12.6-h7480c83_3 2025-05-07T19:45:49.5973962Z cuda-visual-tools conda-forge/linux-64::cuda-visual-tools-12.6.3-ha770c72_0 2025-05-07T19:45:49.5974718Z cxx-compiler conda-forge/linux-64::cxx-compiler-1.5.2-hf52228f_0 2025-05-07T19:45:49.5975229Z dbus conda-forge/linux-64::dbus-1.13.6-h5008d03_3 2025-05-07T19:45:49.5975671Z expat conda-forge/linux-64::expat-2.7.0-h5888daf_0 2025-05-07T19:45:49.5976116Z gcc conda-forge/linux-64::gcc-11.4.0-h602e360_13 2025-05-07T19:45:49.5976867Z gds-tools conda-forge/linux-64::gds-tools-1.11.1.6-h5888daf_4 
2025-05-07T19:45:49.5977340Z gmp conda-forge/linux-64::gmp-6.3.0-hac33072_2 2025-05-07T19:45:49.5977789Z gxx conda-forge/linux-64::gxx-11.4.0-h602e360_13 2025-05-07T19:45:49.5978227Z libcap conda-forge/linux-64::libcap-2.75-h39aace5_0 2025-05-07T19:45:49.5978743Z libcublas conda-forge/linux-64::libcublas-12.6.4.1-h5888daf_1 2025-05-07T19:45:49.5979298Z libcublas-dev conda-forge/linux-64::libcublas-dev-12.6.4.1-h5888daf_1 2025-05-07T19:45:49.5980026Z libcufft conda-forge/linux-64::libcufft-11.3.0.4-hbd13f7d_0 2025-05-07T19:45:49.5980590Z libcufft-dev conda-forge/linux-64::libcufft-dev-11.3.0.4-h5888daf_0 2025-05-07T19:45:49.5981200Z libcufile conda-forge/linux-64::libcufile-1.11.1.6-h12f29b5_4 2025-05-07T19:45:49.5981781Z libcufile-dev conda-forge/linux-64::libcufile-dev-1.11.1.6-h5888daf_4 2025-05-07T19:45:49.5982542Z libcurand conda-forge/linux-64::libcurand-10.3.7.77-hbd13f7d_0 2025-05-07T19:45:49.5983141Z libcurand-dev conda-forge/linux-64::libcurand-dev-10.3.7.77-h5888daf_0 2025-05-07T19:45:49.5983749Z libcusolver conda-forge/linux-64::libcusolver-11.7.1.2-h5888daf_1 2025-05-07T19:45:49.5984338Z libcusolver-dev conda-forge/linux-64::libcusolver-dev-11.7.1.2-h5888daf_1 2025-05-07T19:45:49.5984965Z libcusparse conda-forge/linux-64::libcusparse-12.5.4.2-hbd13f7d_0 2025-05-07T19:45:49.5985559Z libcusparse-dev conda-forge/linux-64::libcusparse-dev-12.5.4.2-h5888daf_0 2025-05-07T19:45:49.5986196Z libgcrypt-lib conda-forge/linux-64::libgcrypt-lib-1.11.0-hb9d3cd8_2 2025-05-07T19:45:49.5986786Z libgpg-error conda-forge/linux-64::libgpg-error-1.55-h3f2d84a_0 2025-05-07T19:45:49.5987450Z libnl conda-forge/linux-64::libnl-3.11.0-hb9d3cd8_0 2025-05-07T19:45:49.5987985Z libnpp conda-forge/linux-64::libnpp-12.3.1.54-h5888daf_0 2025-05-07T19:45:49.5988540Z libnpp-dev conda-forge/linux-64::libnpp-dev-12.3.1.54-h5888daf_0 2025-05-07T19:45:49.5989068Z libnuma conda-forge/linux-64::libnuma-2.0.18-h4ab18f5_2 2025-05-07T19:45:49.5989626Z libnvfatbin 
conda-forge/linux-64::libnvfatbin-12.6.77-hbd13f7d_0 2025-05-07T19:45:49.5990222Z libnvfatbin-dev conda-forge/linux-64::libnvfatbin-dev-12.6.77-h5888daf_0 2025-05-07T19:45:49.5990853Z libnvjitlink conda-forge/linux-64::libnvjitlink-12.6.85-hbd13f7d_0 2025-05-07T19:45:49.5991492Z libnvjitlink-dev conda-forge/linux-64::libnvjitlink-dev-12.6.85-h5888daf_0 2025-05-07T19:45:49.5992081Z libnvjpeg conda-forge/linux-64::libnvjpeg-12.3.3.54-h5888daf_0 2025-05-07T19:45:49.5992675Z libnvjpeg-dev conda-forge/linux-64::libnvjpeg-dev-12.3.3.54-ha770c72_0 2025-05-07T19:45:49.5993254Z libsystemd0 conda-forge/linux-64::libsystemd0-257.4-h4e0b6ca_1 2025-05-07T19:45:49.5993794Z libudev1 conda-forge/linux-64::libudev1-257.4-hbe16f8c_1 2025-05-07T19:45:49.5994348Z libxkbcommon conda-forge/linux-64::libxkbcommon-1.9.2-h65c71a3_0 2025-05-07T19:45:49.5994882Z libxkbfile conda-forge/linux-64::libxkbfile-1.1.0-h166bdaf_1 2025-05-07T19:45:49.5995405Z libxml2 conda-forge/linux-64::libxml2-2.13.8-h4bc477f_0 2025-05-07T19:45:49.5995869Z lz4-c conda-forge/linux-64::lz4-c-1.10.0-h5888daf_1 2025-05-07T19:45:49.5996428Z nsight-compute conda-forge/linux-64::nsight-compute-2024.3.2.3-hb5ebaad_0 2025-05-07T19:45:49.5996989Z nspr conda-forge/linux-64::nspr-4.36-h5888daf_0 2025-05-07T19:45:49.5997406Z nss conda-forge/linux-64::nss-3.111-h159eef7_0 2025-05-07T19:45:49.5997871Z ocl-icd conda-forge/linux-64::ocl-icd-2.3.3-hb9d3cd8_0 2025-05-07T19:45:49.5998426Z opencl-headers conda-forge/linux-64::opencl-headers-2024.10.24-h5888daf_0 2025-05-07T19:45:49.5999010Z rdma-core conda-forge/linux-64::rdma-core-57.0-h5888daf_0 2025-05-07T19:45:49.5999630Z wayland conda-forge/linux-64::wayland-1.23.1-h3e06ad9_0 2025-05-07T19:45:49.6000107Z xcb-util conda-forge/linux-64::xcb-util-0.4.1-hb711507_2 2025-05-07T19:45:49.6000674Z xcb-util-cursor conda-forge/linux-64::xcb-util-cursor-0.1.5-hb9d3cd8_0 2025-05-07T19:45:49.6001265Z xcb-util-image conda-forge/linux-64::xcb-util-image-0.4.0-hb711507_2 
2025-05-07T19:45:49.6001885Z xcb-util-keysyms conda-forge/linux-64::xcb-util-keysyms-0.4.1-hb711507_0 2025-05-07T19:45:49.6002549Z xcb-util-renderut~ conda-forge/linux-64::xcb-util-renderutil-0.3.10-hb711507_0 2025-05-07T19:45:49.6003138Z xcb-util-wm conda-forge/linux-64::xcb-util-wm-0.4.2-hb711507_0 2025-05-07T19:45:49.6003729Z xkeyboard-config conda-forge/linux-64::xkeyboard-config-2.44-hb9d3cd8_0 2025-05-07T19:45:49.6004371Z xorg-libxcomposite conda-forge/linux-64::xorg-libxcomposite-0.4.6-hb9d3cd8_2 2025-05-07T19:45:49.6005032Z xorg-libxdamage conda-forge/linux-64::xorg-libxdamage-1.1.6-hb9d3cd8_0 2025-05-07T19:45:49.6005391Z 2025-05-07T19:45:49.6005399Z 2025-05-07T19:45:49.6005404Z 2025-05-07T19:45:49.6005588Z Downloading and Extracting Packages: ...working... 2025-05-07T19:45:49.6006009Z nsight-compute-2024. | 443.1 MB | | 0% 2025-05-07T19:45:49.6006300Z 2025-05-07T19:45:49.6006643Z libcublas-12.6.4.1 | 256.2 MB | | 0%  2025-05-07T19:45:49.6006912Z 2025-05-07T19:45:49.6006916Z 2025-05-07T19:45:49.6007157Z libcufft-11.3.0.4 | 156.2 MB | | 0%  2025-05-07T19:45:49.6007461Z 2025-05-07T19:45:49.6007464Z 2025-05-07T19:45:49.6007468Z 2025-05-07T19:45:49.6021971Z libcusparse-12.5.4.2 | 118.6 MB | | 0%  2025-05-07T19:45:49.6022289Z 2025-05-07T19:45:49.6022293Z 2025-05-07T19:45:49.6022419Z 2025-05-07T19:45:49.6022423Z 2025-05-07T19:45:49.6031024Z cuda-nsight-12.6.77 | 113.2 MB | | 0%  2025-05-07T19:45:49.6031346Z 2025-05-07T19:45:49.6031349Z 2025-05-07T19:45:49.6031353Z 2025-05-07T19:45:49.6031356Z 2025-05-07T19:45:49.6031384Z 2025-05-07T19:45:49.6032145Z cuda-nvvp-12.6.80 | 109.3 MB | | 0%  2025-05-07T19:45:49.6032450Z 2025-05-07T19:45:49.6032454Z 2025-05-07T19:45:49.6032457Z 2025-05-07T19:45:49.6032461Z 2025-05-07T19:45:49.6032464Z 2025-05-07T19:45:49.6032467Z 2025-05-07T19:45:49.6033331Z libcusolver-11.7.1.2 | 95.8 MB | | 0%  2025-05-07T19:45:49.6033648Z 2025-05-07T19:45:49.6033652Z 2025-05-07T19:45:49.6033656Z 2025-05-07T19:45:49.6033659Z 
2025-05-07T19:45:49.6033663Z 2025-05-07T19:45:49.6033667Z 2025-05-07T19:45:49.6033670Z 2025-05-07T19:45:49.6034399Z libnpp-12.3.1.54 | 93.4 MB | | 0%  2025-05-07T19:45:49.6034693Z 2025-05-07T19:45:49.6034710Z 2025-05-07T19:45:49.6034714Z 2025-05-07T19:45:49.6034718Z 2025-05-07T19:45:49.6034721Z 2025-05-07T19:45:49.6034725Z 2025-05-07T19:45:49.6034729Z 2025-05-07T19:45:49.6034733Z 2025-05-07T19:45:49.6040677Z cuda-nvdisasm-12.6.7 | 47.6 MB | | 0%  2025-05-07T19:45:49.6041008Z 2025-05-07T19:45:49.6041012Z 2025-05-07T19:45:49.6041016Z 2025-05-07T19:45:49.6041029Z 2025-05-07T19:45:49.6041033Z 2025-05-07T19:45:49.6041036Z 2025-05-07T19:45:49.6041039Z 2025-05-07T19:45:49.6041042Z 2025-05-07T19:45:49.6041046Z 2025-05-07T19:45:49.6045892Z libcurand-10.3.7.77 | 39.9 MB | | 0%  2025-05-07T19:45:49.6046798Z 2025-05-07T19:45:49.6046809Z 2025-05-07T19:45:49.6046818Z 2025-05-07T19:45:49.6046828Z 2025-05-07T19:45:49.6046838Z 2025-05-07T19:45:49.6046849Z 2025-05-07T19:45:49.6046859Z 2025-05-07T19:45:49.6046869Z 2025-05-07T19:45:49.6046910Z 2025-05-07T19:45:49.6046922Z 2025-05-07T19:45:49.6047687Z gds-tools-1.11.1.6 | 37.8 MB | | 0%  2025-05-07T19:45:49.6048578Z 2025-05-07T19:45:49.6048588Z 2025-05-07T19:45:49.6048599Z 2025-05-07T19:45:49.6048608Z 2025-05-07T19:45:49.6048619Z 2025-05-07T19:45:49.6048629Z 2025-05-07T19:45:49.6048640Z 2025-05-07T19:45:49.6048841Z 2025-05-07T19:45:49.6048884Z 2025-05-07T19:45:49.6048895Z 2025-05-07T19:45:49.6048905Z 2025-05-07T19:45:49.6049631Z cuda-nvcc-tools-12.6 | 23.0 MB | | 0%  2025-05-07T19:45:49.6049965Z 2025-05-07T19:45:49.6049969Z 2025-05-07T19:45:49.6049972Z 2025-05-07T19:45:49.6049976Z 2025-05-07T19:45:49.6049981Z 2025-05-07T19:45:49.6050016Z 2025-05-07T19:45:49.6050019Z 2025-05-07T19:45:49.6050022Z 2025-05-07T19:45:49.6050026Z 2025-05-07T19:45:49.6050029Z 2025-05-07T19:45:49.6050032Z 2025-05-07T19:45:49.6050036Z 2025-05-07T19:45:49.6050325Z cuda-nvrtc-12.6.85 | 17.3 MB | | 0%  2025-05-07T19:45:49.6050650Z 2025-05-07T19:45:49.6050654Z 
2025-05-07T19:45:49.6050657Z 2025-05-07T19:45:49.6050683Z 2025-05-07T19:45:49.6050686Z 2025-05-07T19:45:49.6050690Z 2025-05-07T19:45:49.6050693Z 2025-05-07T19:45:49.6050696Z 2025-05-07T19:45:49.6050700Z 2025-05-07T19:45:49.6050703Z 2025-05-07T19:45:49.6050707Z 2025-05-07T19:45:49.6050713Z 2025-05-07T19:45:49.6050717Z 2025-05-07T19:45:49.6051020Z libnvjitlink-12.6.85 | 14.9 MB | | 0%  2025-05-07T19:45:49.6051358Z 2025-05-07T19:45:49.6051362Z 2025-05-07T19:45:49.6051365Z 2025-05-07T19:45:49.6051368Z 2025-05-07T19:45:49.6051372Z 2025-05-07T19:45:49.6051375Z 2025-05-07T19:45:49.6051379Z 2025-05-07T19:45:49.6051382Z 2025-05-07T19:45:49.6051385Z 2025-05-07T19:45:49.6051389Z 2025-05-07T19:45:49.6051392Z 2025-05-07T19:45:49.6051396Z 2025-05-07T19:45:49.6051399Z 2025-05-07T19:45:49.6051403Z 2025-05-07T19:45:49.6053857Z cuda-nvcc-dev_linux- | 10.8 MB | | 0%  2025-05-07T19:45:49.6054285Z 2025-05-07T19:45:49.6054301Z 2025-05-07T19:45:49.6054305Z 2025-05-07T19:45:49.6054308Z 2025-05-07T19:45:49.6054312Z 2025-05-07T19:45:49.6054315Z 2025-05-07T19:45:49.6054319Z 2025-05-07T19:45:49.6054323Z 2025-05-07T19:45:49.6054326Z 2025-05-07T19:45:49.6054333Z 2025-05-07T19:45:49.6054337Z 2025-05-07T19:45:49.6054340Z 2025-05-07T19:45:49.6054343Z 2025-05-07T19:45:49.6054347Z 2025-05-07T19:45:49.6054350Z 2025-05-07T19:45:49.6055396Z cuda-nvvm-tools-12.6 | 10.4 MB | | 0%  2025-05-07T19:45:49.6055774Z 2025-05-07T19:45:49.6055779Z 2025-05-07T19:45:49.6055784Z 2025-05-07T19:45:49.6055814Z 2025-05-07T19:45:49.6055819Z 2025-05-07T19:45:49.6055822Z 2025-05-07T19:45:49.6055843Z 2025-05-07T19:45:49.6055847Z 2025-05-07T19:45:49.6055852Z 2025-05-07T19:45:49.6055856Z 2025-05-07T19:45:49.6055860Z 2025-05-07T19:45:49.6055865Z 2025-05-07T19:45:49.6055869Z 2025-05-07T19:45:49.6055872Z 2025-05-07T19:45:49.6055905Z 2025-05-07T19:45:49.6055909Z 2025-05-07T19:45:49.6056244Z cuda-sanitizer-api-1 | 8.9 MB | | 0%  2025-05-07T19:45:49.6056613Z 2025-05-07T19:45:49.6056618Z 2025-05-07T19:45:49.6056621Z 
2025-05-07T19:45:49.6056625Z 2025-05-07T19:45:49.6056642Z 2025-05-07T19:45:49.6056646Z 2025-05-07T19:45:49.6056650Z 2025-05-07T19:45:49.6056653Z 2025-05-07T19:45:49.6056656Z 2025-05-07T19:45:49.6056660Z 2025-05-07T19:45:49.6056663Z 2025-05-07T19:45:49.6056666Z 2025-05-07T19:45:49.6056670Z 2025-05-07T19:45:49.6056674Z 2025-05-07T19:45:49.6056678Z 2025-05-07T19:45:49.6056681Z 2025-05-07T19:45:49.6056699Z 2025-05-07T19:45:49.6057039Z cuda-nvvm-impl-12.6. | 7.7 MB | | 0%  2025-05-07T19:45:49.6057377Z 2025-05-07T19:45:49.6057381Z 2025-05-07T19:45:49.6057384Z 2025-05-07T19:45:49.6057388Z 2025-05-07T19:45:49.6057391Z 2025-05-07T19:45:49.6057394Z 2025-05-07T19:45:49.6057403Z 2025-05-07T19:45:49.6057406Z 2025-05-07T19:45:49.6057410Z 2025-05-07T19:45:49.6057425Z 2025-05-07T19:45:49.6057428Z 2025-05-07T19:45:49.6057432Z 2025-05-07T19:45:49.6057435Z 2025-05-07T19:45:49.6057456Z 2025-05-07T19:45:49.6057459Z 2025-05-07T19:45:49.6057463Z 2025-05-07T19:45:49.6057466Z 2025-05-07T19:45:49.6057666Z 2025-05-07T19:45:49.6058065Z cuda-cupti-dev-12.6. | 3.4 MB | | 0%  2025-05-07T19:45:49.6058425Z 2025-05-07T19:45:49.6058449Z 2025-05-07T19:45:49.6058452Z 2025-05-07T19:45:49.6058456Z 2025-05-07T19:45:49.6058459Z 2025-05-07T19:45:49.6058462Z 2025-05-07T19:45:49.6058466Z 2025-05-07T19:45:49.6058469Z 2025-05-07T19:45:49.6058473Z 2025-05-07T19:45:49.6058476Z 2025-05-07T19:45:49.6058479Z 2025-05-07T19:45:49.6058483Z 2025-05-07T19:45:49.6058486Z 2025-05-07T19:45:49.6058489Z 2025-05-07T19:45:49.6058493Z 2025-05-07T19:45:49.6058496Z 2025-05-07T19:45:49.6058500Z 2025-05-07T19:45:49.6058503Z 2025-05-07T19:45:49.6058515Z 2025-05-07T19:45:49.6980042Z ... (more hidden) ... 2025-05-07T19:45:49.6983430Z nsight-compute-2024. 
| 443.1 MB | | 1% 2025-05-07T19:45:49.6985182Z 2025-05-07T19:45:49.6990812Z libcublas-12.6.4.1 | 256.2 MB | 1 | 1%  2025-05-07T19:45:49.6991113Z 2025-05-07T19:45:49.6991313Z 2025-05-07T19:45:49.7010689Z libcufft-11.3.0.4 | 156.2 MB | 1 | 1%  2025-05-07T19:45:49.7010997Z 2025-05-07T19:45:49.7011003Z 2025-05-07T19:45:49.7011091Z 2025-05-07T19:45:49.7024761Z libcusparse-12.5.4.2 | 118.6 MB | | 1%  2025-05-07T19:45:49.7025095Z 2025-05-07T19:45:49.7025100Z 2025-05-07T19:45:49.7025104Z 2025-05-07T19:45:49.7025495Z 2025-05-07T19:45:49.7981840Z cuda-nsight-12.6.77 | 113.2 MB | | 1%  2025-05-07T19:45:49.7983441Z nsight-compute-2024. | 443.1 MB | 1 | 2% 2025-05-07T19:45:49.7984373Z 2025-05-07T19:45:49.7991153Z libcublas-12.6.4.1 | 256.2 MB | 2 | 3%  2025-05-07T19:45:49.7991679Z 2025-05-07T19:45:49.7991694Z 2025-05-07T19:45:49.8017835Z libcufft-11.3.0.4 | 156.2 MB | 4 | 5%  2025-05-07T19:45:49.8018149Z 2025-05-07T19:45:49.8018155Z 2025-05-07T19:45:49.8018184Z 2025-05-07T19:45:49.8028256Z libcusparse-12.5.4.2 | 118.6 MB | 4 | 5%  2025-05-07T19:45:49.8028606Z 2025-05-07T19:45:49.8028613Z 2025-05-07T19:45:49.8028633Z 2025-05-07T19:45:49.8028638Z 2025-05-07T19:45:49.8987145Z cuda-nsight-12.6.77 | 113.2 MB | 4 | 4%  2025-05-07T19:45:49.8989091Z nsight-compute-2024. 
| 443.1 MB | 3 | 3% 2025-05-07T19:45:49.8990792Z 2025-05-07T19:45:49.8994365Z libcublas-12.6.4.1 | 256.2 MB | 4 | 5%  2025-05-07T19:45:49.8994645Z 2025-05-07T19:45:49.8995816Z 2025-05-07T19:45:49.9018238Z libcufft-11.3.0.4 | 156.2 MB | 8 | 8%  2025-05-07T19:45:49.9018544Z 2025-05-07T19:45:49.9018550Z 2025-05-07T19:45:49.9018574Z 2025-05-07T19:45:49.9033459Z libcusparse-12.5.4.2 | 118.6 MB | 8 | 9%  2025-05-07T19:45:49.9033862Z 2025-05-07T19:45:49.9033986Z 2025-05-07T19:45:49.9033991Z 2025-05-07T19:45:49.9034073Z 2025-05-07T19:45:49.9990223Z cuda-nsight-12.6.77 | 113.2 MB | 8 | 8%  2025-05-07T19:45:49.9990548Z 2025-05-07T19:45:50.0000973Z libcublas-12.6.4.1 | 256.2 MB | 6 | 7%  2025-05-07T19:45:50.0001314Z 2025-05-07T19:45:50.0001319Z 2025-05-07T19:45:50.0021231Z libcufft-11.3.0.4 | 156.2 MB | #1 | 12%  2025-05-07T19:45:50.0021579Z 2025-05-07T19:45:50.0021586Z 2025-05-07T19:45:50.0021593Z 2025-05-07T19:45:50.0036583Z libcusparse-12.5.4.2 | 118.6 MB | #3 | 13%  2025-05-07T19:45:50.0036918Z 2025-05-07T19:45:50.0036925Z 2025-05-07T19:45:50.0036930Z 2025-05-07T19:45:50.0036933Z 2025-05-07T19:45:50.0291916Z cuda-nsight-12.6.77 | 113.2 MB | #2 | 13%  2025-05-07T19:45:50.0993824Z nsight-compute-2024. | 443.1 MB | 4 | 4% 2025-05-07T19:45:50.0994442Z 2025-05-07T19:45:50.0999060Z libcublas-12.6.4.1 | 256.2 MB | 8 | 9%  2025-05-07T19:45:50.0999371Z 2025-05-07T19:45:50.0999379Z 2025-05-07T19:45:50.1021274Z libcufft-11.3.0.4 | 156.2 MB | #5 | 15%  2025-05-07T19:45:50.1021838Z 2025-05-07T19:45:50.1021846Z 2025-05-07T19:45:50.1021851Z 2025-05-07T19:45:50.1039148Z libcusparse-12.5.4.2 | 118.6 MB | #7 | 18%  2025-05-07T19:45:50.1039477Z 2025-05-07T19:45:50.1039497Z 2025-05-07T19:45:50.1039502Z 2025-05-07T19:45:50.1039507Z 2025-05-07T19:45:50.1291554Z cuda-nsight-12.6.77 | 113.2 MB | #6 | 17%  2025-05-07T19:45:50.2001325Z nsight-compute-2024. 
| 443.1 MB | 5 | 5% 2025-05-07T19:45:50.2001906Z 2025-05-07T19:45:50.2001918Z 2025-05-07T19:45:50.2022965Z libcufft-11.3.0.4 | 156.2 MB | #8 | 19%  2025-05-07T19:45:50.2023274Z 2025-05-07T19:45:50.2023279Z 2025-05-07T19:45:50.2023282Z 2025-05-07T19:45:50.2041907Z libcusparse-12.5.4.2 | 118.6 MB | ##2 | 23%  2025-05-07T19:45:50.2042241Z 2025-05-07T19:45:50.2042247Z 2025-05-07T19:45:50.2042252Z 2025-05-07T19:45:50.2042256Z 2025-05-07T19:45:50.2072108Z cuda-nsight-12.6.77 | 113.2 MB | ## | 21%  2025-05-07T19:45:50.2075732Z 2025-05-07T19:45:50.2292597Z libcublas-12.6.4.1 | 256.2 MB | # | 11%  2025-05-07T19:45:50.3044718Z nsight-compute-2024. | 443.1 MB | 6 | 6% 2025-05-07T19:45:50.3045050Z 2025-05-07T19:45:50.3045057Z 2025-05-07T19:45:50.3045093Z 2025-05-07T19:45:50.3045098Z 2025-05-07T19:45:50.3045457Z cuda-nsight-12.6.77 | 113.2 MB | ##5 | 25%  2025-05-07T19:45:50.3045837Z 2025-05-07T19:45:50.3046449Z 2025-05-07T19:45:50.3075900Z libcufft-11.3.0.4 | 156.2 MB | ##2 | 22%  2025-05-07T19:45:50.3076427Z 2025-05-07T19:45:50.3186305Z libcublas-12.6.4.1 | 256.2 MB | #2 | 12%  2025-05-07T19:45:50.3186608Z 2025-05-07T19:45:50.3186886Z 2025-05-07T19:45:50.3186897Z 2025-05-07T19:45:50.3291748Z libcusparse-12.5.4.2 | 118.6 MB | ##6 | 27%  2025-05-07T19:45:50.4045776Z nsight-compute-2024. | 443.1 MB | 7 | 8% 2025-05-07T19:45:50.4046529Z 2025-05-07T19:45:50.4046535Z 2025-05-07T19:45:50.4046572Z 2025-05-07T19:45:50.4046576Z 2025-05-07T19:45:50.4080701Z cuda-nsight-12.6.77 | 113.2 MB | ##9 | 30%  2025-05-07T19:45:50.4081606Z 2025-05-07T19:45:50.4188993Z libcublas-12.6.4.1 | 256.2 MB | #4 | 14%  2025-05-07T19:45:50.4189293Z 2025-05-07T19:45:50.4189306Z 2025-05-07T19:45:50.4189451Z 2025-05-07T19:45:50.4289476Z libcusparse-12.5.4.2 | 118.6 MB | ###1 | 32%  2025-05-07T19:45:50.4289830Z 2025-05-07T19:45:50.4289974Z 2025-05-07T19:45:50.4548820Z libcufft-11.3.0.4 | 156.2 MB | ##5 | 25%  2025-05-07T19:45:50.5050170Z nsight-compute-2024. 
| 443.1 MB | 8 | 9% 2025-05-07T19:45:50.5050690Z 2025-05-07T19:45:50.5050731Z 2025-05-07T19:45:50.5050737Z 2025-05-07T19:45:50.5050743Z 2025-05-07T19:45:50.5081280Z cuda-nsight-12.6.77 | 113.2 MB | ###4 | 35%  2025-05-07T19:45:50.5082608Z 2025-05-07T19:45:50.5200938Z libcublas-12.6.4.1 | 256.2 MB | #6 | 17%  2025-05-07T19:45:50.5201517Z 2025-05-07T19:45:50.5201555Z 2025-05-07T19:45:50.5201560Z 2025-05-07T19:45:50.5322767Z libcusparse-12.5.4.2 | 118.6 MB | ###5 | 36%  2025-05-07T19:45:50.5323106Z 2025-05-07T19:45:50.5323226Z 2025-05-07T19:45:50.5551156Z libcufft-11.3.0.4 | 156.2 MB | ##8 | 28%  2025-05-07T19:45:50.6156893Z nsight-compute-2024. | 443.1 MB | 9 | 10% 2025-05-07T19:45:50.6157494Z 2025-05-07T19:45:50.6157507Z 2025-05-07T19:45:50.6157516Z 2025-05-07T19:45:50.6157534Z 2025-05-07T19:45:50.6203497Z cuda-nsight-12.6.77 | 113.2 MB | ###8 | 39%  2025-05-07T19:45:50.6203831Z 2025-05-07T19:45:50.6203957Z 2025-05-07T19:45:50.6203978Z 2025-05-07T19:45:50.6228227Z libcusparse-12.5.4.2 | 118.6 MB | #### | 40%  2025-05-07T19:45:50.6228567Z 2025-05-07T19:45:50.6342261Z libcublas-12.6.4.1 | 256.2 MB | #8 | 19%  2025-05-07T19:45:50.6342561Z 2025-05-07T19:45:50.6342685Z 2025-05-07T19:45:50.6550092Z libcufft-11.3.0.4 | 156.2 MB | ###1 | 32%  2025-05-07T19:45:50.7204298Z nsight-compute-2024. 
| 443.1 MB | #1 | 11% 2025-05-07T19:45:50.7204788Z 2025-05-07T19:45:50.7204797Z 2025-05-07T19:45:50.7204808Z 2025-05-07T19:45:50.7207692Z libcusparse-12.5.4.2 | 118.6 MB | ####5 | 45%  2025-05-07T19:45:50.7208008Z 2025-05-07T19:45:50.7208016Z 2025-05-07T19:45:50.7208019Z 2025-05-07T19:45:50.7208023Z 2025-05-07T19:45:50.7228707Z cuda-nsight-12.6.77 | 113.2 MB | ####3 | 43%  2025-05-07T19:45:50.7229018Z 2025-05-07T19:45:50.7342633Z libcublas-12.6.4.1 | 256.2 MB | ## | 21%  2025-05-07T19:45:50.7342939Z 2025-05-07T19:45:50.7342958Z 2025-05-07T19:45:50.8212989Z libcufft-11.3.0.4 | 156.2 MB | ###5 | 36%  2025-05-07T19:45:50.8213345Z 2025-05-07T19:45:50.8213350Z 2025-05-07T19:45:50.8213354Z 2025-05-07T19:45:50.8213357Z 2025-05-07T19:45:50.8221940Z cuda-nsight-12.6.77 | 113.2 MB | ####8 | 49%  2025-05-07T19:45:50.8222250Z 2025-05-07T19:45:50.8222277Z 2025-05-07T19:45:50.8222290Z 2025-05-07T19:45:50.8229182Z libcusparse-12.5.4.2 | 118.6 MB | ##### | 50%  2025-05-07T19:45:50.8229514Z 2025-05-07T19:45:50.8466914Z libcublas-12.6.4.1 | 256.2 MB | ##3 | 24%  2025-05-07T19:45:50.8467213Z 2025-05-07T19:45:50.8467219Z 2025-05-07T19:45:50.8807926Z libcufft-11.3.0.4 | 156.2 MB | ###9 | 39%  2025-05-07T19:45:50.9303510Z nsight-compute-2024. | 443.1 MB | #2 | 12% 2025-05-07T19:45:50.9303824Z 2025-05-07T19:45:50.9303829Z 2025-05-07T19:45:50.9303833Z 2025-05-07T19:45:50.9303837Z 2025-05-07T19:45:50.9417833Z cuda-nsight-12.6.77 | 113.2 MB | #####3 | 53%  2025-05-07T19:45:50.9418411Z 2025-05-07T19:45:50.9418416Z 2025-05-07T19:45:50.9418420Z 2025-05-07T19:45:50.9471450Z libcusparse-12.5.4.2 | 118.6 MB | #####4 | 55%  2025-05-07T19:45:50.9472342Z 2025-05-07T19:45:50.9629282Z 2025-05-07T19:45:50.9629702Z libcufft-11.3.0.4 | 156.2 MB | ####2 | 42%  2025-05-07T19:45:50.9630014Z 2025-05-07T19:45:50.9811201Z libcublas-12.6.4.1 | 256.2 MB | ##6 | 26%  2025-05-07T19:45:51.0350976Z nsight-compute-2024. 
| 443.1 MB | #3 | 13% 2025-05-07T19:45:51.0351267Z 2025-05-07T19:45:51.0351272Z 2025-05-07T19:45:51.0351277Z 2025-05-07T19:45:51.0351280Z 2025-05-07T19:45:51.0517780Z cuda-nsight-12.6.77 | 113.2 MB | #####7 | 58%  2025-05-07T19:45:51.0518115Z 2025-05-07T19:45:51.0518122Z 2025-05-07T19:45:51.0518126Z 2025-05-07T19:45:51.0532607Z libcusparse-12.5.4.2 | 118.6 MB | #####8 | 59%  2025-05-07T19:45:51.0532933Z 2025-05-07T19:45:51.0532937Z 2025-05-07T19:45:51.0865270Z libcufft-11.3.0.4 | 156.2 MB | ####5 | 46%  2025-05-07T19:45:51.0899159Z nsight-compute-2024. | 443.1 MB | #4 | 14% 2025-05-07T19:45:51.0899703Z 2025-05-07T19:45:51.1355201Z libcublas-12.6.4.1 | 256.2 MB | ##8 | 28%  2025-05-07T19:45:51.1356035Z 2025-05-07T19:45:51.1356050Z 2025-05-07T19:45:51.1356110Z 2025-05-07T19:45:51.1356121Z 2025-05-07T19:45:51.1519728Z cuda-nsight-12.6.77 | 113.2 MB | ######2 | 63%  2025-05-07T19:45:51.1520457Z 2025-05-07T19:45:51.1520479Z 2025-05-07T19:45:51.1520495Z 2025-05-07T19:45:51.1867230Z libcusparse-12.5.4.2 | 118.6 MB | ######3 | 63%  2025-05-07T19:45:51.1900182Z nsight-compute-2024. | 443.1 MB | #5 | 16% 2025-05-07T19:45:51.1900843Z 2025-05-07T19:45:51.2123879Z libcublas-12.6.4.1 | 256.2 MB | ### | 30%  2025-05-07T19:45:51.2124178Z 2025-05-07T19:45:51.2124350Z 2025-05-07T19:45:51.2410459Z libcufft-11.3.0.4 | 156.2 MB | ####8 | 49%  2025-05-07T19:45:51.2410763Z 2025-05-07T19:45:51.2411003Z 2025-05-07T19:45:51.2411008Z 2025-05-07T19:45:51.2411014Z 2025-05-07T19:45:51.2697766Z cuda-nsight-12.6.77 | 113.2 MB | ######7 | 68%  2025-05-07T19:45:51.2698717Z 2025-05-07T19:45:51.2698731Z 2025-05-07T19:45:51.2698743Z 2025-05-07T19:45:51.2867668Z libcusparse-12.5.4.2 | 118.6 MB | ######7 | 67%  2025-05-07T19:45:51.3107268Z nsight-compute-2024. 
| 443.1 MB | #6 | 17% 2025-05-07T19:45:51.3107570Z 2025-05-07T19:45:51.3130578Z libcublas-12.6.4.1 | 256.2 MB | ###2 | 32%  2025-05-07T19:45:51.3130881Z 2025-05-07T19:45:51.3130910Z 2025-05-07T19:45:51.3487007Z libcufft-11.3.0.4 | 156.2 MB | #####1 | 52%  2025-05-07T19:45:51.3487788Z 2025-05-07T19:45:51.3487794Z 2025-05-07T19:45:51.3487799Z 2025-05-07T19:45:51.3487803Z 2025-05-07T19:45:51.3798572Z cuda-nsight-12.6.77 | 113.2 MB | #######2 | 72%  2025-05-07T19:45:51.3799211Z 2025-05-07T19:45:51.3799274Z 2025-05-07T19:45:51.3799281Z 2025-05-07T19:45:51.3867399Z libcusparse-12.5.4.2 | 118.6 MB | #######1 | 72%  2025-05-07T19:45:51.4132206Z nsight-compute-2024. | 443.1 MB | #7 | 18% 2025-05-07T19:45:51.4133026Z 2025-05-07T19:45:51.4133041Z 2025-05-07T19:45:51.4231539Z libcufft-11.3.0.4 | 156.2 MB | #####5 | 55%  2025-05-07T19:45:51.4232855Z 2025-05-07T19:45:51.4799176Z libcublas-12.6.4.1 | 256.2 MB | ###4 | 34%  2025-05-07T19:45:51.4799515Z 2025-05-07T19:45:51.4799521Z 2025-05-07T19:45:51.4799611Z 2025-05-07T19:45:51.4868074Z libcusparse-12.5.4.2 | 118.6 MB | #######6 | 77%  2025-05-07T19:45:51.5139786Z nsight-compute-2024. | 443.1 MB | #9 | 19% 2025-05-07T19:45:51.5140325Z 2025-05-07T19:45:51.5140385Z 2025-05-07T19:45:51.5140960Z libcufft-11.3.0.4 | 156.2 MB | #####8 | 58%  2025-05-07T19:45:51.5141302Z 2025-05-07T19:45:51.5141315Z 2025-05-07T19:45:51.5141323Z 2025-05-07T19:45:51.5141330Z 2025-05-07T19:45:51.5236137Z cuda-nsight-12.6.77 | 113.2 MB | #######6 | 77%  2025-05-07T19:45:51.5236739Z 2025-05-07T19:45:51.5837510Z libcublas-12.6.4.1 | 256.2 MB | ###6 | 36%  2025-05-07T19:45:51.5837814Z 2025-05-07T19:45:51.5837891Z 2025-05-07T19:45:51.5837896Z 2025-05-07T19:45:51.6019962Z libcusparse-12.5.4.2 | 118.6 MB | ######## | 81%  2025-05-07T19:45:51.6140920Z nsight-compute-2024. 
| 443.1 MB | ## | 20% 2025-05-07T19:45:51.6141247Z 2025-05-07T19:45:51.6141416Z 2025-05-07T19:45:51.6141737Z libcufft-11.3.0.4 | 156.2 MB | ######1 | 61%  2025-05-07T19:45:51.6142026Z 2025-05-07T19:45:51.6142032Z 2025-05-07T19:45:51.6142035Z 2025-05-07T19:45:51.6142039Z 2025-05-07T19:45:51.6429346Z cuda-nsight-12.6.77 | 113.2 MB | ########1 | 81%  2025-05-07T19:45:51.6429669Z 2025-05-07T19:45:51.7068653Z libcublas-12.6.4.1 | 256.2 MB | ###8 | 38%  2025-05-07T19:45:51.7084388Z nsight-compute-2024. | 443.1 MB | ##1 | 21% 2025-05-07T19:45:51.7084681Z 2025-05-07T19:45:51.7084721Z 2025-05-07T19:45:51.7085064Z 2025-05-07T19:45:51.7144027Z libcusparse-12.5.4.2 | 118.6 MB | ########5 | 85%  2025-05-07T19:45:51.7144929Z 2025-05-07T19:45:51.7144944Z 2025-05-07T19:45:51.7149266Z libcufft-11.3.0.4 | 156.2 MB | ######4 | 65%  2025-05-07T19:45:51.7149536Z 2025-05-07T19:45:51.7149574Z 2025-05-07T19:45:51.7149578Z 2025-05-07T19:45:51.7150871Z 2025-05-07T19:45:51.7570853Z cuda-nsight-12.6.77 | 113.2 MB | ########5 | 85%  2025-05-07T19:45:51.7571175Z 2025-05-07T19:45:51.8118166Z libcublas-12.6.4.1 | 256.2 MB | #### | 40%  2025-05-07T19:45:51.8142984Z nsight-compute-2024. | 443.1 MB | ##2 | 23% 2025-05-07T19:45:51.8143797Z 2025-05-07T19:45:51.8143953Z 2025-05-07T19:45:51.8151126Z libcufft-11.3.0.4 | 156.2 MB | ######7 | 68%  2025-05-07T19:45:51.8151429Z 2025-05-07T19:45:51.8151441Z 2025-05-07T19:45:51.8151446Z 2025-05-07T19:45:51.8153414Z 2025-05-07T19:45:51.8203445Z cuda-nsight-12.6.77 | 113.2 MB | ########9 | 90%  2025-05-07T19:45:51.8203812Z 2025-05-07T19:45:51.8203818Z 2025-05-07T19:45:51.8203824Z 2025-05-07T19:45:51.8662447Z libcusparse-12.5.4.2 | 118.6 MB | ########8 | 89%  2025-05-07T19:45:51.8663378Z 2025-05-07T19:45:51.9121911Z libcublas-12.6.4.1 | 256.2 MB | ####2 | 42%  2025-05-07T19:45:51.9146176Z nsight-compute-2024. 
| 443.1 MB | ##3 | 24% 2025-05-07T19:45:51.9146461Z 2025-05-07T19:45:51.9147304Z 2025-05-07T19:45:51.9152194Z libcufft-11.3.0.4 | 156.2 MB | #######1 | 71%  2025-05-07T19:45:51.9152495Z 2025-05-07T19:45:51.9152522Z 2025-05-07T19:45:51.9152529Z 2025-05-07T19:45:51.9152535Z 2025-05-07T19:45:51.9341293Z cuda-nsight-12.6.77 | 113.2 MB | #########4 | 94%  2025-05-07T19:45:51.9342207Z 2025-05-07T19:45:51.9342221Z 2025-05-07T19:45:51.9342232Z 2025-05-07T19:45:51.9816821Z libcusparse-12.5.4.2 | 118.6 MB | #########2 | 93%  2025-05-07T19:45:51.9817164Z 2025-05-07T19:45:52.0169246Z libcublas-12.6.4.1 | 256.2 MB | ####3 | 44%  2025-05-07T19:45:52.0169558Z 2025-05-07T19:45:52.0169788Z 2025-05-07T19:45:52.0169803Z 2025-05-07T19:45:52.0169812Z 2025-05-07T19:45:52.0170615Z cuda-nsight-12.6.77 | 113.2 MB | #########8 | 98%  2025-05-07T19:45:52.0229372Z nsight-compute-2024. | 443.1 MB | ##4 | 25% 2025-05-07T19:45:52.0229657Z 2025-05-07T19:45:52.0229663Z 2025-05-07T19:45:52.0381486Z libcufft-11.3.0.4 | 156.2 MB | #######4 | 74%  2025-05-07T19:45:52.0381789Z 2025-05-07T19:45:52.0381795Z 2025-05-07T19:45:52.0381800Z 2025-05-07T19:45:52.0817828Z libcusparse-12.5.4.2 | 118.6 MB | #########6 | 97%  2025-05-07T19:45:52.0818333Z 2025-05-07T19:45:52.1171137Z libcublas-12.6.4.1 | 256.2 MB | ####5 | 46%  2025-05-07T19:45:52.1383653Z nsight-compute-2024. | 443.1 MB | ##6 | 26% 2025-05-07T19:45:52.1384147Z 2025-05-07T19:45:52.1384155Z 2025-05-07T19:45:52.1818411Z libcufft-11.3.0.4 | 156.2 MB | #######7 | 77%  2025-05-07T19:45:52.1818976Z 2025-05-07T19:45:52.2385520Z libcublas-12.6.4.1 | 256.2 MB | ####9 | 50%  2025-05-07T19:45:52.2385832Z 2025-05-07T19:45:52.2385839Z 2025-05-07T19:45:52.2784955Z libcufft-11.3.0.4 | 156.2 MB | ########3 | 84%  2025-05-07T19:45:52.2868821Z nsight-compute-2024. 
| 443.1 MB | ##7 | 27% 2025-05-07T19:45:52.2869219Z 2025-05-07T19:45:52.3427953Z libcublas-12.6.4.1 | 256.2 MB | #####4 | 54%  2025-05-07T19:45:52.3428262Z 2025-05-07T19:45:52.3428267Z 2025-05-07T19:45:52.3847903Z libcufft-11.3.0.4 | 156.2 MB | ########8 | 89%  2025-05-07T19:45:52.3869546Z nsight-compute-2024. | 443.1 MB | ##8 | 29% 2025-05-07T19:45:52.3869851Z 2025-05-07T19:45:52.4571089Z libcublas-12.6.4.1 | 256.2 MB | #####8 | 59%  2025-05-07T19:45:52.4571394Z 2025-05-07T19:45:52.4571401Z 2025-05-07T19:45:52.4870922Z libcufft-11.3.0.4 | 156.2 MB | #########2 | 93%  2025-05-07T19:45:52.4871213Z 2025-05-07T19:45:52.5582634Z libcublas-12.6.4.1 | 256.2 MB | ######3 | 64%  2025-05-07T19:45:52.5582967Z 2025-05-07T19:45:52.5582982Z 2025-05-07T19:45:52.5845897Z libcufft-11.3.0.4 | 156.2 MB | #########7 | 98%  2025-05-07T19:45:52.5870405Z nsight-compute-2024. | 443.1 MB | ##9 | 30% 2025-05-07T19:45:52.5870704Z 2025-05-07T19:45:52.6846262Z libcublas-12.6.4.1 | 256.2 MB | ######8 | 68%  2025-05-07T19:45:52.6870512Z nsight-compute-2024. | 443.1 MB | ###1 | 32% 2025-05-07T19:45:52.6870823Z 2025-05-07T19:45:52.7846538Z libcublas-12.6.4.1 | 256.2 MB | #######3 | 73%  2025-05-07T19:45:52.7959134Z nsight-compute-2024. | 443.1 MB | ###4 | 34% 2025-05-07T19:45:52.7959751Z 2025-05-07T19:45:52.8847136Z libcublas-12.6.4.1 | 256.2 MB | #######7 | 77%  2025-05-07T19:45:52.8969941Z nsight-compute-2024. | 443.1 MB | ###6 | 37% 2025-05-07T19:45:52.8970416Z 2025-05-07T19:45:52.9409563Z libcublas-12.6.4.1 | 256.2 MB | ########1 | 82%  2025-05-07T19:45:52.9409907Z 2025-05-07T19:45:52.9409914Z 2025-05-07T19:45:52.9409919Z 2025-05-07T19:45:52.9409923Z 2025-05-07T19:45:52.9857043Z cuda-nsight-12.6.77 | 113.2 MB | ########## | 100%  2025-05-07T19:45:52.9875843Z nsight-compute-2024. 
| 443.1 MB | ###8 | 38% 2025-05-07T19:45:52.9876209Z 2025-05-07T19:45:52.9876697Z 2025-05-07T19:45:52.9876708Z 2025-05-07T19:45:52.9876713Z 2025-05-07T19:45:52.9876717Z 2025-05-07T19:45:52.9884529Z cuda-nvvp-12.6.80 | 109.3 MB | | 0%  2025-05-07T19:45:52.9884849Z 2025-05-07T19:45:52.9884854Z 2025-05-07T19:45:52.9884863Z 2025-05-07T19:45:53.0260439Z libcusparse-12.5.4.2 | 118.6 MB | ########## | 100%  2025-05-07T19:45:53.0260794Z 2025-05-07T19:45:53.0260937Z 2025-05-07T19:45:53.0260941Z 2025-05-07T19:45:53.0261008Z 2025-05-07T19:45:53.0261014Z 2025-05-07T19:45:53.0261035Z 2025-05-07T19:45:53.0879791Z libcusolver-11.7.1.2 | 95.8 MB | | 0%  2025-05-07T19:45:53.0880770Z 2025-05-07T19:45:53.0880814Z 2025-05-07T19:45:53.0880827Z 2025-05-07T19:45:53.0880837Z 2025-05-07T19:45:53.0880847Z 2025-05-07T19:45:53.1017233Z cuda-nvvp-12.6.80 | 109.3 MB | 8 | 8%  2025-05-07T19:45:53.1017588Z 2025-05-07T19:45:53.1093451Z libcublas-12.6.4.1 | 256.2 MB | ########5 | 86%  2025-05-07T19:45:53.1262187Z nsight-compute-2024. | 443.1 MB | #### | 40% 2025-05-07T19:45:53.1262517Z 2025-05-07T19:45:53.1262545Z 2025-05-07T19:45:53.1262549Z 2025-05-07T19:45:53.1262552Z 2025-05-07T19:45:53.1262556Z 2025-05-07T19:45:53.1262559Z 2025-05-07T19:45:53.1879933Z libcusolver-11.7.1.2 | 95.8 MB | 4 | 5%  2025-05-07T19:45:53.1880900Z 2025-05-07T19:45:53.1880916Z 2025-05-07T19:45:53.1880928Z 2025-05-07T19:45:53.1880969Z 2025-05-07T19:45:53.1880980Z 2025-05-07T19:45:53.2263408Z cuda-nvvp-12.6.80 | 109.3 MB | #3 | 13%  2025-05-07T19:45:53.2263733Z 2025-05-07T19:45:53.2263738Z 2025-05-07T19:45:53.2263970Z 2025-05-07T19:45:53.2263974Z 2025-05-07T19:45:53.2263977Z 2025-05-07T19:45:53.2263981Z 2025-05-07T19:45:53.2502345Z libcusolver-11.7.1.2 | 95.8 MB | 9 | 10%  2025-05-07T19:45:53.3091959Z nsight-compute-2024. 
| 443.1 MB | ####2 | 42% 2025-05-07T19:45:53.3092342Z 2025-05-07T19:45:53.3148327Z libcublas-12.6.4.1 | 256.2 MB | ########9 | 89%  2025-05-07T19:45:53.3148663Z 2025-05-07T19:45:53.3148669Z 2025-05-07T19:45:53.3148673Z 2025-05-07T19:45:53.3148678Z 2025-05-07T19:45:53.3148697Z 2025-05-07T19:45:53.3263770Z cuda-nvvp-12.6.80 | 109.3 MB | #8 | 18%  2025-05-07T19:45:53.3264108Z 2025-05-07T19:45:53.3264140Z 2025-05-07T19:45:53.3264146Z 2025-05-07T19:45:53.3264152Z 2025-05-07T19:45:53.3264158Z 2025-05-07T19:45:53.3264164Z 2025-05-07T19:45:53.3775240Z libcusolver-11.7.1.2 | 95.8 MB | #4 | 15%  2025-05-07T19:45:53.4150373Z nsight-compute-2024. | 443.1 MB | ####3 | 44% 2025-05-07T19:45:53.4150719Z 2025-05-07T19:45:53.4150742Z 2025-05-07T19:45:53.4150746Z 2025-05-07T19:45:53.4150749Z 2025-05-07T19:45:53.4151075Z 2025-05-07T19:45:53.4264830Z cuda-nvvp-12.6.80 | 109.3 MB | ##2 | 23%  2025-05-07T19:45:53.4265164Z 2025-05-07T19:45:53.4265171Z 2025-05-07T19:45:53.4265234Z 2025-05-07T19:45:53.4265238Z 2025-05-07T19:45:53.4265241Z 2025-05-07T19:45:53.4265246Z 2025-05-07T19:45:53.4674239Z libcusolver-11.7.1.2 | 95.8 MB | #9 | 20%  2025-05-07T19:45:53.4674578Z 2025-05-07T19:45:53.5016860Z libcublas-12.6.4.1 | 256.2 MB | #########2 | 92%  2025-05-07T19:45:53.5151188Z nsight-compute-2024. | 443.1 MB | ####5 | 45% 2025-05-07T19:45:53.5151513Z 2025-05-07T19:45:53.5151519Z 2025-05-07T19:45:53.5151523Z 2025-05-07T19:45:53.5151529Z 2025-05-07T19:45:53.5151999Z 2025-05-07T19:45:53.5268035Z cuda-nvvp-12.6.80 | 109.3 MB | ##7 | 28%  2025-05-07T19:45:53.5268395Z 2025-05-07T19:45:53.5268400Z 2025-05-07T19:45:53.5268433Z 2025-05-07T19:45:53.5268437Z 2025-05-07T19:45:53.5268441Z 2025-05-07T19:45:53.5268444Z 2025-05-07T19:45:53.6141307Z libcusolver-11.7.1.2 | 95.8 MB | ##4 | 25%  2025-05-07T19:45:53.6160559Z nsight-compute-2024. 
| 443.1 MB | ####6 | 46% 2025-05-07T19:45:53.6161842Z 2025-05-07T19:45:53.6161860Z 2025-05-07T19:45:53.6161872Z 2025-05-07T19:45:53.6161882Z 2025-05-07T19:45:53.6161893Z 2025-05-07T19:45:53.6188773Z cuda-nvvp-12.6.80 | 109.3 MB | ###2 | 32%  2025-05-07T19:45:53.6189099Z 2025-05-07T19:45:53.6268288Z libcublas-12.6.4.1 | 256.2 MB | #########5 | 95%  2025-05-07T19:45:53.6268620Z 2025-05-07T19:45:53.6268625Z 2025-05-07T19:45:53.6268630Z 2025-05-07T19:45:53.6268634Z 2025-05-07T19:45:53.6268637Z 2025-05-07T19:45:53.6268641Z 2025-05-07T19:45:53.7181264Z libcusolver-11.7.1.2 | 95.8 MB | ##9 | 29%  2025-05-07T19:45:53.7181653Z 2025-05-07T19:45:53.7181688Z 2025-05-07T19:45:53.7181713Z 2025-05-07T19:45:53.7181717Z 2025-05-07T19:45:53.7181722Z 2025-05-07T19:45:53.7271215Z cuda-nvvp-12.6.80 | 109.3 MB | ###7 | 37%  2025-05-07T19:45:53.7272110Z 2025-05-07T19:45:53.7272124Z 2025-05-07T19:45:53.7272136Z 2025-05-07T19:45:53.7272147Z 2025-05-07T19:45:53.7272188Z 2025-05-07T19:45:53.7272237Z 2025-05-07T19:45:53.7279848Z libcusolver-11.7.1.2 | 95.8 MB | ###4 | 34%  2025-05-07T19:45:53.7489537Z nsight-compute-2024. | 443.1 MB | ####7 | 48% 2025-05-07T19:45:53.7490455Z 2025-05-07T19:45:53.8182073Z libcublas-12.6.4.1 | 256.2 MB | #########7 | 98%  2025-05-07T19:45:53.8182390Z 2025-05-07T19:45:53.8182417Z 2025-05-07T19:45:53.8182421Z 2025-05-07T19:45:53.8182424Z 2025-05-07T19:45:53.8182428Z 2025-05-07T19:45:53.8271873Z cuda-nvvp-12.6.80 | 109.3 MB | ####2 | 42%  2025-05-07T19:45:53.8272216Z 2025-05-07T19:45:53.8272221Z 2025-05-07T19:45:53.8272225Z 2025-05-07T19:45:53.8272229Z 2025-05-07T19:45:53.8272462Z 2025-05-07T19:45:53.8273259Z 2025-05-07T19:45:53.8323002Z libcusolver-11.7.1.2 | 95.8 MB | ###9 | 39%  2025-05-07T19:45:53.8839530Z nsight-compute-2024. 
| 443.1 MB | ####9 | 49% 2025-05-07T19:45:53.8840037Z 2025-05-07T19:45:53.9182360Z libcublas-12.6.4.1 | 256.2 MB | #########9 | 100%  2025-05-07T19:45:53.9182685Z 2025-05-07T19:45:53.9182697Z 2025-05-07T19:45:53.9182701Z 2025-05-07T19:45:53.9182704Z 2025-05-07T19:45:53.9182708Z 2025-05-07T19:45:53.9276044Z cuda-nvvp-12.6.80 | 109.3 MB | ####7 | 48%  2025-05-07T19:45:53.9276443Z 2025-05-07T19:45:53.9276614Z 2025-05-07T19:45:53.9276621Z 2025-05-07T19:45:53.9276625Z 2025-05-07T19:45:53.9276630Z 2025-05-07T19:45:53.9276635Z 2025-05-07T19:45:53.9327792Z libcusolver-11.7.1.2 | 95.8 MB | ####4 | 45%  2025-05-07T19:45:54.0184525Z nsight-compute-2024. | 443.1 MB | ##### | 50% 2025-05-07T19:45:54.0185342Z 2025-05-07T19:45:54.0185357Z 2025-05-07T19:45:54.0185399Z 2025-05-07T19:45:54.0185411Z 2025-05-07T19:45:54.0185434Z 2025-05-07T19:45:54.0276783Z cuda-nvvp-12.6.80 | 109.3 MB | #####3 | 53%  2025-05-07T19:45:54.0277320Z 2025-05-07T19:45:54.0277329Z 2025-05-07T19:45:54.0277338Z 2025-05-07T19:45:54.0277348Z 2025-05-07T19:45:54.0277377Z 2025-05-07T19:45:54.0277385Z 2025-05-07T19:45:54.0327646Z libcusolver-11.7.1.2 | 95.8 MB | #####1 | 51%  2025-05-07T19:45:54.1278171Z nsight-compute-2024. | 443.1 MB | #####2 | 52% 2025-05-07T19:45:54.1278768Z 2025-05-07T19:45:54.1278787Z 2025-05-07T19:45:54.1278792Z 2025-05-07T19:45:54.1278797Z 2025-05-07T19:45:54.1278802Z 2025-05-07T19:45:54.1278808Z 2025-05-07T19:45:54.1308189Z libcusolver-11.7.1.2 | 95.8 MB | #####6 | 57%  2025-05-07T19:45:54.1308559Z 2025-05-07T19:45:54.1308564Z 2025-05-07T19:45:54.1308568Z 2025-05-07T19:45:54.1308571Z 2025-05-07T19:45:54.1308575Z 2025-05-07T19:45:54.1327295Z cuda-nvvp-12.6.80 | 109.3 MB | #####8 | 58%  2025-05-07T19:45:54.2329593Z nsight-compute-2024. | 443.1 MB | #####4 | 54% 2025-05-07T19:45:54.2343732Z nsight-compute-2024. 
| 443.1 MB | #####6 | 56% 2025-05-07T19:45:54.2344058Z 2025-05-07T19:45:54.2344063Z 2025-05-07T19:45:54.2344293Z 2025-05-07T19:45:54.2344304Z 2025-05-07T19:45:54.2344308Z 2025-05-07T19:45:54.2344602Z 2025-05-07T19:45:54.2618705Z libcusolver-11.7.1.2 | 95.8 MB | ######2 | 63%  2025-05-07T19:45:54.2619074Z 2025-05-07T19:45:54.2619079Z 2025-05-07T19:45:54.2619083Z 2025-05-07T19:45:54.2619086Z 2025-05-07T19:45:54.2619090Z 2025-05-07T19:45:54.3331625Z cuda-nvvp-12.6.80 | 109.3 MB | ######3 | 63%  2025-05-07T19:45:54.3436710Z nsight-compute-2024. | 443.1 MB | #####8 | 59% 2025-05-07T19:45:54.3437536Z 2025-05-07T19:45:54.3437550Z 2025-05-07T19:45:54.3437562Z 2025-05-07T19:45:54.3437573Z 2025-05-07T19:45:54.3437584Z 2025-05-07T19:45:54.3437594Z 2025-05-07T19:45:54.3620209Z libcusolver-11.7.1.2 | 95.8 MB | ######8 | 68%  2025-05-07T19:45:54.3620608Z 2025-05-07T19:45:54.3620613Z 2025-05-07T19:45:54.3620616Z 2025-05-07T19:45:54.3620620Z 2025-05-07T19:45:54.3620625Z 2025-05-07T19:45:54.3748082Z cuda-nvvp-12.6.80 | 109.3 MB | ######8 | 68%  2025-05-07T19:45:54.3749013Z 2025-05-07T19:45:54.3749027Z 2025-05-07T19:45:54.3749038Z 2025-05-07T19:45:54.3749061Z 2025-05-07T19:45:54.4436643Z cuda-nsight-12.6.77 | 113.2 MB | ########## | 100%  2025-05-07T19:45:54.4436986Z 2025-05-07T19:45:54.4436992Z 2025-05-07T19:45:54.4436995Z 2025-05-07T19:45:54.4437000Z 2025-05-07T19:45:54.4437005Z 2025-05-07T19:45:54.4437008Z 2025-05-07T19:45:54.4480734Z libcusolver-11.7.1.2 | 95.8 MB | #######4 | 75%  2025-05-07T19:45:54.4619538Z nsight-compute-2024. 
| 443.1 MB | ###### | 61% 2025-05-07T19:45:54.4619911Z 2025-05-07T19:45:54.4619916Z 2025-05-07T19:45:54.4619939Z 2025-05-07T19:45:54.4620157Z 2025-05-07T19:45:54.4620161Z 2025-05-07T19:45:54.5436691Z cuda-nvvp-12.6.80 | 109.3 MB | #######5 | 75%  2025-05-07T19:45:54.5437011Z 2025-05-07T19:45:54.5437016Z 2025-05-07T19:45:54.5437020Z 2025-05-07T19:45:54.5437024Z 2025-05-07T19:45:54.5437027Z 2025-05-07T19:45:54.5437057Z 2025-05-07T19:45:54.5510766Z libcusolver-11.7.1.2 | 95.8 MB | ########4 | 84%  2025-05-07T19:45:54.6184711Z nsight-compute-2024. | 443.1 MB | ######2 | 62% 2025-05-07T19:45:54.6185557Z 2025-05-07T19:45:54.6185570Z 2025-05-07T19:45:54.6185582Z 2025-05-07T19:45:54.6185593Z 2025-05-07T19:45:54.6185603Z 2025-05-07T19:45:54.6437875Z cuda-nvvp-12.6.80 | 109.3 MB | ######## | 81%  2025-05-07T19:45:54.6438191Z 2025-05-07T19:45:54.6438221Z 2025-05-07T19:45:54.6438226Z 2025-05-07T19:45:54.6438231Z 2025-05-07T19:45:54.6438235Z 2025-05-07T19:45:54.6438238Z 2025-05-07T19:45:54.6511811Z libcusolver-11.7.1.2 | 95.8 MB | #########2 | 92%  2025-05-07T19:45:54.7264356Z nsight-compute-2024. | 443.1 MB | ######4 | 64% 2025-05-07T19:45:54.7264650Z 2025-05-07T19:45:54.7264662Z 2025-05-07T19:45:54.7283158Z libcufft-11.3.0.4 | 156.2 MB | ########## | 100%  2025-05-07T19:45:54.7283475Z 2025-05-07T19:45:54.7283481Z 2025-05-07T19:45:54.7283507Z 2025-05-07T19:45:54.7283510Z 2025-05-07T19:45:54.7283514Z 2025-05-07T19:45:54.7561635Z cuda-nvvp-12.6.80 | 109.3 MB | ########5 | 86%  2025-05-07T19:45:54.7799220Z nsight-compute-2024. 
| 443.1 MB | ######6 | 67% 2025-05-07T19:45:54.7799508Z 2025-05-07T19:45:54.7799513Z 2025-05-07T19:45:54.7799517Z 2025-05-07T19:45:54.7799520Z 2025-05-07T19:45:54.7799524Z 2025-05-07T19:45:54.7799527Z 2025-05-07T19:45:54.8820029Z 2025-05-07T19:45:54.8821214Z libnpp-12.3.1.54 | 93.4 MB | | 0%  2025-05-07T19:45:54.8822326Z 2025-05-07T19:45:54.8822338Z 2025-05-07T19:45:54.8822349Z 2025-05-07T19:45:54.8822360Z 2025-05-07T19:45:54.8822401Z 2025-05-07T19:45:54.8822412Z 2025-05-07T19:45:54.8822422Z 2025-05-07T19:45:54.8823170Z libnpp-12.3.1.54 | 93.4 MB | 8 | 9%  2025-05-07T19:45:54.8889234Z nsight-compute-2024. | 443.1 MB | ######9 | 69% 2025-05-07T19:45:54.8889544Z 2025-05-07T19:45:54.8889779Z 2025-05-07T19:45:54.8889784Z 2025-05-07T19:45:54.8889813Z 2025-05-07T19:45:54.8889816Z 2025-05-07T19:45:54.9060269Z cuda-nvvp-12.6.80 | 109.3 MB | ######### | 90%  2025-05-07T19:45:54.9060641Z 2025-05-07T19:45:54.9060646Z 2025-05-07T19:45:54.9060651Z 2025-05-07T19:45:54.9060655Z 2025-05-07T19:45:54.9060659Z 2025-05-07T19:45:54.9060664Z 2025-05-07T19:45:54.9821714Z libcusolver-11.7.1.2 | 95.8 MB | #########9 | 99%  2025-05-07T19:45:54.9822061Z 2025-05-07T19:45:54.9822065Z 2025-05-07T19:45:54.9822069Z 2025-05-07T19:45:54.9822072Z 2025-05-07T19:45:54.9822076Z 2025-05-07T19:45:54.9822080Z 2025-05-07T19:45:54.9822083Z 2025-05-07T19:45:54.9822805Z libnpp-12.3.1.54 | 93.4 MB | #8 | 18%  2025-05-07T19:45:55.0085544Z nsight-compute-2024. 
| 443.1 MB | #######1 | 71% 2025-05-07T19:45:55.0086109Z 2025-05-07T19:45:55.0086124Z 2025-05-07T19:45:55.0086129Z 2025-05-07T19:45:55.0086134Z 2025-05-07T19:45:55.0086155Z 2025-05-07T19:45:55.0821734Z cuda-nvvp-12.6.80 | 109.3 MB | #########4 | 95%  2025-05-07T19:45:55.0822065Z 2025-05-07T19:45:55.0822070Z 2025-05-07T19:45:55.0822098Z 2025-05-07T19:45:55.0822102Z 2025-05-07T19:45:55.0822105Z 2025-05-07T19:45:55.0822108Z 2025-05-07T19:45:55.0822112Z 2025-05-07T19:45:55.0940209Z libnpp-12.3.1.54 | 93.4 MB | ##6 | 26%  2025-05-07T19:45:55.1892691Z nsight-compute-2024. | 443.1 MB | #######3 | 73% 2025-05-07T19:45:55.1893189Z 2025-05-07T19:45:55.1893199Z 2025-05-07T19:45:55.1893215Z 2025-05-07T19:45:55.1893221Z 2025-05-07T19:45:55.1893226Z 2025-05-07T19:45:55.1893231Z 2025-05-07T19:45:55.1893237Z 2025-05-07T19:45:55.2010145Z libnpp-12.3.1.54 | 93.4 MB | ###3 | 34%  2025-05-07T19:45:55.3011492Z nsight-compute-2024. | 443.1 MB | #######5 | 75% 2025-05-07T19:45:55.3809818Z nsight-compute-2024. | 443.1 MB | #######8 | 78% 2025-05-07T19:45:55.3810142Z 2025-05-07T19:45:55.3810202Z 2025-05-07T19:45:55.3810208Z 2025-05-07T19:45:55.3810228Z 2025-05-07T19:45:55.3810233Z 2025-05-07T19:45:55.3810238Z 2025-05-07T19:45:55.3810256Z 2025-05-07T19:45:55.4012531Z libnpp-12.3.1.54 | 93.4 MB | ####1 | 41%  2025-05-07T19:45:55.5013742Z nsight-compute-2024. | 443.1 MB | ######## | 81% 2025-05-07T19:45:55.5400570Z nsight-compute-2024. | 443.1 MB | ########3 | 84% 2025-05-07T19:45:55.5400903Z 2025-05-07T19:45:55.5400908Z 2025-05-07T19:45:55.5400914Z 2025-05-07T19:45:55.5400919Z 2025-05-07T19:45:55.5400922Z 2025-05-07T19:45:55.5400925Z 2025-05-07T19:45:55.5400929Z 2025-05-07T19:45:55.6014767Z libnpp-12.3.1.54 | 93.4 MB | ####7 | 47%  2025-05-07T19:45:55.6402447Z nsight-compute-2024. 
| 443.1 MB | ########6 | 86% 2025-05-07T19:45:55.6402758Z 2025-05-07T19:45:55.6402934Z 2025-05-07T19:45:55.6402949Z 2025-05-07T19:45:55.6403013Z 2025-05-07T19:45:55.6403024Z 2025-05-07T19:45:55.6403036Z 2025-05-07T19:45:55.6403085Z 2025-05-07T19:45:55.7404875Z libnpp-12.3.1.54 | 93.4 MB | #####7 | 58%  2025-05-07T19:45:55.7405292Z 2025-05-07T19:45:55.7405297Z 2025-05-07T19:45:55.7405302Z 2025-05-07T19:45:55.7405307Z 2025-05-07T19:45:55.7405331Z 2025-05-07T19:45:55.7405336Z 2025-05-07T19:45:55.7405341Z 2025-05-07T19:45:55.7457911Z libnpp-12.3.1.54 | 93.4 MB | ######9 | 70%  2025-05-07T19:45:55.8405664Z nsight-compute-2024. | 443.1 MB | ########8 | 89% 2025-05-07T19:45:55.8406008Z 2025-05-07T19:45:55.8406270Z 2025-05-07T19:45:55.8406278Z 2025-05-07T19:45:55.8406284Z 2025-05-07T19:45:55.8406290Z 2025-05-07T19:45:55.8406295Z 2025-05-07T19:45:55.8406324Z 2025-05-07T19:45:55.8495622Z libnpp-12.3.1.54 | 93.4 MB | ######## | 81%  2025-05-07T19:45:55.9117355Z nsight-compute-2024. | 443.1 MB | ######### | 91% 2025-05-07T19:45:55.9117699Z 2025-05-07T19:45:55.9117707Z 2025-05-07T19:45:55.9117711Z 2025-05-07T19:45:55.9197528Z libcusparse-12.5.4.2 | 118.6 MB | ########## | 100%  2025-05-07T19:45:55.9197869Z 2025-05-07T19:45:55.9197891Z 2025-05-07T19:45:55.9197896Z 2025-05-07T19:45:55.9197904Z 2025-05-07T19:45:55.9197909Z 2025-05-07T19:45:55.9197914Z 2025-05-07T19:45:55.9406685Z libcusolver-11.7.1.2 | 95.8 MB | ########## | 100%  2025-05-07T19:45:55.9407046Z 2025-05-07T19:45:55.9407053Z 2025-05-07T19:45:55.9407059Z 2025-05-07T19:45:55.9407077Z 2025-05-07T19:45:55.9407083Z 2025-05-07T19:45:55.9407089Z 2025-05-07T19:45:55.9407094Z 2025-05-07T19:45:55.9703360Z libnpp-12.3.1.54 | 93.4 MB | #########1 | 92%  2025-05-07T19:45:55.9703714Z 2025-05-07T19:45:55.9703754Z 2025-05-07T19:45:55.9703759Z 2025-05-07T19:45:55.9703763Z 2025-05-07T19:45:55.9703781Z 2025-05-07T19:45:55.9703785Z 2025-05-07T19:45:55.9703789Z 2025-05-07T19:45:55.9703793Z 2025-05-07T19:45:55.9943349Z cuda-nvdisasm-12.6.7 
| 47.6 MB | | 0%  2025-05-07T19:45:56.0704562Z nsight-compute-2024. | 443.1 MB | #########2 | 93% 2025-05-07T19:45:56.0704894Z 2025-05-07T19:45:56.0704900Z 2025-05-07T19:45:56.0704927Z 2025-05-07T19:45:56.0704932Z 2025-05-07T19:45:56.0704935Z 2025-05-07T19:45:56.0704939Z 2025-05-07T19:45:56.0704943Z 2025-05-07T19:45:56.0704947Z 2025-05-07T19:45:56.1231503Z cuda-nvdisasm-12.6.7 | 47.6 MB | #4 | 15%  2025-05-07T19:45:56.1705240Z nsight-compute-2024. | 443.1 MB | #########5 | 95% 2025-05-07T19:45:56.1705541Z 2025-05-07T19:45:56.1705573Z 2025-05-07T19:45:56.1705578Z 2025-05-07T19:45:56.1705583Z 2025-05-07T19:45:56.1705588Z 2025-05-07T19:45:56.1705605Z 2025-05-07T19:45:56.1705610Z 2025-05-07T19:45:56.1705871Z 2025-05-07T19:45:56.2233772Z cuda-nvdisasm-12.6.7 | 47.6 MB | ###4 | 34%  2025-05-07T19:45:56.2399908Z nsight-compute-2024. | 443.1 MB | #########6 | 97% 2025-05-07T19:45:56.2400330Z 2025-05-07T19:45:56.2400467Z 2025-05-07T19:45:56.2400524Z 2025-05-07T19:45:56.2400600Z 2025-05-07T19:45:56.2400644Z 2025-05-07T19:45:56.2400996Z cuda-nvvp-12.6.80 | 109.3 MB | ########## | 100%  2025-05-07T19:45:56.2401304Z 2025-05-07T19:45:56.2401308Z 2025-05-07T19:45:56.2401312Z 2025-05-07T19:45:56.2401315Z 2025-05-07T19:45:56.2401318Z 2025-05-07T19:45:56.2705570Z cuda-nvvp-12.6.80 | 109.3 MB | ########## | 100%  2025-05-07T19:45:56.2705910Z 2025-05-07T19:45:56.2705933Z 2025-05-07T19:45:56.2705938Z 2025-05-07T19:45:56.2705942Z 2025-05-07T19:45:56.2705945Z 2025-05-07T19:45:56.2705950Z 2025-05-07T19:45:56.2705954Z 2025-05-07T19:45:56.2705958Z 2025-05-07T19:45:56.2942749Z cuda-nvdisasm-12.6.7 | 47.6 MB | ##### | 50%  2025-05-07T19:45:56.2943168Z 2025-05-07T19:45:56.2943173Z 2025-05-07T19:45:56.2943177Z 2025-05-07T19:45:56.2943181Z 2025-05-07T19:45:56.2943184Z 2025-05-07T19:45:56.2943188Z 2025-05-07T19:45:56.2943191Z 2025-05-07T19:45:56.2943196Z 2025-05-07T19:45:56.2943225Z 2025-05-07T19:45:56.3467275Z libcurand-10.3.7.77 | 39.9 MB | | 0%  2025-05-07T19:45:56.3863751Z 
nsight-compute-2024. | 443.1 MB | #########8 | 99% 2025-05-07T19:45:56.3864074Z 2025-05-07T19:45:56.3864080Z 2025-05-07T19:45:56.3864084Z 2025-05-07T19:45:56.3864087Z 2025-05-07T19:45:56.3864091Z 2025-05-07T19:45:56.3864094Z 2025-05-07T19:45:56.3864098Z 2025-05-07T19:45:56.3864102Z 2025-05-07T19:45:56.3945112Z cuda-nvdisasm-12.6.7 | 47.6 MB | ######4 | 65%  2025-05-07T19:45:56.3946108Z 2025-05-07T19:45:56.3946122Z 2025-05-07T19:45:56.3946132Z 2025-05-07T19:45:56.3946143Z 2025-05-07T19:45:56.3946153Z 2025-05-07T19:45:56.3946200Z 2025-05-07T19:45:56.3946210Z 2025-05-07T19:45:56.3946221Z 2025-05-07T19:45:56.3946243Z 2025-05-07T19:45:56.4864487Z libcurand-10.3.7.77 | 39.9 MB | #5 | 16%  2025-05-07T19:45:56.4864863Z 2025-05-07T19:45:56.4864868Z 2025-05-07T19:45:56.4864872Z 2025-05-07T19:45:56.4865154Z 2025-05-07T19:45:56.4865159Z 2025-05-07T19:45:56.4865162Z 2025-05-07T19:45:56.4865166Z 2025-05-07T19:45:56.4865169Z 2025-05-07T19:45:56.4943436Z cuda-nvdisasm-12.6.7 | 47.6 MB | #######9 | 79%  2025-05-07T19:45:56.4943818Z 2025-05-07T19:45:56.4943823Z 2025-05-07T19:45:56.4943827Z 2025-05-07T19:45:56.4943830Z 2025-05-07T19:45:56.4943834Z 2025-05-07T19:45:56.4943839Z 2025-05-07T19:45:56.4943842Z 2025-05-07T19:45:56.4943846Z 2025-05-07T19:45:56.4944095Z 2025-05-07T19:45:56.5866068Z libcurand-10.3.7.77 | 39.9 MB | ###3 | 34%  2025-05-07T19:45:56.5866427Z 2025-05-07T19:45:56.5866433Z 2025-05-07T19:45:56.5866439Z 2025-05-07T19:45:56.5866466Z 2025-05-07T19:45:56.5866469Z 2025-05-07T19:45:56.5866473Z 2025-05-07T19:45:56.5866476Z 2025-05-07T19:45:56.5866487Z 2025-05-07T19:45:56.5944366Z cuda-nvdisasm-12.6.7 | 47.6 MB | #########4 | 94%  2025-05-07T19:45:56.5944745Z 2025-05-07T19:45:56.5944765Z 2025-05-07T19:45:56.5944769Z 2025-05-07T19:45:56.5944772Z 2025-05-07T19:45:56.5944776Z 2025-05-07T19:45:56.5944779Z 2025-05-07T19:45:56.5944783Z 2025-05-07T19:45:56.5944786Z 2025-05-07T19:45:56.5944789Z 2025-05-07T19:45:56.6945412Z libcurand-10.3.7.77 | 39.9 MB | #####1 | 51%  
2025-05-07T19:45:56.6945761Z 2025-05-07T19:45:56.6945765Z 2025-05-07T19:45:56.6945769Z 2025-05-07T19:45:56.6945772Z 2025-05-07T19:45:56.6945776Z 2025-05-07T19:45:56.6945779Z 2025-05-07T19:45:56.6945783Z 2025-05-07T19:45:56.6945786Z 2025-05-07T19:45:56.6946316Z 2025-05-07T19:45:56.9162207Z libcurand-10.3.7.77 | 39.9 MB | #######5 | 75%  2025-05-07T19:45:56.9162803Z 2025-05-07T19:45:56.9943725Z libcublas-12.6.4.1 | 256.2 MB | ########## | 100%  2025-05-07T19:45:56.9944037Z 2025-05-07T19:45:56.9944043Z 2025-05-07T19:45:56.9944049Z 2025-05-07T19:45:56.9944052Z 2025-05-07T19:45:56.9944056Z 2025-05-07T19:45:56.9944059Z 2025-05-07T19:45:56.9944081Z 2025-05-07T19:45:56.9944084Z 2025-05-07T19:45:56.9944088Z 2025-05-07T19:45:56.9944091Z 2025-05-07T19:45:57.0944408Z gds-tools-1.11.1.6 | 37.8 MB | | 0%  2025-05-07T19:45:57.0944766Z 2025-05-07T19:45:57.0944771Z 2025-05-07T19:45:57.0944774Z 2025-05-07T19:45:57.0944778Z 2025-05-07T19:45:57.0944782Z 2025-05-07T19:45:57.0944785Z 2025-05-07T19:45:57.0944790Z 2025-05-07T19:45:57.0944793Z 2025-05-07T19:45:57.0944797Z 2025-05-07T19:45:57.0944800Z 2025-05-07T19:45:57.1490776Z gds-tools-1.11.1.6 | 37.8 MB | ##3 | 24%  2025-05-07T19:45:57.1491117Z 2025-05-07T19:45:57.1491122Z 2025-05-07T19:45:57.1491127Z 2025-05-07T19:45:57.1491155Z 2025-05-07T19:45:57.1491203Z 2025-05-07T19:45:57.1491206Z 2025-05-07T19:45:57.1491210Z 2025-05-07T19:45:57.1491213Z 2025-05-07T19:45:57.1841416Z cuda-nvdisasm-12.6.7 | 47.6 MB | ########## | 100%  2025-05-07T19:45:57.1841765Z 2025-05-07T19:45:57.1841790Z 2025-05-07T19:45:57.1947713Z libcufft-11.3.0.4 | 156.2 MB | ########## | 100%  2025-05-07T19:45:57.1948046Z 2025-05-07T19:45:57.1948050Z 2025-05-07T19:45:57.1948054Z 2025-05-07T19:45:57.1948058Z 2025-05-07T19:45:57.1948061Z 2025-05-07T19:45:57.1948065Z 2025-05-07T19:45:57.1948068Z 2025-05-07T19:45:57.1948072Z 2025-05-07T19:45:57.1948075Z 2025-05-07T19:45:57.1948078Z 2025-05-07T19:45:57.1992540Z gds-tools-1.11.1.6 | 37.8 MB | ####9 | 50%  
2025-05-07T19:45:57.2768699Z cuda-nvcc-tools-12.6 | 23.0 MB | | 0%
2025-05-07T19:45:57.2770718Z libcurand-10.3.7.77 | 39.9 MB | ########## | 100%
2025-05-07T19:45:57.2949145Z libcurand-10.3.7.77 | 39.9 MB | ########## | 100%
2025-05-07T19:45:57.2993009Z gds-tools-1.11.1.6 | 37.8 MB | ######9 | 70%
2025-05-07T19:45:57.3102230Z cuda-nvcc-tools-12.6 | 23.0 MB | ##9 | 30%
2025-05-07T19:45:57.3258657Z libnpp-12.3.1.54 | 93.4 MB | ########## | 100%
2025-05-07T19:45:57.3618391Z cuda-nvrtc-12.6.85 | 17.3 MB | | 0%
2025-05-07T19:45:57.3998609Z libnvjitlink-12.6.85 | 14.9 MB | | 0%
2025-05-07T19:45:57.4259404Z cuda-nvcc-tools-12.6 | 23.0 MB | #####1 | 52%
2025-05-07T19:45:57.4453504Z cuda-nvrtc-12.6.85 | 17.3 MB | ##6 | 27%
2025-05-07T19:45:57.4633283Z gds-tools-1.11.1.6 | 37.8 MB | ########9 | 89%
2025-05-07T19:45:57.5189017Z libnvjitlink-12.6.85 | 14.9 MB | ##5 | 26%
2025-05-07T19:45:57.5261623Z cuda-nvcc-tools-12.6 | 23.0 MB | #######1 | 72%
2025-05-07T19:45:57.5637640Z cuda-nvrtc-12.6.85 | 17.3 MB | #####1 | 51%
2025-05-07T19:45:57.6262879Z libnvjitlink-12.6.85 | 14.9 MB | #####3 | 53%
2025-05-07T19:45:57.7508360Z cuda-nvrtc-12.6.85 | 17.3 MB | ########9 | 89%
2025-05-07T19:45:57.7988113Z cuda-nvcc-tools-12.6 | 23.0 MB | ########9 | 90%
2025-05-07T19:45:57.7988887Z libnvjitlink-12.6.85 | 14.9 MB | ########## | 100%
2025-05-07T19:45:57.8290681Z libnvjitlink-12.6.85 | 14.9 MB | ########## | 100%
2025-05-07T19:45:57.8345851Z cuda-nvrtc-12.6.85 | 17.3 MB | ########## | 100%
2025-05-07T19:45:57.8624611Z cuda-nvcc-dev_linux- | 10.8 MB | | 0%
2025-05-07T19:45:57.9346994Z cuda-nvvm-tools-12.6 | 10.4 MB | | 0%
2025-05-07T19:45:57.9625347Z cuda-nvcc-dev_linux- | 10.8 MB | #####8 | 58%
2025-05-07T19:45:58.0042795Z cuda-nvvm-tools-12.6 | 10.4 MB | ########5 | 86%
2025-05-07T19:45:58.0419075Z gds-tools-1.11.1.6 | 37.8 MB | ########## | 100%
2025-05-07T19:45:58.0798327Z cuda-sanitizer-api-1 | 8.9 MB | | 0%
2025-05-07T19:45:58.1009040Z cuda-nvcc-tools-12.6 | 23.0 MB | ########## | 100%
2025-05-07T19:45:58.1094558Z cuda-nvvm-tools-12.6 | 10.4 MB | ########## | 100%
2025-05-07T19:45:58.1095658Z cuda-nvcc-dev_linux- | 10.8 MB | ########## | 100%
2025-05-07T19:45:58.1143510Z cuda-nvcc-dev_linux- | 10.8 MB | ########## | 100%
2025-05-07T19:45:58.1341523Z cuda-nvvm-impl-12.6. | 7.7 MB | | 0%
2025-05-07T19:45:58.1421670Z cuda-cupti-dev-12.6. | 3.4 MB | | 0%
2025-05-07T19:45:58.1454148Z cuda-sanitizer-api-1 | 8.9 MB | #########2 | 93%
2025-05-07T19:45:58.2146196Z ... (more hidden) ...
2025-05-07T19:45:58.2266213Z cuda-nvvm-impl-12.6. | 7.7 MB | #######9 | 80%
2025-05-07T19:45:58.2406187Z ... (more hidden) ...
2025-05-07T19:45:58.2407251Z cuda-cupti-dev-12.6. | 3.4 MB | ########## | 100%
2025-05-07T19:45:58.2685855Z cuda-cupti-dev-12.6. | 3.4 MB | ########## | 100%
2025-05-07T19:45:58.2881518Z cuda-sanitizer-api-1 | 8.9 MB | ########## | 100%
2025-05-07T19:45:58.3631953Z cuda-nvvm-impl-12.6. | 7.7 MB | ########## | 100%
2025-05-07T19:45:58.4862472Z libcusolver-11.7.1.2 | 95.8 MB | ########## | 100%
2025-05-07T19:45:58.6322454Z cuda-nvvp-12.6.80 | 109.3 MB | ########## | 100%
2025-05-07T19:45:58.8289865Z cuda-nvdisasm-12.6.7 | 47.6 MB | ########## | 100%
2025-05-07T19:45:59.0558717Z libcurand-10.3.7.77 | 39.9 MB | ########## | 100%
2025-05-07T19:45:59.3410701Z libnvjitlink-12.6.85 | 14.9 MB | ########## | 100%
2025-05-07T19:45:59.5727175Z cuda-nvrtc-12.6.85 | 17.3 MB | ########## | 100%
2025-05-07T19:45:59.7358459Z gds-tools-1.11.1.6 | 37.8 MB | ########## | 100%
2025-05-07T19:45:59.9058624Z libnpp-12.3.1.54 | 93.4 MB | ########## | 100%
2025-05-07T19:45:59.9580761Z cuda-nvvm-tools-12.6 | 10.4 MB | ########## | 100%
2025-05-07T19:45:59.9624140Z nsight-compute-2024. | 443.1 MB | ########## | 100%
2025-05-07T19:46:00.0162422Z cuda-nvcc-tools-12.6 | 23.0 MB | ########## | 100%
2025-05-07T19:46:00.0163219Z ... (more hidden) ...
2025-05-07T19:46:00.1266469Z ... (more hidden) ...
2025-05-07T19:46:00.1439535Z cuda-cupti-dev-12.6. | 3.4 MB | ########## | 100%
2025-05-07T19:46:00.2696835Z cuda-nvcc-dev_linux- | 10.8 MB | ########## | 100%
2025-05-07T19:46:00.2969788Z cuda-nvvm-impl-12.6. | 7.7 MB | ########## | 100%
2025-05-07T19:46:00.8426999Z cuda-sanitizer-api-1 | 8.9 MB | ########## | 100%
2025-05-07T19:46:04.5467711Z libcublas-12.6.4.1 | 256.2 MB | ########## | 100%
2025-05-07T19:46:04.5472163Z nsight-compute-2024. | 443.1 MB | ########## | 100%
2025-05-07T19:46:04.5495280Z
2025-05-07T19:46:04.5495514Z 2025-05-07T19:46:04.5495517Z 2025-05-07T19:46:04.5495521Z 2025-05-07T19:46:04.5495524Z 2025-05-07T19:46:04.5495527Z 2025-05-07T19:46:04.5495531Z 2025-05-07T19:46:04.5495535Z 2025-05-07T19:46:04.5495539Z 2025-05-07T19:46:04.5495566Z 2025-05-07T19:46:04.5495569Z 2025-05-07T19:46:04.5495573Z 2025-05-07T19:46:04.5495576Z 2025-05-07T19:46:04.5495579Z 2025-05-07T19:46:04.5495583Z 2025-05-07T19:46:04.5495587Z 2025-05-07T19:46:04.5495590Z 2025-05-07T19:46:04.5495593Z 2025-05-07T19:46:04.5495597Z 2025-05-07T19:46:04.5495839Z  2025-05-07T19:46:04.5496112Z 2025-05-07T19:46:04.5496115Z 2025-05-07T19:46:04.5496223Z  2025-05-07T19:46:04.5496341Z 2025-05-07T19:46:04.5496345Z 2025-05-07T19:46:04.5496456Z  2025-05-07T19:46:04.5496610Z 2025-05-07T19:46:04.5496614Z 2025-05-07T19:46:04.5496621Z 2025-05-07T19:46:04.5496737Z  2025-05-07T19:46:04.5496865Z 2025-05-07T19:46:04.5496868Z 2025-05-07T19:46:04.5496872Z 2025-05-07T19:46:04.5496904Z 2025-05-07T19:46:04.5497020Z  2025-05-07T19:46:04.5497149Z 2025-05-07T19:46:04.5497153Z 2025-05-07T19:46:04.5497156Z 2025-05-07T19:46:04.5497159Z 2025-05-07T19:46:04.5497163Z 2025-05-07T19:46:04.5497311Z  2025-05-07T19:46:04.5497446Z 2025-05-07T19:46:04.5497450Z 2025-05-07T19:46:04.5497453Z 2025-05-07T19:46:04.5497457Z 2025-05-07T19:46:04.5497460Z 2025-05-07T19:46:04.5497463Z 2025-05-07T19:46:04.5497587Z  2025-05-07T19:46:04.5497755Z 2025-05-07T19:46:04.5497759Z 2025-05-07T19:46:04.5497766Z 2025-05-07T19:46:04.5497770Z 2025-05-07T19:46:04.5497773Z 2025-05-07T19:46:04.5497777Z 2025-05-07T19:46:04.5497780Z 2025-05-07T19:46:04.5497905Z  2025-05-07T19:46:04.5498066Z 2025-05-07T19:46:04.5498070Z 2025-05-07T19:46:04.5498096Z 2025-05-07T19:46:04.5498103Z 2025-05-07T19:46:04.5498106Z 2025-05-07T19:46:04.5498110Z 2025-05-07T19:46:04.5498113Z 2025-05-07T19:46:04.5498117Z 2025-05-07T19:46:04.5498247Z  2025-05-07T19:46:04.5498412Z 2025-05-07T19:46:04.5498416Z 2025-05-07T19:46:04.5498419Z 2025-05-07T19:46:04.5498423Z 
2025-05-07T19:46:04.5498455Z 2025-05-07T19:46:04.5498458Z 2025-05-07T19:46:04.5498462Z 2025-05-07T19:46:04.5498465Z 2025-05-07T19:46:04.5498468Z 2025-05-07T19:46:04.5498597Z  2025-05-07T19:46:04.5498768Z 2025-05-07T19:46:04.5498771Z 2025-05-07T19:46:04.5498775Z 2025-05-07T19:46:04.5498778Z 2025-05-07T19:46:04.5498782Z 2025-05-07T19:46:04.5498785Z 2025-05-07T19:46:04.5498815Z 2025-05-07T19:46:04.5498819Z 2025-05-07T19:46:04.5498822Z 2025-05-07T19:46:04.5498825Z 2025-05-07T19:46:04.5498959Z  2025-05-07T19:46:04.5499137Z 2025-05-07T19:46:04.5499141Z 2025-05-07T19:46:04.5499145Z 2025-05-07T19:46:04.5499148Z 2025-05-07T19:46:04.5499152Z 2025-05-07T19:46:04.5500784Z 2025-05-07T19:46:04.5500794Z 2025-05-07T19:46:04.5500830Z 2025-05-07T19:46:04.5500833Z 2025-05-07T19:46:04.5500836Z 2025-05-07T19:46:04.5500840Z 2025-05-07T19:46:04.5501019Z  2025-05-07T19:46:04.5501223Z 2025-05-07T19:46:04.5501227Z 2025-05-07T19:46:04.5501232Z 2025-05-07T19:46:04.5501236Z 2025-05-07T19:46:04.5501239Z 2025-05-07T19:46:04.5501243Z 2025-05-07T19:46:04.5501246Z 2025-05-07T19:46:04.5501274Z 2025-05-07T19:46:04.5501277Z 2025-05-07T19:46:04.5501281Z 2025-05-07T19:46:04.5501286Z 2025-05-07T19:46:04.5501289Z 2025-05-07T19:46:04.5501432Z  2025-05-07T19:46:04.5501632Z 2025-05-07T19:46:04.5501641Z 2025-05-07T19:46:04.5501645Z 2025-05-07T19:46:04.5501648Z 2025-05-07T19:46:04.5501652Z 2025-05-07T19:46:04.5501657Z 2025-05-07T19:46:04.5501687Z 2025-05-07T19:46:04.5501690Z 2025-05-07T19:46:04.5501693Z 2025-05-07T19:46:04.5501697Z 2025-05-07T19:46:04.5501700Z 2025-05-07T19:46:04.5501703Z 2025-05-07T19:46:04.5501711Z 2025-05-07T19:46:04.5501859Z  2025-05-07T19:46:04.5502066Z 2025-05-07T19:46:04.5502069Z 2025-05-07T19:46:04.5502073Z 2025-05-07T19:46:04.5502101Z 2025-05-07T19:46:04.5502104Z 2025-05-07T19:46:04.5502107Z 2025-05-07T19:46:04.5502111Z 2025-05-07T19:46:04.5502114Z 2025-05-07T19:46:04.5502118Z 2025-05-07T19:46:04.5502122Z 2025-05-07T19:46:04.5502126Z 2025-05-07T19:46:04.5502129Z 
2025-05-07T19:46:04.5502133Z 2025-05-07T19:46:04.5502136Z 2025-05-07T19:46:04.5502286Z  2025-05-07T19:46:04.5502527Z 2025-05-07T19:46:04.5502530Z 2025-05-07T19:46:04.5502534Z 2025-05-07T19:46:04.5502537Z 2025-05-07T19:46:04.5502609Z 2025-05-07T19:46:04.5502612Z 2025-05-07T19:46:04.5502616Z 2025-05-07T19:46:04.5502619Z 2025-05-07T19:46:04.5502623Z 2025-05-07T19:46:04.5502626Z 2025-05-07T19:46:04.5502629Z 2025-05-07T19:46:04.5502633Z 2025-05-07T19:46:04.5502636Z 2025-05-07T19:46:04.5502639Z 2025-05-07T19:46:04.5502646Z 2025-05-07T19:46:04.5502927Z  2025-05-07T19:46:04.5503180Z 2025-05-07T19:46:04.5503183Z 2025-05-07T19:46:04.5503187Z 2025-05-07T19:46:04.5503191Z 2025-05-07T19:46:04.5503194Z 2025-05-07T19:46:04.5503198Z 2025-05-07T19:46:04.5503201Z 2025-05-07T19:46:04.5503204Z 2025-05-07T19:46:04.5503208Z 2025-05-07T19:46:04.5503211Z 2025-05-07T19:46:04.5503215Z 2025-05-07T19:46:04.5503219Z 2025-05-07T19:46:04.5503222Z 2025-05-07T19:46:04.5503226Z 2025-05-07T19:46:04.5503229Z 2025-05-07T19:46:04.5503233Z 2025-05-07T19:46:04.5503419Z  2025-05-07T19:46:04.5503644Z 2025-05-07T19:46:04.5503647Z 2025-05-07T19:46:04.5503654Z 2025-05-07T19:46:04.5503658Z 2025-05-07T19:46:04.5503661Z 2025-05-07T19:46:04.5503665Z 2025-05-07T19:46:04.5503668Z 2025-05-07T19:46:04.5503672Z 2025-05-07T19:46:04.5503676Z 2025-05-07T19:46:04.5503679Z 2025-05-07T19:46:04.5503683Z 2025-05-07T19:46:04.5503686Z 2025-05-07T19:46:04.5503692Z 2025-05-07T19:46:04.5503719Z 2025-05-07T19:46:04.5503722Z 2025-05-07T19:46:04.5503726Z 2025-05-07T19:46:04.5503729Z 2025-05-07T19:46:04.5503897Z  2025-05-07T19:46:04.5504127Z 2025-05-07T19:46:04.5504131Z 2025-05-07T19:46:04.5504134Z 2025-05-07T19:46:04.5504138Z 2025-05-07T19:46:04.5504141Z 2025-05-07T19:46:04.5504145Z 2025-05-07T19:46:04.5504173Z 2025-05-07T19:46:04.5504176Z 2025-05-07T19:46:04.5504180Z 2025-05-07T19:46:04.5504183Z 2025-05-07T19:46:04.5504187Z 2025-05-07T19:46:04.5504190Z 2025-05-07T19:46:04.5504194Z 2025-05-07T19:46:04.5504197Z 
2025-05-07T19:46:04.5504201Z 2025-05-07T19:46:04.5504205Z 2025-05-07T19:46:04.5504211Z 2025-05-07T19:46:04.5504215Z 2025-05-07T19:46:04.5504393Z  2025-05-07T19:46:04.5504654Z 2025-05-07T19:46:04.5504657Z 2025-05-07T19:46:04.5504762Z  2025-05-07T19:46:04.5504882Z 2025-05-07T19:46:04.5504885Z 2025-05-07T19:46:04.5505058Z  2025-05-07T19:46:04.5505217Z 2025-05-07T19:46:04.5505220Z 2025-05-07T19:46:04.5505224Z 2025-05-07T19:46:04.5505337Z  2025-05-07T19:46:04.5505464Z 2025-05-07T19:46:04.5505468Z 2025-05-07T19:46:04.5505501Z 2025-05-07T19:46:04.5505504Z 2025-05-07T19:46:04.5505624Z  2025-05-07T19:46:04.5505756Z 2025-05-07T19:46:04.5505760Z 2025-05-07T19:46:04.5505764Z 2025-05-07T19:46:04.5505768Z 2025-05-07T19:46:04.5505772Z 2025-05-07T19:46:04.5505921Z  2025-05-07T19:46:04.5506062Z 2025-05-07T19:46:04.5506065Z 2025-05-07T19:46:04.5506069Z 2025-05-07T19:46:04.5506072Z 2025-05-07T19:46:04.5506075Z 2025-05-07T19:46:04.5506079Z 2025-05-07T19:46:04.5506201Z  2025-05-07T19:46:04.5506369Z 2025-05-07T19:46:04.5506373Z 2025-05-07T19:46:04.5506377Z 2025-05-07T19:46:04.5506380Z 2025-05-07T19:46:04.5506384Z 2025-05-07T19:46:04.5506387Z 2025-05-07T19:46:04.5506391Z 2025-05-07T19:46:04.5506514Z  2025-05-07T19:46:04.5506693Z 2025-05-07T19:46:04.5506699Z 2025-05-07T19:46:04.5506703Z 2025-05-07T19:46:04.5506707Z 2025-05-07T19:46:04.5506710Z 2025-05-07T19:46:04.5506714Z 2025-05-07T19:46:04.5506717Z 2025-05-07T19:46:04.5506721Z 2025-05-07T19:46:04.5506847Z  2025-05-07T19:46:04.5507015Z 2025-05-07T19:46:04.5507018Z 2025-05-07T19:46:04.5507021Z 2025-05-07T19:46:04.5507051Z 2025-05-07T19:46:04.5507055Z 2025-05-07T19:46:04.5507058Z 2025-05-07T19:46:04.5507062Z 2025-05-07T19:46:04.5507066Z 2025-05-07T19:46:04.5507069Z 2025-05-07T19:46:04.5507202Z  2025-05-07T19:46:04.5507378Z 2025-05-07T19:46:04.5507382Z 2025-05-07T19:46:04.5507386Z 2025-05-07T19:46:04.5507389Z 2025-05-07T19:46:04.5507459Z 2025-05-07T19:46:04.5507488Z 2025-05-07T19:46:04.5507492Z 2025-05-07T19:46:04.5507495Z 
2025-05-07T19:46:04.5507499Z 2025-05-07T19:46:04.5507503Z 2025-05-07T19:46:04.5507641Z  2025-05-07T19:46:04.5507825Z 2025-05-07T19:46:04.5507829Z 2025-05-07T19:46:04.5507836Z 2025-05-07T19:46:04.5507839Z 2025-05-07T19:46:04.5507843Z 2025-05-07T19:46:04.5507846Z 2025-05-07T19:46:04.5507876Z 2025-05-07T19:46:04.5507880Z 2025-05-07T19:46:04.5507884Z 2025-05-07T19:46:04.5507887Z 2025-05-07T19:46:04.5507891Z 2025-05-07T19:46:04.5508027Z  2025-05-07T19:46:04.5508223Z 2025-05-07T19:46:04.5508226Z 2025-05-07T19:46:04.5508230Z 2025-05-07T19:46:04.5508233Z 2025-05-07T19:46:04.5508237Z 2025-05-07T19:46:04.5508240Z 2025-05-07T19:46:04.5508268Z 2025-05-07T19:46:04.5508271Z 2025-05-07T19:46:04.5508275Z 2025-05-07T19:46:04.5508278Z 2025-05-07T19:46:04.5508281Z 2025-05-07T19:46:04.5508285Z 2025-05-07T19:46:04.5508428Z  2025-05-07T19:46:04.5508628Z 2025-05-07T19:46:04.5508632Z 2025-05-07T19:46:04.5508635Z 2025-05-07T19:46:04.5508638Z 2025-05-07T19:46:04.5508642Z 2025-05-07T19:46:04.5508672Z 2025-05-07T19:46:04.5508675Z 2025-05-07T19:46:04.5508678Z 2025-05-07T19:46:04.5508682Z 2025-05-07T19:46:04.5508688Z 2025-05-07T19:46:04.5508692Z 2025-05-07T19:46:04.5508695Z 2025-05-07T19:46:04.5508699Z 2025-05-07T19:46:04.5508845Z  2025-05-07T19:46:04.5509173Z 2025-05-07T19:46:04.5509177Z 2025-05-07T19:46:04.5509208Z 2025-05-07T19:46:04.5509211Z 2025-05-07T19:46:04.5509214Z 2025-05-07T19:46:04.5509218Z 2025-05-07T19:46:04.5509221Z 2025-05-07T19:46:04.5509224Z 2025-05-07T19:46:04.5509228Z 2025-05-07T19:46:04.5509231Z 2025-05-07T19:46:04.5509235Z 2025-05-07T19:46:04.5509238Z 2025-05-07T19:46:04.5509241Z 2025-05-07T19:46:04.5509244Z 2025-05-07T19:46:04.5509398Z  2025-05-07T19:46:04.5509636Z 2025-05-07T19:46:04.5509644Z 2025-05-07T19:46:04.5509647Z 2025-05-07T19:46:04.5509651Z 2025-05-07T19:46:04.5509654Z 2025-05-07T19:46:04.5509657Z 2025-05-07T19:46:04.5509661Z 2025-05-07T19:46:04.5509664Z 2025-05-07T19:46:04.5509667Z 2025-05-07T19:46:04.5509670Z 2025-05-07T19:46:04.5509674Z 
2025-05-07T19:46:04.5509678Z 2025-05-07T19:46:04.5509738Z 2025-05-07T19:46:04.5509742Z 2025-05-07T19:46:04.5509745Z 2025-05-07T19:46:04.5509903Z  2025-05-07T19:46:04.5510144Z 2025-05-07T19:46:04.5510147Z 2025-05-07T19:46:04.5510150Z 2025-05-07T19:46:04.5510154Z 2025-05-07T19:46:04.5510157Z 2025-05-07T19:46:04.5510161Z 2025-05-07T19:46:04.5510164Z 2025-05-07T19:46:04.5510168Z 2025-05-07T19:46:04.5510171Z 2025-05-07T19:46:04.5510174Z 2025-05-07T19:46:04.5510178Z 2025-05-07T19:46:04.5510181Z 2025-05-07T19:46:04.5510185Z 2025-05-07T19:46:04.5510188Z 2025-05-07T19:46:04.5510191Z 2025-05-07T19:46:04.5510194Z 2025-05-07T19:46:04.5510378Z  2025-05-07T19:46:04.5510603Z 2025-05-07T19:46:04.5510607Z 2025-05-07T19:46:04.5510610Z 2025-05-07T19:46:04.5510613Z 2025-05-07T19:46:04.5510617Z 2025-05-07T19:46:04.5510620Z 2025-05-07T19:46:04.5510623Z 2025-05-07T19:46:04.5510627Z 2025-05-07T19:46:04.5510630Z 2025-05-07T19:46:04.5510633Z 2025-05-07T19:46:04.5510640Z 2025-05-07T19:46:04.5510643Z 2025-05-07T19:46:04.5510646Z 2025-05-07T19:46:04.5510674Z 2025-05-07T19:46:04.5510678Z 2025-05-07T19:46:04.5510681Z 2025-05-07T19:46:04.5510684Z 2025-05-07T19:46:04.5510848Z  2025-05-07T19:46:04.5511070Z 2025-05-07T19:46:04.5511074Z 2025-05-07T19:46:04.5511077Z 2025-05-07T19:46:04.5511081Z 2025-05-07T19:46:04.5511084Z 2025-05-07T19:46:04.5511087Z 2025-05-07T19:46:04.5511115Z 2025-05-07T19:46:04.5511118Z 2025-05-07T19:46:04.5511121Z 2025-05-07T19:46:04.5511124Z 2025-05-07T19:46:04.5511128Z 2025-05-07T19:46:04.5511131Z 2025-05-07T19:46:04.5511134Z 2025-05-07T19:46:04.5511138Z 2025-05-07T19:46:04.5511212Z 2025-05-07T19:46:04.5511215Z 2025-05-07T19:46:04.5511219Z 2025-05-07T19:46:04.5511222Z 2025-05-07T19:46:04.5511395Z  2025-05-07T19:46:04.5511648Z 2025-05-07T19:46:04.5511651Z 2025-05-07T19:46:04.5511754Z  2025-05-07T19:46:04.5511870Z 2025-05-07T19:46:04.5511874Z 2025-05-07T19:46:04.5511982Z  2025-05-07T19:46:04.5512126Z 2025-05-07T19:46:04.5512129Z 2025-05-07T19:46:04.5512133Z 
2025-05-07T19:46:04.5512239Z  2025-05-07T19:46:04.5512358Z 2025-05-07T19:46:04.5512363Z 2025-05-07T19:46:04.5512391Z 2025-05-07T19:46:04.5512394Z 2025-05-07T19:46:04.5512504Z  2025-05-07T19:46:04.5512629Z 2025-05-07T19:46:04.5512633Z 2025-05-07T19:46:04.5512637Z 2025-05-07T19:46:04.5512640Z 2025-05-07T19:46:04.5512643Z 2025-05-07T19:46:04.5512784Z  2025-05-07T19:46:04.5512918Z 2025-05-07T19:46:04.5512923Z 2025-05-07T19:46:04.5512927Z 2025-05-07T19:46:04.5512930Z 2025-05-07T19:46:04.5512933Z 2025-05-07T19:46:04.5512940Z 2025-05-07T19:46:04.5513057Z  2025-05-07T19:46:04.5513220Z 2025-05-07T19:46:04.5513224Z 2025-05-07T19:46:04.5513227Z 2025-05-07T19:46:04.5513231Z 2025-05-07T19:46:04.5513234Z 2025-05-07T19:46:04.5513238Z 2025-05-07T19:46:04.5513241Z 2025-05-07T19:46:04.5513363Z  2025-05-07T19:46:04.5513536Z 2025-05-07T19:46:04.5513540Z 2025-05-07T19:46:04.5513543Z 2025-05-07T19:46:04.5513547Z 2025-05-07T19:46:04.5513550Z 2025-05-07T19:46:04.5513553Z 2025-05-07T19:46:04.5513557Z 2025-05-07T19:46:04.5513560Z 2025-05-07T19:46:04.5513686Z  2025-05-07T19:46:04.5513844Z 2025-05-07T19:46:04.5513848Z 2025-05-07T19:46:04.5513851Z 2025-05-07T19:46:04.5513879Z 2025-05-07T19:46:04.5513882Z 2025-05-07T19:46:04.5513885Z 2025-05-07T19:46:04.5513889Z 2025-05-07T19:46:04.5513892Z 2025-05-07T19:46:04.5513895Z 2025-05-07T19:46:04.5514031Z  2025-05-07T19:46:04.5514197Z 2025-05-07T19:46:04.5514201Z 2025-05-07T19:46:04.5514207Z 2025-05-07T19:46:04.5514211Z 2025-05-07T19:46:04.5514216Z 2025-05-07T19:46:04.5514242Z 2025-05-07T19:46:04.5514245Z 2025-05-07T19:46:04.5514248Z 2025-05-07T19:46:04.5514251Z 2025-05-07T19:46:04.5514255Z 2025-05-07T19:46:04.5514392Z  2025-05-07T19:46:04.5514641Z 2025-05-07T19:46:04.5514645Z 2025-05-07T19:46:04.5514649Z 2025-05-07T19:46:04.5514653Z 2025-05-07T19:46:04.5514656Z 2025-05-07T19:46:04.5514660Z 2025-05-07T19:46:04.5514687Z 2025-05-07T19:46:04.5514691Z 2025-05-07T19:46:04.5514694Z 2025-05-07T19:46:04.5514697Z 2025-05-07T19:46:04.5514701Z 
2025-05-07T19:46:04.5514840Z  2025-05-07T19:46:04.5515031Z 2025-05-07T19:46:04.5515035Z 2025-05-07T19:46:04.5515039Z 2025-05-07T19:46:04.5515042Z 2025-05-07T19:46:04.5515046Z 2025-05-07T19:46:04.5515050Z 2025-05-07T19:46:04.5515077Z 2025-05-07T19:46:04.5515081Z 2025-05-07T19:46:04.5515084Z 2025-05-07T19:46:04.5515087Z 2025-05-07T19:46:04.5515090Z 2025-05-07T19:46:04.5515097Z 2025-05-07T19:46:04.5515238Z  2025-05-07T19:46:04.5515432Z 2025-05-07T19:46:04.5515436Z 2025-05-07T19:46:04.5515439Z 2025-05-07T19:46:04.5515444Z 2025-05-07T19:46:04.5515447Z 2025-05-07T19:46:04.5515474Z 2025-05-07T19:46:04.5515477Z 2025-05-07T19:46:04.5515483Z 2025-05-07T19:46:04.5515487Z 2025-05-07T19:46:04.5515490Z 2025-05-07T19:46:04.5515493Z 2025-05-07T19:46:04.5515497Z 2025-05-07T19:46:04.5515501Z 2025-05-07T19:46:04.5515644Z  2025-05-07T19:46:04.5515846Z 2025-05-07T19:46:04.5515850Z 2025-05-07T19:46:04.5515880Z 2025-05-07T19:46:04.5515883Z 2025-05-07T19:46:04.5515886Z 2025-05-07T19:46:04.5515890Z 2025-05-07T19:46:04.5515893Z 2025-05-07T19:46:04.5515896Z 2025-05-07T19:46:04.5515900Z 2025-05-07T19:46:04.5515903Z 2025-05-07T19:46:04.5515907Z 2025-05-07T19:46:04.5515911Z 2025-05-07T19:46:04.5515914Z 2025-05-07T19:46:04.5515917Z 2025-05-07T19:46:04.5516067Z  2025-05-07T19:46:04.5516364Z 2025-05-07T19:46:04.5516368Z 2025-05-07T19:46:04.5516371Z 2025-05-07T19:46:04.5516374Z 2025-05-07T19:46:04.5516377Z 2025-05-07T19:46:04.5516381Z 2025-05-07T19:46:04.5516385Z 2025-05-07T19:46:04.5516388Z 2025-05-07T19:46:04.5516391Z 2025-05-07T19:46:04.5516398Z 2025-05-07T19:46:04.5516402Z 2025-05-07T19:46:04.5516405Z 2025-05-07T19:46:04.5516408Z 2025-05-07T19:46:04.5516411Z 2025-05-07T19:46:04.5516415Z 2025-05-07T19:46:04.5516571Z  2025-05-07T19:46:04.5516833Z 2025-05-07T19:46:04.5516862Z 2025-05-07T19:46:04.5516866Z 2025-05-07T19:46:04.5516869Z 2025-05-07T19:46:04.5516872Z 2025-05-07T19:46:04.5516876Z 2025-05-07T19:46:04.5516879Z 2025-05-07T19:46:04.5516882Z 2025-05-07T19:46:04.5516886Z 
2025-05-07T19:46:04.5516889Z 2025-05-07T19:46:04.5516892Z 2025-05-07T19:46:04.5516896Z 2025-05-07T19:46:04.5516900Z 2025-05-07T19:46:04.5516903Z 2025-05-07T19:46:04.5516907Z 2025-05-07T19:46:04.5516910Z 2025-05-07T19:46:04.5517077Z  2025-05-07T19:46:04.5517326Z 2025-05-07T19:46:04.5517329Z 2025-05-07T19:46:04.5517332Z 2025-05-07T19:46:04.5517336Z 2025-05-07T19:46:04.5517339Z 2025-05-07T19:46:04.5517343Z 2025-05-07T19:46:04.5517346Z 2025-05-07T19:46:04.5517354Z 2025-05-07T19:46:04.5517357Z 2025-05-07T19:46:04.5517361Z 2025-05-07T19:46:04.5517364Z 2025-05-07T19:46:04.5517367Z 2025-05-07T19:46:04.5517370Z 2025-05-07T19:46:04.5517374Z 2025-05-07T19:46:04.5517377Z 2025-05-07T19:46:04.5517380Z 2025-05-07T19:46:04.5517383Z 2025-05-07T19:46:04.5517583Z  2025-05-07T19:46:04.5517812Z 2025-05-07T19:46:04.5517816Z 2025-05-07T19:46:04.5517819Z 2025-05-07T19:46:04.5517822Z 2025-05-07T19:46:04.5517825Z 2025-05-07T19:46:04.5517829Z 2025-05-07T19:46:04.5517832Z 2025-05-07T19:46:04.5517835Z 2025-05-07T19:46:04.5517838Z 2025-05-07T19:46:04.5517842Z 2025-05-07T19:46:04.5517845Z 2025-05-07T19:46:04.5517848Z 2025-05-07T19:46:04.5517886Z 2025-05-07T19:46:04.5517889Z 2025-05-07T19:46:04.5517892Z 2025-05-07T19:46:04.5517895Z 2025-05-07T19:46:04.5517899Z 2025-05-07T19:46:04.5517902Z 2025-05-07T19:46:04.5518076Z  2025-05-07T19:46:04.5518312Z 2025-05-07T19:46:04.5518370Z 2025-05-07T19:46:04.5518503Z  2025-05-07T19:46:04.5518619Z 2025-05-07T19:46:04.5518623Z 2025-05-07T19:46:04.5518732Z  2025-05-07T19:46:04.5518877Z 2025-05-07T19:46:04.5518880Z 2025-05-07T19:46:04.5518883Z 2025-05-07T19:46:04.5518994Z  2025-05-07T19:46:04.5519117Z 2025-05-07T19:46:04.5519121Z 2025-05-07T19:46:04.5519124Z 2025-05-07T19:46:04.5519128Z 2025-05-07T19:46:04.5519275Z  2025-05-07T19:46:04.5519405Z 2025-05-07T19:46:04.5519409Z 2025-05-07T19:46:04.5519412Z 2025-05-07T19:46:04.5519416Z 2025-05-07T19:46:04.5519419Z 2025-05-07T19:46:04.5519539Z  2025-05-07T19:46:04.5519706Z 2025-05-07T19:46:04.5519709Z 
2025-05-07T19:46:04.5520528Z  done 2025-05-07T19:46:04.7616707Z Preparing transaction: | / done 2025-05-07T19:46:05.5638708Z Verifying transaction: \ | / - \ | / - done 2025-05-07T19:46:05.8698148Z Executing transaction: | / - done 2025-05-07T19:46:07.8422680Z [INSTALL] Fixing file placements for CUDA 12.6.3+ ... 2025-05-07T19:46:07.8423594Z [INSTALL] Creating symlinks: libnvToolsExt.so 2025-05-07T19:46:07.8424429Z + ln -sf /github/home/miniconda/envs/build_binary/lib/libnvToolsExt.so.1 /github/home/miniconda/envs/build_binary/lib/libnvToolsExt.so 2025-05-07T19:46:07.8425072Z 2025-05-07T19:46:07.8439375Z 2025-05-07T19:46:07.8440407Z + ln -sf /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvToolsExt.so.1 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvToolsExt.so 2025-05-07T19:46:07.8441244Z 2025-05-07T19:46:07.8452684Z 2025-05-07T19:46:07.8452901Z [INSTALL] Copying nvtx3 headers ... 
2025-05-07T19:46:07.8457527Z + cp -r /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExt.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtCuda.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtCudaRt.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtOpenCL.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtSync.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvtx3.hpp /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvtxDetail /github/home/miniconda/envs/build_binary/include/ 2025-05-07T19:46:07.8462164Z 2025-05-07T19:46:07.8684493Z 2025-05-07T19:46:07.8689411Z + cp -r /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExt.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtCuda.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtCudaRt.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtOpenCL.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtSync.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvtx3.hpp /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvtxDetail /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include/ 2025-05-07T19:46:07.8693854Z 
2025-05-07T19:46:07.8706848Z 2025-05-07T19:46:07.8707121Z [INSTALL] Appending libcuda.so path to LD_LIBRARY_PATH ... 2025-05-07T19:46:07.9109331Z [ENV] Appending to LD_LIBRARY_PATH: /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs ... 2025-05-07T19:46:09.6903071Z ERROR conda.cli.main_run:execute(125): `conda run printenv LD_LIBRARY_PATH` failed. (See above for error) 2025-05-07T19:46:09.7477106Z + conda env config vars set -n build_binary LD_LIBRARY_PATH=/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs 2025-05-07T19:46:09.7478980Z 2025-05-07T19:46:10.1723268Z 2025-05-07T19:46:10.1726319Z [INSTALL] Setting environment variable NVML_LIB_PATH ... 2025-05-07T19:46:10.2095614Z + conda env config vars set -n build_binary NVML_LIB_PATH=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:46:10.2097313Z 2025-05-07T19:46:10.6545867Z 2025-05-07T19:46:10.6547906Z [INSTALL] Setting environment variable CUDA_INCLUDE_DIRS ... 2025-05-07T19:46:10.6550938Z + conda env config vars set -n build_binary CUDA_INCLUDE_DIRS="/github/home/miniconda/envs/build_binary/include/:/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include/" 2025-05-07T19:46:10.6552562Z 2025-05-07T19:46:11.0722609Z 2025-05-07T19:46:13.0158200Z [CHECK] cuda_runtime.h found in CONDA_PREFIX PATH (file): /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include/cuda_runtime.h 2025-05-07T19:46:14.9662527Z [CHECK] libcuda.so found in CONDA_PREFIX PATH (file): /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:46:16.8919072Z [CHECK] libnvToolsExt.so found in CONDA_PREFIX PATH (symbolic link): /github/home/miniconda/envs/build_binary/lib/libnvToolsExt.so 2025-05-07T19:46:16.8920005Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvToolsExt.so 2025-05-07T19:46:18.8335589Z [CHECK] libnvidia-ml.so found in CONDA_PREFIX PATH (file): 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libnvidia-ml.so 2025-05-07T19:46:20.6127497Z /github/home/miniconda/envs/build_binary/bin/nvcc 2025-05-07T19:46:20.6128329Z 2025-05-07T19:46:20.6696054Z [CHECK] Binary nvcc found in PATH 2025-05-07T19:46:24.3360952Z /tmp/tmp0fssn86t: line 3: clang: command not found 2025-05-07T19:46:24.3361348Z 2025-05-07T19:46:24.3361697Z ERROR conda.cli.main_run:execute(125): `conda run clang --version` failed. (See above for error) 2025-05-07T19:46:24.3929916Z + ls -la /github/home/miniconda/envs/build_binary/etc/conda/activate.d 2025-05-07T19:46:24.3930384Z 2025-05-07T19:46:24.3953744Z total 56 2025-05-07T19:46:24.3954195Z drwxr-xr-x. 2 root root 16384 May 7 19:46 . 2025-05-07T19:46:24.3954611Z drwxr-xr-x. 5 root root 62 May 7 19:44 .. 2025-05-07T19:46:24.3955126Z -rw-r--r--. 2 root root 3778 Jun 10 2024 activate-binutils_linux-64.sh 2025-05-07T19:46:24.3955675Z -rw-r--r--. 2 root root 11630 Jun 10 2024 activate-gcc_linux-64.sh 2025-05-07T19:46:24.3956162Z -rw-r--r--. 2 root root 5190 Jun 10 2024 activate-gxx_linux-64.sh 2025-05-07T19:46:24.3956665Z -rw-r--r--. 2 root root 136 Mar 27 01:27 libglib_activate.sh 2025-05-07T19:46:24.3957113Z -rw-r--r--. 2 root root 872 May 7 16:10 libxml2_activate.sh 2025-05-07T19:46:24.3957581Z -rw-r--r--. 2 root root 499 Mar 28 22:35 openjdk_activate.sh 2025-05-07T19:46:24.3958031Z -rw-r--r--. 2 root root 2932 Nov 20 20:32 ~cuda-nvcc_activate.sh 2025-05-07T19:46:24.3958348Z 2025-05-07T19:46:24.3958587Z [INSTALL] Removing the -ccbin=CXX hook from NVCC activation scripts ... 2025-05-07T19:46:24.3959337Z + sed -i /-ccbin=/d /github/home/miniconda/envs/build_binary/etc/conda/activate.d/*cuda-nvcc_activate.sh 2025-05-07T19:46:24.3959813Z 2025-05-07T19:46:24.3974390Z 2025-05-07T19:46:24.3975583Z + conda run -n build_binary c++ --version | grep -i clang 2025-05-07T19:46:24.3976492Z 2025-05-07T19:46:26.3172053Z 2025-05-07T19:46:26.3172803Z [BUILD] Setting prepend flags for NVCC ... 
2025-05-07T19:46:26.3173465Z + conda env config vars set -n build_binary NVCC_PREPEND_FLAGS="-allow-unsupported-compiler" 2025-05-07T19:46:26.3173930Z 2025-05-07T19:46:26.7298880Z 2025-05-07T19:46:26.7299782Z + conda run -n build_binary printenv NVCC_PREPEND_FLAGS 2025-05-07T19:46:26.7300236Z 2025-05-07T19:46:28.5223889Z -allow-unsupported-compiler 2025-05-07T19:46:28.5224214Z 2025-05-07T19:46:28.5802487Z 2025-05-07T19:46:28.5803254Z [INFO] Printing out all preprocessor defines in nvcc ... 2025-05-07T19:46:28.5803890Z + conda run -n build_binary nvcc --compiler-options -dM -E -x cu - < /dev/null 2025-05-07T19:46:28.5804320Z 2025-05-07T19:46:30.4706721Z #define _GLIBCXX_DEPRECATED_SUGGEST(ALT) __attribute__ ((__deprecated__ ("use '" ALT "' instead"))) 2025-05-07T19:46:30.4707392Z #define M_PIl 3.141592653589793238462643383279502884L 2025-05-07T19:46:30.4707850Z #define _IO_CURRENTLY_PUTTING 0x800 2025-05-07T19:46:30.4708230Z #define __W_EXITCODE(ret,sig) ((ret) << 8 | (sig)) 2025-05-07T19:46:30.4708604Z #define __DBL_MIN_EXP__ (-1021) 2025-05-07T19:46:30.4708929Z #define _STL_PAIR_H 1 2025-05-07T19:46:30.4709232Z #define __cpp_attributes 200809L 2025-05-07T19:46:30.4709627Z #define __cpp_nontype_template_parameter_auto 201606L 2025-05-07T19:46:30.4710013Z #define __DELETE_THROW throw() 2025-05-07T19:46:30.4710328Z #define _PTRDIFF_T_ 2025-05-07T19:46:30.4710592Z #define M_PI_4 0.78539816339744830962 2025-05-07T19:46:30.4710938Z #define __UINT_LEAST16_MAX__ 0xffff 2025-05-07T19:46:30.4711275Z #define _IO_LEFT 02 2025-05-07T19:46:30.4711531Z #define __ATOMIC_ACQUIRE 2 2025-05-07T19:46:30.4712149Z #define _POSIX2_BC_SCALE_MAX 99 2025-05-07T19:46:30.4712458Z #define _GLIBCXX_USE_RANDOM_TR1 1 2025-05-07T19:46:30.4816245Z #define _GLIBCXX_MOVE_BACKWARD3(_Tp,_Up,_Vp) std::move_backward(_Tp, _Up, _Vp) 2025-05-07T19:46:30.4816877Z #define __FLT128_MAX_10_EXP__ 4932 2025-05-07T19:46:30.4817249Z #define RE_DUP_MAX (0x7fff) 2025-05-07T19:46:30.4817614Z #define _IOS_OUTPUT 2 
2025-05-07T19:46:30.4818022Z #define __FLT_MIN__ 1.17549435082228750796873653722224568e-38F 2025-05-07T19:46:30.4818457Z #define toascii_l(c,l) __toascii_l ((c), (l)) 2025-05-07T19:46:30.4818791Z #define _GLIBCXX_USE_FCHMOD 1 2025-05-07T19:46:30.4819138Z #define __cpp_aggregate_nsdmi 201304L 2025-05-07T19:46:30.4820109Z #define __bswap_16(x) (__extension__ ({ unsigned short int __v, __x = (unsigned short int) (x); if (__builtin_constant_p (__x)) __v = __bswap_constant_16 (__x); else __asm__ ("rorw $8, %w0" : "=r" (__v) : "0" (__x) : "cc"); __v; })) 2025-05-07T19:46:30.4821046Z #define __UINT_LEAST8_TYPE__ unsigned char 2025-05-07T19:46:30.4821445Z #define __SIZEOF_FLOAT80__ 16 2025-05-07T19:46:30.4821773Z #define cudaTextureTypeCubemapLayered 0xFC 2025-05-07T19:46:30.4822146Z #define _T_WCHAR_ 2025-05-07T19:46:30.4822394Z #define stdout stdout 2025-05-07T19:46:30.4822800Z #define _GLIBCXX_ABI_TAG_CXX11 __attribute ((__abi_tag__ ("cxx11"))) 2025-05-07T19:46:30.4823233Z #define CHAR_BIT __CHAR_BIT__ 2025-05-07T19:46:30.4823545Z #define __flexarr [] 2025-05-07T19:46:30.4823858Z #define _GLIBCXX_HAVE_FINITEF 1 2025-05-07T19:46:30.4824222Z #define __islower_l(c,l) __isctype_l((c), _ISlower, (l)) 2025-05-07T19:46:30.4824638Z #define _IO_FLAGS2_USER_WBUF 8 2025-05-07T19:46:30.4824931Z #define _MATH_H 1 2025-05-07T19:46:30.4825250Z #define cudaOccupancyDisableCachingOverride 0x01 2025-05-07T19:46:30.4825636Z #define __S64_TYPE long int 2025-05-07T19:46:30.4825944Z #define __stub_fchflags 2025-05-07T19:46:30.4826244Z #define cudaDeviceScheduleMask 0x07 2025-05-07T19:46:30.4826601Z #define __SQUAD_TYPE long int 2025-05-07T19:46:30.4826906Z #define __INTMAX_C(c) c ## L 2025-05-07T19:46:30.4827223Z #define _BSD_SIZE_T_DEFINED_ 2025-05-07T19:46:30.4827555Z #define NL_NMAX INT_MAX 2025-05-07T19:46:30.4827823Z #define _BITS_TIME_H 1 2025-05-07T19:46:30.4828159Z #define M_LN10l 2.302585092994045684017991454684364208L 2025-05-07T19:46:30.4828719Z #define _GLIBCXX_TXN_SAFE_DYN 
2025-05-07T19:46:30.4829101Z #define cudaStreamTailLaunch ((cudaStream_t)0x3) 2025-05-07T19:46:30.4829498Z #define M_El 2.718281828459045235360287471352662498L 2025-05-07T19:46:30.4829978Z #define _PSTL_PRAGMA_DECLARE_SIMD _PSTL_PRAGMA(omp declare simd) 2025-05-07T19:46:30.4830393Z #define __CHAR_BIT__ 8 2025-05-07T19:46:30.4830718Z #define __FSWORD_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:46:30.4831100Z #define _PSTL_STRING_CONCAT(x,y) x #y 2025-05-07T19:46:30.4831427Z #define _GLIBCXX98_USE_C99_MATH 1 2025-05-07T19:46:30.4831755Z #define FP_NAN 0 2025-05-07T19:46:30.4832050Z #define makedev(maj,min) gnu_dev_makedev (maj, min) 2025-05-07T19:46:30.4832569Z #define __glibcxx_requires_sorted_set_pred(_First1,_Last1,_First2,_Pred) 2025-05-07T19:46:30.4833128Z #define cudaGetDeviceProperties cudaGetDeviceProperties_v2 2025-05-07T19:46:30.4833586Z #define __cudaCDP2GetErrorString 2025-05-07T19:46:30.4833899Z #define SHRT_MAX __SHRT_MAX__ 2025-05-07T19:46:30.4834228Z #define _GLIBCXX_X86_RDSEED 1 2025-05-07T19:46:30.4834538Z #define __SM_80_RT_H__ 2025-05-07T19:46:30.4834794Z #define _NEW 2025-05-07T19:46:30.4835078Z #define CLOCK_PROCESS_CPUTIME_ID 2 2025-05-07T19:46:30.4835389Z #define __UINT8_MAX__ 0xff 2025-05-07T19:46:30.4835825Z #define _PSTL_ASSERT_MSG(_Condition,_Message) __glibcxx_assert(_Condition) 2025-05-07T19:46:30.4836280Z #define __SCHAR_WIDTH__ 8 2025-05-07T19:46:30.4836557Z #define __USE_ANSI 1 2025-05-07T19:46:30.4836874Z #define _IO_BE(expr,res) __builtin_expect ((expr), res) 2025-05-07T19:46:30.4837337Z #define __isupper_l(c,l) __isctype_l((c), _ISupper, (l)) 2025-05-07T19:46:30.4837772Z #define __cudaCDP2Memcpy2DAsync_ptsz 2025-05-07T19:46:30.4838193Z #define __WINT_MAX__ 0xffffffffU 2025-05-07T19:46:30.4838484Z #define __SIZEOF_PTHREAD_ATTR_T 56 2025-05-07T19:46:30.4838766Z #define __FLT32_MIN_EXP__ (-125) 2025-05-07T19:46:30.4839072Z #define _GLIBCXX_END_NAMESPACE_LDBL 2025-05-07T19:46:30.4839374Z #define PIPE_BUF 4096 2025-05-07T19:46:30.4839727Z 
#define _PSTL_PRAGMA_SIMD_ORDERED_MONOTONIC_2ARGS(PRM1,PRM2) 2025-05-07T19:46:30.4840115Z #define ADJ_TICK 0x4000 2025-05-07T19:46:30.4840434Z #define _PSTL_VERSION_PATCH (_PSTL_VERSION % 10) 2025-05-07T19:46:30.4840771Z #define MQ_PRIO_MAX 32768 2025-05-07T19:46:30.4841089Z #define __SIZEOF_PTHREAD_MUTEXATTR_T 4 2025-05-07T19:46:30.4841469Z #define __WAIT_INT(status) (*(int *) &(status)) 2025-05-07T19:46:30.4841978Z #define __GLIBC_PREREQ(maj,min) ((__GLIBC__ << 16) + __GLIBC_MINOR__ >= ((maj) << 16) + (min)) 2025-05-07T19:46:30.4842579Z #define cudaCooperativeLaunchMultiDeviceNoPreSync 0x01 2025-05-07T19:46:30.4842981Z #define _XOPEN_SOURCE 700 2025-05-07T19:46:30.4843282Z #define _POSIX2_BC_DIM_MAX 2048 2025-05-07T19:46:30.4843581Z #define __VECTOR_FUNCTIONS_HPP__ 2025-05-07T19:46:30.4843918Z #define __cpp_static_assert 201411L 2025-05-07T19:46:30.4844281Z #define __WEXITSTATUS(status) (((status) & 0xff00) >> 8) 2025-05-07T19:46:30.4844794Z #define _GLIBCXX_HAVE_STRXFRM_L 1 2025-05-07T19:46:30.4845118Z #define _POSIX_TTY_NAME_MAX 9 2025-05-07T19:46:30.4845422Z #define _GLIBCXX_USE_WEAK_REF __GXX_WEAK__ 2025-05-07T19:46:30.4845772Z #define __OFF_T_MATCHES_OFF64_T 1 2025-05-07T19:46:30.4846072Z #define __ORDER_LITTLE_ENDIAN__ 1234 2025-05-07T19:46:30.4846418Z #define __SIZE_MAX__ 0xffffffffffffffffUL 2025-05-07T19:46:30.4846798Z #define __ispunct_l(c,l) __isctype_l((c), _ISpunct, (l)) 2025-05-07T19:46:30.4847188Z #define __WCHAR_MAX__ 0x7fffffff 2025-05-07T19:46:30.4847488Z #define _GLIBCXX_USE_CLOCK_MONOTONIC 1 2025-05-07T19:46:30.4847854Z #define __BLKCNT_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:46:30.4848277Z #define __isprint_l(c,l) __isctype_l((c), _ISprint, (l)) 2025-05-07T19:46:30.4848661Z #define cudaNvSciSyncAttrSignal 0x1 2025-05-07T19:46:30.4849009Z #define _GLIBCXX_USE_LONG_LONG 1 2025-05-07T19:46:30.4849323Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_1 1 2025-05-07T19:46:30.4849697Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_2 1 
2025-05-07T19:46:30.4850113Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_4 1 2025-05-07T19:46:30.4850583Z #define __DBL_DENORM_MIN__ double(4.94065645841246544176568792868221372e-324L) 2025-05-07T19:46:30.4851029Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_8 1 2025-05-07T19:46:30.4851388Z #define ADJ_ESTERROR 0x0008 2025-05-07T19:46:30.4851677Z #define __GCC_ATOMIC_CHAR_LOCK_FREE 2 2025-05-07T19:46:30.4852015Z #define __GCC_IEC_559 2 2025-05-07T19:46:30.4852364Z #define __cpp_lib_transformation_trait_aliases 201304 2025-05-07T19:46:30.4852727Z #define _IO_flockfile(_fp) 2025-05-07T19:46:30.4853032Z #define CLOCK_MONOTONIC_RAW 4 2025-05-07T19:46:30.4853325Z #define __FLT32X_DECIMAL_DIG__ 17 2025-05-07T19:46:30.4853635Z #define _IOFBF 0 2025-05-07T19:46:30.4853866Z #define __USE_BSD 1 2025-05-07T19:46:30.4854144Z #define __FLT_EVAL_METHOD__ 0 2025-05-07T19:46:30.4854435Z #define SHRT_MIN (-SHRT_MAX - 1) 2025-05-07T19:46:30.4854748Z #define _IO_USER_LOCK 0x8000 2025-05-07T19:46:30.4855024Z #define _IO_NO_WRITES 8 2025-05-07T19:46:30.4855321Z #define _GLIBCXX_PSEUDO_VISIBILITY(V) 2025-05-07T19:46:30.4855721Z #define __ASMNAME2(prefix,cname) __STRING (prefix) cname 2025-05-07T19:46:30.4856098Z #define _GLIBCXX_HAVE_SYS_STAT_H 1 2025-05-07T19:46:30.4856445Z #define MB_CUR_MAX (__ctype_get_mb_cur_max ()) 2025-05-07T19:46:30.4856788Z #define __cpp_binary_literals 201304L 2025-05-07T19:46:30.4857125Z #define _CPP_TYPE_TRAITS_H 1 2025-05-07T19:46:30.4857407Z #define __BEGIN_NAMESPACE_C99 2025-05-07T19:46:30.4857718Z #define __FLT64_DECIMAL_DIG__ 17 2025-05-07T19:46:30.4858053Z #define _GLIBCXX_SYNCHRONIZATION_HAPPENS_AFTER(A) 2025-05-07T19:46:30.4858491Z #define _G_HAVE_ST_BLKSIZE defined (_STATBUF_ST_BLKSIZE) 2025-05-07T19:46:30.4859321Z #define __cpp_noexcept_function_type 201510L 2025-05-07T19:46:30.4859804Z #define M_PI 3.14159265358979323846 2025-05-07T19:46:30.4860334Z #define _GLIBCXX_PACKAGE_NAME "package-unused" 2025-05-07T19:46:30.4860692Z #define 
_GLIBCXX_HAVE_BUILTIN_IS_SAME 1 2025-05-07T19:46:30.4861061Z #define __GCC_ATOMIC_CHAR32_T_LOCK_FREE 2 2025-05-07T19:46:30.4861398Z #define _POSIX_DELAYTIMER_MAX 32 2025-05-07T19:46:30.4861719Z #define _GLIBCXX_USE_UTIME 1 2025-05-07T19:46:30.4862010Z #define _STL_ITERATOR_BASE_FUNCS_H 1 2025-05-07T19:46:30.4862682Z #define _IO_peekc_unlocked(_fp) (_IO_BE ((_fp)->_IO_read_ptr >= (_fp)->_IO_read_end, 0) && __underflow (_fp) == EOF ? EOF : *(unsigned char *) (_fp)->_IO_read_ptr) 2025-05-07T19:46:30.4863364Z #define _GLIBCXX_TR1_ELL_INTEGRAL_TCC 1 2025-05-07T19:46:30.4863714Z #define w_termsig __wait_terminated.__w_termsig 2025-05-07T19:46:30.4864095Z #define __FLOAT_WORD_ORDER __BYTE_ORDER 2025-05-07T19:46:30.4864421Z #define __cudaCDP2GetErrorName 2025-05-07T19:46:30.4864743Z #define XATTR_SIZE_MAX 65536 2025-05-07T19:46:30.4865037Z #define be64toh(x) __bswap_64 (x) 2025-05-07T19:46:30.4865396Z #define __ASSERT_VOID_CAST static_cast 2025-05-07T19:46:30.4865753Z #define __cpp_variadic_templates 200704L 2025-05-07T19:46:30.4866111Z #define RAND_MAX 2147483647 2025-05-07T19:46:30.4866438Z #define _GLIBCXX_USE_C99_COMPLEX_TR1 1 2025-05-07T19:46:30.4866801Z #define __UINT_FAST64_MAX__ 0xffffffffffffffffUL 2025-05-07T19:46:30.4867175Z #define __SM_90_RT_H__ 2025-05-07T19:46:30.4867442Z #define __SIG_ATOMIC_TYPE__ int 2025-05-07T19:46:30.4867752Z #define __COMPAR_FN_T 2025-05-07T19:46:30.4868013Z #define __GID_T_TYPE __U32_TYPE 2025-05-07T19:46:30.4868327Z #define _IO_BAD_SEEN 0x4000 2025-05-07T19:46:30.4868853Z #define _PSTL_PRAGMA_MESSAGE_IMPL(x) _PSTL_PRAGMA(message(_PSTL_STRING_CONCAT(_PSTL_PRAGMA_LOCATION, x))) 2025-05-07T19:46:30.4869456Z #define __DBL_MIN_10_EXP__ (-307) 2025-05-07T19:46:30.4869852Z #define __glibcxx_requires_sorted_pred(_First,_Last,_Pred) 2025-05-07T19:46:30.4870251Z #define __FINITE_MATH_ONLY__ 0 2025-05-07T19:46:30.4870607Z #define _PSTL_PRAGMA_SIMD_INCLUSIVE_SCAN(PRM) 2025-05-07T19:46:30.4870974Z #define cudaArrayColorAttachment 0x20 
2025-05-07T19:46:30.4871338Z #define __cpp_variable_templates 201304L 2025-05-07T19:46:30.4871984Z #define cudaKernelNodeAttributeMemSyncDomainMap cudaLaunchAttributeMemSyncDomainMap 2025-05-07T19:46:30.4872716Z #define __cpp_lib_integral_constant_callable 201304 2025-05-07T19:46:30.4873081Z #define _GLIBCXX_HAVE_SINHF 1 2025-05-07T19:46:30.4873415Z #define MOD_TIMECONST ADJ_TIMECONST 2025-05-07T19:46:30.4873767Z #define __cpp_lib_result_of_sfinae 201210 2025-05-07T19:46:30.4874098Z #define __SM_30_INTRINSICS_H__ 2025-05-07T19:46:30.4874423Z #define __FLT32X_MAX_EXP__ 1024 2025-05-07T19:46:30.4874712Z #define _GLIBCXX_USE_WCHAR_T 1 2025-05-07T19:46:30.4875020Z #define _GLIBCXX_MATH_H 1 2025-05-07T19:46:30.4875288Z #define __u_char_defined 2025-05-07T19:46:30.4875657Z #define WIFEXITED(status) __WIFEXITED (__WAIT_INT (status)) 2025-05-07T19:46:30.4876051Z #define STA_PPSERROR 0x0800 2025-05-07T19:46:30.4876358Z #define _GLIBCXX_STD_A std 2025-05-07T19:46:30.4876638Z #define __FLT32_HAS_DENORM__ 1 2025-05-07T19:46:30.4876976Z #define _GLIBCXX_BEGIN_NAMESPACE_VERSION 2025-05-07T19:46:30.4877482Z #define __device_builtin_texture_type__ __location__(device_builtin_texture_type) 2025-05-07T19:46:30.4877949Z #define FP_INFINITE 1 2025-05-07T19:46:30.4878375Z #define _GLIBCXX11_DEPRECATED_SUGGEST(ALT) _GLIBCXX_DEPRECATED_SUGGEST(ALT) 2025-05-07T19:46:30.4878832Z #define _IO_pid_t __pid_t 2025-05-07T19:46:30.4879276Z #define __UINT_FAST8_MAX__ 0xff 2025-05-07T19:46:30.4879553Z #define __LEAF , __leaf__ 2025-05-07T19:46:30.4879837Z #define PATH_MAX 4096 2025-05-07T19:46:30.4880107Z #define __cpp_rvalue_reference 200610L 2025-05-07T19:46:30.4880483Z #define __LDBL_REDIR1(name,proto,alias) name proto 2025-05-07T19:46:30.4880849Z #define _LIMITS_H___ 2025-05-07T19:46:30.4881092Z #define __size_t 2025-05-07T19:46:30.4881356Z #define _GLIBCXX_HAVE_FREXPF 1 2025-05-07T19:46:30.4882095Z #define STA_RONLY (STA_PPSSIGNAL | STA_PPSJITTER | STA_PPSWANDER | STA_PPSERROR | STA_CLOCKERR | 
STA_NANO | STA_MODE | STA_CLK) 2025-05-07T19:46:30.4883005Z #define _GLIBCXX_HAVE_FREXPL 1 2025-05-07T19:46:30.4883339Z #define __cpp_nested_namespace_definitions 201411L 2025-05-07T19:46:30.4883733Z #define __DEC64_MAX_EXP__ 385 2025-05-07T19:46:30.4884015Z #define _WCHAR_T_DEFINED 2025-05-07T19:46:30.4884436Z #define __glibcxx_requires_can_decrement_range(_First1,_Last1,_First2) 2025-05-07T19:46:30.4884902Z #define MOD_STATUS ADJ_STATUS 2025-05-07T19:46:30.4885224Z #define _GLIBCXX_PURE __attribute__ ((__pure__)) 2025-05-07T19:46:30.4885618Z #define _GLIBCXX_HAVE_STDINT_H 1 2025-05-07T19:46:30.4885933Z #define __SIZEOF_PTHREAD_CONDATTR_T 4 2025-05-07T19:46:30.4886275Z #define __INT8_C(c) c 2025-05-07T19:46:30.4886553Z #define __cudaCDP2GetParameterBuffer 2025-05-07T19:46:30.4886904Z #define _GLIBCXX_HAVE_COSHF 1 2025-05-07T19:46:30.4887190Z #define _GLIBCXX_HAVE_COSHL 1 2025-05-07T19:46:30.4887498Z #define __SM_70_RT_HPP__ 2025-05-07T19:46:30.4887772Z #define __INT_LEAST8_WIDTH__ 8 2025-05-07T19:46:30.4888097Z #define __cpp_variadic_using 201611L 2025-05-07T19:46:30.4888475Z #define __UINT_LEAST64_MAX__ 0xffffffffffffffffUL 2025-05-07T19:46:30.4888834Z #define __INT_LEAST8_MAX__ 0x7f 2025-05-07T19:46:30.4889159Z #define __SM_61_INTRINSICS_HPP__ 2025-05-07T19:46:30.4889459Z #define _IO_FLAGS2_MMAP 1 2025-05-07T19:46:30.4889769Z #define __cpp_capture_star_this 201603L 2025-05-07T19:46:30.4890111Z #define __cudaCDP2LaunchDeviceV2_ptsz 2025-05-07T19:46:30.4890472Z #define _GLIBCXX_HAVE_ENDIAN_H 1 2025-05-07T19:46:30.4890868Z #define __always_inline __inline __attribute__ ((__always_inline__)) 2025-05-07T19:46:30.4891310Z #define NFDBITS __NFDBITS 2025-05-07T19:46:30.4891620Z #define _PSTL_PRAGMA_FORCEINLINE 2025-05-07T19:46:30.4891935Z #define _GLIBCXX_HAVE_SYS_STATVFS_H 1 2025-05-07T19:46:30.4892309Z #define __glibcxx_requires_sorted(_First,_Last) 2025-05-07T19:46:30.4892658Z #define __SHRT_MAX__ 0x7fff 2025-05-07T19:46:30.4892966Z #define _GLIBCXX_SYMVER_GNU 1 
2025-05-07T19:46:30.4893280Z #define w_stopval __wait_stopped.__w_stopval 2025-05-07T19:46:30.4893635Z #define STA_UNSYNC 0x0040 2025-05-07T19:46:30.4893972Z #define __LDBL_MAX__ 1.18973149535723176502126385303097021e+4932L 2025-05-07T19:46:30.4894559Z #define _GLIBCXX_USE_C99_COMPLEX _GLIBCXX11_USE_C99_COMPLEX 2025-05-07T19:46:30.4895063Z #define __FLT64X_MAX_10_EXP__ 4932 2025-05-07T19:46:30.4895398Z #define __cpp_if_constexpr 201606L 2025-05-07T19:46:30.4895763Z #define __glibcxx_class_requires4(_a,_b,_c,_d,_e) 2025-05-07T19:46:30.4896164Z #define cudaStreamFireAndForget ((cudaStream_t)0x4) 2025-05-07T19:46:30.4896558Z #define _GLIBCXX_HAVE_WCHAR_H 1 2025-05-07T19:46:30.4896904Z #define _GLIBCXX_USE_C99_STDIO _GLIBCXX11_USE_C99_STDIO 2025-05-07T19:46:30.4897291Z #define __daddr_t_defined 2025-05-07T19:46:30.4897567Z #define __LDBL_IS_IEC_60559__ 2 2025-05-07T19:46:30.4897883Z #define _GLIBCXX_TR1_RIEMANN_ZETA_TCC 1 2025-05-07T19:46:30.4898219Z #define _GLIBCXX_HAVE_STRUCT_DIRENT_D_TYPE 1 2025-05-07T19:46:30.4898796Z #define _PSTL_CPP11_STD_ROTATE_BROKEN ((__GLIBCXX__ && __GLIBCXX__ < 20150716) || (_MSC_VER && _MSC_VER < 1800)) 2025-05-07T19:46:30.4899344Z #define _ACRTIMP 2025-05-07T19:46:30.4899581Z #define _IO_EOF_SEEN 0x10 2025-05-07T19:46:30.4899961Z #define _GLIBCXX_TR1_POLY_LAGUERRE_TCC 1 2025-05-07T19:46:30.4900441Z #define _IOS_BIN 128 2025-05-07T19:46:30.4900857Z #define __fortify_function __extern_always_inline __attribute_artificial__ 2025-05-07T19:46:30.4901313Z #define __FLT64X_HAS_QUIET_NAN__ 1 2025-05-07T19:46:30.4901635Z #define UNDERFLOW 4 2025-05-07T19:46:30.4901871Z #define NAME_MAX 255 2025-05-07T19:46:30.4902155Z #define SCHAR_MAX __SCHAR_MAX__ 2025-05-07T19:46:30.4902473Z #define __UINT_LEAST8_MAX__ 0xff 2025-05-07T19:46:30.4902776Z #define __GCC_ATOMIC_BOOL_LOCK_FREE 2 2025-05-07T19:46:30.4903129Z #define _IO_UNIFIED_JUMPTABLES 1 2025-05-07T19:46:30.4903543Z #define __FLT128_DENORM_MIN__ 6.47517511943802511092443895822764655e-4966F128 
2025-05-07T19:46:30.4904095Z #define __ptr_t void * 2025-05-07T19:46:30.4904352Z #define M_E 2.7182818284590452354 2025-05-07T19:46:30.4904682Z #define cudaSurfaceType1D 0x01 2025-05-07T19:46:30.4904980Z #define __USE_ISOCXX11 1 2025-05-07T19:46:30.4905300Z #define __UINTMAX_TYPE__ long unsigned int 2025-05-07T19:46:30.4905651Z #define cudaDeviceBlockingSync 0x04 2025-05-07T19:46:30.4905997Z #define CLOCK_MONOTONIC_COARSE 6 2025-05-07T19:46:30.4906320Z #define _GLIBCXX_OS_DEFINES 1 2025-05-07T19:46:30.4906633Z #define _GLIBCXX_NODISCARD [[__nodiscard__]] 2025-05-07T19:46:30.4907002Z #define cudaSurfaceType2D 0x02 2025-05-07T19:46:30.4907287Z #define __linux 1 2025-05-07T19:46:30.4907567Z #define __DEC32_EPSILON__ 1E-6DF 2025-05-07T19:46:30.4907867Z #define cudaDeviceMask 0xff 2025-05-07T19:46:30.4908188Z #define _GLIBCXX_END_NAMESPACE_ALGO 2025-05-07T19:46:30.4908508Z #define __CUDA_API_VER_MAJOR__ 12 2025-05-07T19:46:30.4908842Z #define htobe16(x) __bswap_16 (x) 2025-05-07T19:46:30.4909158Z #define HUGE_VALF (__builtin_huge_valf()) 2025-05-07T19:46:30.4909518Z #define __FLT_EVAL_METHOD_TS_18661_3__ 0 2025-05-07T19:46:30.4909884Z #define HUGE_VALL (__builtin_huge_vall()) 2025-05-07T19:46:30.4910200Z #define _BITS_TYPES_H 1 2025-05-07T19:46:30.4910530Z #define ULONG_LONG_MAX (LONG_LONG_MAX * 2ULL + 1ULL) 2025-05-07T19:46:30.4910905Z #define _IO_cleanup_region_end(_Doit) 2025-05-07T19:46:30.4911253Z #define cudaSurfaceType3D 0x03 2025-05-07T19:46:30.4911551Z #define _GLIBCXX_HAVE_SYS_TIME_H 1 2025-05-07T19:46:30.4911886Z #define __cudaGet_blockIdx() blockIdx 2025-05-07T19:46:30.4912332Z #define _IO_DONT_CLOSE 0100000 2025-05-07T19:46:30.4913141Z #define __MATHDECLX(type,function,suffix,args,attrib) __MATHDECL_1(type, function,suffix, args) __attribute__ (attrib); __MATHDECL_1(type, __CONCAT(__,function),suffix, args) __attribute__ (attrib) 2025-05-07T19:46:30.4914022Z #define cudaHostRegisterDefault 0x00 2025-05-07T19:46:30.4914311Z #define __unix 1 
2025-05-07T19:46:30.4914555Z #define MATH_ERRNO 1 2025-05-07T19:46:30.4914804Z #define _GLIBCXX_STDIO_SEEK_END 2 2025-05-07T19:46:30.4915116Z #define _GLIBCXX_USE_FCHMODAT 1 2025-05-07T19:46:30.4915396Z #define __UINT32_MAX__ 0xffffffffU 2025-05-07T19:46:30.4915722Z #define __GXX_EXPERIMENTAL_CXX0X__ 1 2025-05-07T19:46:30.4916049Z #define __UID_T_TYPE __U32_TYPE 2025-05-07T19:46:30.4916406Z #define _GLIBCXX_HAVE_ATOMIC_LOCK_POLICY 1 2025-05-07T19:46:30.4916917Z #define __CUDART_API_VERSION ((__CUDA_API_VER_MAJOR__ * 1000) + (__CUDA_API_VER_MINOR__ * 10)) 2025-05-07T19:46:30.4917398Z #define __nv_pure__ __location__(nv_pure) 2025-05-07T19:46:30.4917693Z #define CUDARTAPI_CDECL 2025-05-07T19:46:30.4917933Z #define _PSTL_USAGE_WARNINGS 0 2025-05-07T19:46:30.4918206Z #define _GLIBCXX98_USE_C99_COMPLEX 1 2025-05-07T19:46:30.4918479Z #define __cpp_lib_void_t 201411 2025-05-07T19:46:30.4918739Z #define _POSIX_AIO_MAX 1 2025-05-07T19:46:30.4918961Z #define __SIZE_T 2025-05-07T19:46:30.4919206Z #define isgraph_l(c,l) __isgraph_l ((c), (l)) 2025-05-07T19:46:30.4919526Z #define _GLIBCXX_FULLY_DYNAMIC_STRING 0 2025-05-07T19:46:30.4919809Z #define _POSIX_PIPE_BUF 512 2025-05-07T19:46:30.4920069Z #define _GLIBCXX_HAVE_STRTOLD 1 2025-05-07T19:46:30.4920315Z #define _ATFILE_SOURCE 1 2025-05-07T19:46:30.4920703Z #define __glibcxx_assert(cond) do { __glibcxx_constexpr_assert(cond); } while (false) 2025-05-07T19:46:30.4921137Z #define __WAIT_STATUS void * 2025-05-07T19:46:30.4921404Z #define __MATH_FUNCTIONS_H__ 2025-05-07T19:46:30.4921829Z #define _GLIBCXX_HAVE_WCSTOF 1 2025-05-07T19:46:30.4922110Z #define __FLT128_MIN_EXP__ (-16381) 2025-05-07T19:46:30.4922412Z #define _GLIBCXX_HAVE_LC_MESSAGES 1 2025-05-07T19:46:30.4922693Z #define __WINT_MIN__ 0U 2025-05-07T19:46:30.4923322Z #define _PSTL_CPP14_VARIABLE_TEMPLATES_PRESENT (!__INTEL_COMPILER || __INTEL_COMPILER >= 1700) && (_MSC_FULL_VER >= 190023918 || __cplusplus >= 201402L) 2025-05-07T19:46:30.4924015Z #define isdigit_l(c,l) 
__isdigit_l ((c), (l)) 2025-05-07T19:46:30.4924343Z #define WUNTRACED 2 2025-05-07T19:46:30.4924601Z #define _GLIBCXX_HAVE_SQRTF 1 2025-05-07T19:46:30.4924996Z #define __SIZEOF_PTHREAD_RWLOCKATTR_T 8 2025-05-07T19:46:30.4925295Z #define NZERO 20 2025-05-07T19:46:30.4925536Z #define _GLIBCXX_HAVE_MEMALIGN 1 2025-05-07T19:46:30.4925855Z #define _PSTL_PRAGMA(x) _Pragma(#x) 2025-05-07T19:46:30.4926177Z #define MOD_CLKA ADJ_OFFSET_SINGLESHOT 2025-05-07T19:46:30.4926518Z #define MOD_CLKB ADJ_TICK 2025-05-07T19:46:30.4926794Z #define __FLT128_MIN_10_EXP__ (-4931) 2025-05-07T19:46:30.4927122Z #define __FLT32X_IS_IEC_60559__ 2 2025-05-07T19:46:30.4927414Z #define __DEVICE_FUNCTIONS_H__ 2025-05-07T19:46:30.4927731Z #define SCHAR_MIN (-SCHAR_MAX - 1) 2025-05-07T19:46:30.4928019Z #define EXIT_FAILURE 1 2025-05-07T19:46:30.4928298Z #define ADJ_MAXERROR 0x0004 2025-05-07T19:46:30.4928571Z #define __INT_LEAST16_WIDTH__ 16 2025-05-07T19:46:30.4928878Z #define _SIZE_T_DEFINED_ 2025-05-07T19:46:30.4929171Z #define _POSIX_AIO_LISTIO_MAX 2 2025-05-07T19:46:30.4929442Z #define __cudaCDP2DeviceGetLimit 2025-05-07T19:46:30.4929800Z #define __LDBL_REDIR_NTH(name,proto) name proto __THROW 2025-05-07T19:46:30.4930167Z #define __cudaCDP2FuncGetAttributes 2025-05-07T19:46:30.4930464Z #define __SCHAR_MAX__ 0x7f 2025-05-07T19:46:30.4930708Z #define __FLT128_MANT_DIG__ 113 2025-05-07T19:46:30.4930994Z #define __USING_NAMESPACE_STD(name) 2025-05-07T19:46:30.4931292Z #define _GLIBCXX_HAVE_OBSOLETE_ISINF 1 2025-05-07T19:46:30.4931614Z #define __WCHAR_MIN__ (-__WCHAR_MAX__ - 1) 2025-05-07T19:46:30.4931908Z #define SEEK_DATA 3 2025-05-07T19:46:30.4932151Z #define __KERNEL_STRICT_NAMES 2025-05-07T19:46:30.4932466Z #define _IO_stderr ((_IO_FILE*)(&_IO_2_1_stderr_)) 2025-05-07T19:46:30.4932896Z #define _IO_ferror_unlocked(__fp) (((__fp)->_flags & _IO_ERR_SEEN) != 0) 2025-05-07T19:46:30.4933321Z #define _FUNCTEXCEPT_H 1 2025-05-07T19:46:30.4933588Z #define __INT64_C(c) c ## L 2025-05-07T19:46:30.4933996Z 
#define __NTH(fct) __LEAF_ATTR fct throw () 2025-05-07T19:46:30.4934336Z #define _GLIBCXX_CONST __attribute__ ((__const__)) 2025-05-07T19:46:30.4934703Z #define _GLIBCXX_HAVE_LINK 1 2025-05-07T19:46:30.4934990Z #define cudaNvSciSyncAttrWait 0x2 2025-05-07T19:46:30.4935320Z #define __GCC_ATOMIC_POINTER_LOCK_FREE 2 2025-05-07T19:46:30.4935658Z #define STA_PPSWANDER 0x0400 2025-05-07T19:46:30.4935922Z #define __INT_WCHAR_T_H 2025-05-07T19:46:30.4936247Z #define WSTOPPED 2 2025-05-07T19:46:30.4936496Z #define _POSIX_THREAD_THREADS_MAX 64 2025-05-07T19:46:30.4936817Z #define _POSIX_MQ_OPEN_MAX 8 2025-05-07T19:46:30.4937078Z #define FP_NORMAL 4 2025-05-07T19:46:30.4937358Z #define __cudaCDP2LaunchDevice_ptsz 2025-05-07T19:46:30.4937657Z #define _BITS_TIMEX_H 1 2025-05-07T19:46:30.4937942Z #define _POSIX_LINK_MAX 8 2025-05-07T19:46:30.4938206Z #define _GLIBCXX_HAVE_LIMIT_FSIZE 1 2025-05-07T19:46:30.4938525Z #define _GLIBCXX_HAVE_ATAN2F 1 2025-05-07T19:46:30.4938808Z #define cudaTextureType1D 0x01 2025-05-07T19:46:30.4939115Z #define _GLIBCXX_HAVE_ATAN2L 1 2025-05-07T19:46:30.4939418Z #define COLL_WEIGHTS_MAX 255 2025-05-07T19:46:30.4939766Z #define __isascii(c) (((c) & ~0x7f) == 0) 2025-05-07T19:46:30.4940285Z #define __toascii(c) ((c) & 0x7f) 2025-05-07T19:46:30.4940776Z #define __attribute_format_strfmon__(a,b) __attribute__ ((__format__ (__strfmon__, a, b))) 2025-05-07T19:46:30.4941312Z #define _IO_MAGIC 0xFBAD0000 2025-05-07T19:46:30.4941606Z #define _GLIBCXX_USE_SENDFILE 1 2025-05-07T19:46:30.4941938Z #define _POSIX_SOURCE 1 2025-05-07T19:46:30.4942216Z #define cudaTextureType2D 0x02 2025-05-07T19:46:30.4942536Z #define _PTR_TRAITS_H 1 2025-05-07T19:46:30.4942859Z #define _GLIBCXX_NOEXCEPT_QUAL noexcept (_NE) 2025-05-07T19:46:30.4943206Z #define _GLIBCXX_HAVE_POWF 1 2025-05-07T19:46:30.4943535Z #define _POSIX2_BC_STRING_MAX 1000 2025-05-07T19:46:30.4943890Z #define __attribute_used__ __attribute__ ((__used__)) 2025-05-07T19:46:30.4944281Z #define cudaTextureType3D 
0x03 2025-05-07T19:46:30.4944579Z #define _STDIO_USES_IOSTREAM 2025-05-07T19:46:30.4944887Z #define CLOCK_REALTIME 0 2025-05-07T19:46:30.4945159Z #define __FLT32X_MANT_DIG__ 53 2025-05-07T19:46:30.4945489Z #define __GCC_ATOMIC_CHAR16_T_LOCK_FREE 2 2025-05-07T19:46:30.4945899Z #define __cpp_aligned_new 201606L 2025-05-07T19:46:30.4946233Z #define __USER_LABEL_PREFIX__ 2025-05-07T19:46:30.4946569Z #define cudaEventBlockingSync 0x01 2025-05-07T19:46:30.4946886Z #define _GLIBCXX_HAVE_TANL 1 2025-05-07T19:46:30.4947225Z #define _GLIBCXX_USE_PTHREAD_RWLOCK_T 1 2025-05-07T19:46:30.4947563Z #define _GLIBCXX_HAVE_LINUX_RANDOM_H 1 2025-05-07T19:46:30.4947922Z #define _GLIBCXX_USE_C99_FENV_TR1 1 2025-05-07T19:46:30.4948236Z #define __FLT32_MAX_10_EXP__ 38 2025-05-07T19:46:30.4948555Z #define __GLIBC__ 2 2025-05-07T19:46:30.4948803Z #define __END_DECLS } 2025-05-07T19:46:30.4949105Z #define FP_ILOGB0 (-2147483647 - 1) 2025-05-07T19:46:30.4949512Z #define __FLT64X_EPSILON__ 1.08420217248550443400745280086994171e-19F64x 2025-05-07T19:46:30.4949964Z #define __CONCAT(x,y) x ## y 2025-05-07T19:46:30.4950258Z #define WCONTINUED 8 2025-05-07T19:46:30.4950516Z #define __STDC_HOSTED__ 1 2025-05-07T19:46:30.4950824Z #define _GLIBCXX_HAVE_ARPA_INET_H 1 2025-05-07T19:46:30.4951129Z #define _ALLOCA_H 1 2025-05-07T19:46:30.4951417Z #define __host__ __location__(host) 2025-05-07T19:46:30.4951886Z #define __warndecl(name,msg) extern void name (void) __attribute__((__warning__ (msg))) 2025-05-07T19:46:30.4952505Z #define __SLONG32_TYPE int 2025-05-07T19:46:30.4952814Z #define _GLIBCXX_DEBUG_ASSERTIONS_H 1 2025-05-07T19:46:30.4953111Z #define _SYS_SELECT_H 1 2025-05-07T19:46:30.4953388Z #define _IO_LINE_BUF 0x200 2025-05-07T19:46:30.4953684Z #define _IOS_NOCREATE 32 2025-05-07T19:46:30.4953946Z #define __DEC64_MIN_EXP__ (-382) 2025-05-07T19:46:30.4954267Z #define __cudaGet_warpSize() warpSize 2025-05-07T19:46:30.4954575Z #define __SSIZE_T_TYPE __SWORD_TYPE 2025-05-07T19:46:30.4954904Z #define 
_GLIBCXX_HAVE_LIMIT_VMEM 0 2025-05-07T19:46:30.4955204Z #define __global__ __location__(global) 2025-05-07T19:46:30.4955535Z #define __GNU_LIBRARY__ 6 2025-05-07T19:46:30.4955810Z #define __cpp_decltype_auto 201304L 2025-05-07T19:46:30.4956136Z #define __DBL_DIG__ 15 2025-05-07T19:46:30.4956382Z #define TIME_UTC 1 2025-05-07T19:46:30.4956650Z #define __FLT32_DIG__ 6 2025-05-07T19:46:30.4957028Z #define __forceinline__ __inline__ __attribute__((always_inline)) 2025-05-07T19:46:30.4957444Z #define cudaHostAllocWriteCombined 0x04 2025-05-07T19:46:30.4957870Z #define cudaDeviceScheduleAuto 0x00 2025-05-07T19:46:30.4958192Z #define iscntrl_l(c,l) __iscntrl_l ((c), (l)) 2025-05-07T19:46:30.4958534Z #define _G_BUFSIZ 8192 2025-05-07T19:46:30.4958850Z #define __FLT_EPSILON__ 1.19209289550781250000000000000000000e-7F 2025-05-07T19:46:30.4959264Z #define cudaTextureTypeCubemap 0x0C 2025-05-07T19:46:30.4959565Z #define __cudaCDP2GetDevice 2025-05-07T19:46:30.4959881Z #define __cudaCDP2PeekAtLastError 2025-05-07T19:46:30.4960193Z #define STA_CLOCKERR 0x1000 2025-05-07T19:46:30.4960446Z #define __GXX_WEAK__ 1 2025-05-07T19:46:30.4960730Z #define __RLIM_T_TYPE __SYSCALL_ULONG_TYPE 2025-05-07T19:46:30.4961041Z #define _GLIBCXX_HAVE_ISNANF 1 2025-05-07T19:46:30.4961330Z #define __SHRT_WIDTH__ 16 2025-05-07T19:46:30.4961634Z #define __cpp_lib_robust_nonmodifying_seq_ops 201304 2025-05-07T19:46:30.4961999Z #define _GLIBCXX_BITS_SPECFUN_H 1 2025-05-07T19:46:30.4962286Z #define _GLIBCXX_HAVE_ISNANL 1 2025-05-07T19:46:30.4962598Z #define isblank_l(c,l) __isblank_l ((c), (l)) 2025-05-07T19:46:30.4962908Z #define _G_config_h 1 2025-05-07T19:46:30.4963215Z #define M_LOG2El 1.442695040888963407359924681001892137L 2025-05-07T19:46:30.4963589Z #define ADJ_OFFSET_SINGLESHOT 0x8001 2025-05-07T19:46:30.4963873Z #define _GCC_WCHAR_T 2025-05-07T19:46:30.4964134Z #define TMP_MAX 238328 2025-05-07T19:46:30.4964379Z #define __FLT32_IS_IEC_60559__ 2 2025-05-07T19:46:30.4964682Z #define 
__DEVICE_TYPES_H__ 2025-05-07T19:46:30.4964955Z #define __DEV_T_TYPE __UQUAD_TYPE 2025-05-07T19:46:30.4965267Z #define _EXT_NUMERIC_TRAITS 1 2025-05-07T19:46:30.4965551Z #define _GLIBCXX_BEGIN_NAMESPACE_ALGO 2025-05-07T19:46:30.4965868Z #define _IO_SKIPWS 01 2025-05-07T19:46:30.4966276Z #define cudaStreamGraphFireAndForgetAsSibling (cudaStream_t)0x0300000000000000 2025-05-07T19:46:30.4966837Z #define _IO_SCIENTIFIC 04000 2025-05-07T19:46:30.4967143Z #define _GLIBCXX_HAVE_STRING_H 1 2025-05-07T19:46:30.4967482Z #define __LDBL_MIN__ 3.36210314311209350626267781732175260e-4932L 2025-05-07T19:46:30.4967894Z #define cudaDeviceScheduleSpin 0x01 2025-05-07T19:46:30.4968276Z #define __nonnull(params) __attribute__ ((__nonnull__ params)) 2025-05-07T19:46:30.4968681Z #define __DBL_IS_IEC_60559__ 2 2025-05-07T19:46:30.4968943Z #define le32toh(x) (x) 2025-05-07T19:46:30.4969208Z #define _SIZE_T_DEFINED 2025-05-07T19:46:30.4969467Z #define _GLIBCXX_HAVE_XLOCALE_H 1 2025-05-07T19:46:30.4969837Z #define cudaArraySparsePropertiesSingleMipTail 0x1 2025-05-07T19:46:30.4970218Z #define __DEC32_MAX__ 9.999999E96DF 2025-05-07T19:46:30.4970626Z #define __WIFSIGNALED(status) (((signed char) (((status) & 0x7f) + 1) >> 1) > 0) 2025-05-07T19:46:30.4971077Z #define _GLIBCXX_HAVE_FMODL 1 2025-05-07T19:46:30.4971351Z #define _GLIBCXX_HAVE_POLL 1 2025-05-07T19:46:30.4971651Z #define __SM_32_INTRINSICS_H__ 2025-05-07T19:46:30.4971924Z #define _POSIX_NAME_MAX 14 2025-05-07T19:46:30.4972229Z #define __cpp_threadsafe_static_init 200806L 2025-05-07T19:46:30.4972781Z #define _GLIBCXX_MAKE_MOVE_IF_NOEXCEPT_ITERATOR(_Iter) std::__make_move_if_noexcept_iterator(_Iter) 2025-05-07T19:46:30.4973320Z #define _GLIBCXX_USE_CLOCK_REALTIME 1 2025-05-07T19:46:30.4973657Z #define __cpp_enumerator_attributes 201411L 2025-05-07T19:46:30.4974010Z #define __WCOREDUMP(status) ((status) & __WCOREFLAG) 2025-05-07T19:46:30.4974362Z #define _WCHAR_T_ 2025-05-07T19:46:30.4974594Z #define _GLIBCXX_FAST_MATH 0 
2025-05-07T19:46:30.4974992Z #define __FLT64X_DENORM_MIN__ 3.64519953188247460252840593361941982e-4951F64x 2025-05-07T19:46:30.4975391Z #define RTSIG_MAX 32 2025-05-07T19:46:30.4975642Z #define _STDDEF_H 2025-05-07T19:46:30.4975880Z #define CU_UUID_HAS_BEEN_DEFINED 2025-05-07T19:46:30.4976181Z #define _VA_LIST_DEFINED 2025-05-07T19:46:30.4976436Z #define __FLT32X_HAS_INFINITY__ 1 2025-05-07T19:46:30.4976802Z #define __glibcxx_requires_non_empty_range(_First,_Last) 2025-05-07T19:46:30.4977228Z #define __grid_constant__ __location__(grid_constant) 2025-05-07T19:46:30.4977565Z #define __INT32_MAX__ 0x7fffffff 2025-05-07T19:46:30.4980620Z #define _GLIBCXX_BEGIN_EXTERN_C extern "C" { 2025-05-07T19:46:30.4981199Z #define _PSTL_CPP14_INTEGER_SEQUENCE_PRESENT (_MSC_VER >= 1900 || __cplusplus >= 201402L) 2025-05-07T19:46:30.4981814Z #define __glibcxx_digits_b(T,B) (B - __glibcxx_signed_b (T,B)) 2025-05-07T19:46:30.4982400Z #define __SIZEOF_PTHREAD_COND_T 48 2025-05-07T19:46:30.4982776Z #define _PSTL_PRAGMA_SIMD_ORDERED_MONOTONIC(PRM) 2025-05-07T19:46:30.4983124Z #define __unix__ 1 2025-05-07T19:46:30.4983407Z #define __SM_60_ATOMIC_FUNCTIONS_H__ 2025-05-07T19:46:30.4983735Z #define __INT_WIDTH__ 32 2025-05-07T19:46:30.4984000Z #define __SIZEOF_LONG__ 8 2025-05-07T19:46:30.4984276Z #define _IONBF 2 2025-05-07T19:46:30.4984763Z #define __MATHCALLX(function,suffix,args,attrib) __MATHDECLX (_Mdouble_,function,suffix, args, attrib) 2025-05-07T19:46:30.4985659Z #define _IO_getc_unlocked(_fp) (_IO_BE ((_fp)->_IO_read_ptr >= (_fp)->_IO_read_end, 0) ? 
__uflow (_fp) : *(unsigned char *) (_fp)->_IO_read_ptr++) 2025-05-07T19:46:30.4986233Z #define __STDC_IEC_559__ 1 2025-05-07T19:46:30.4986511Z #define __STDC_ISO_10646__ 201103L 2025-05-07T19:46:30.4986798Z #define __UINT16_C(c) c 2025-05-07T19:46:30.4987040Z #define M_2_PI 0.63661977236758134308 2025-05-07T19:46:30.4987338Z #define STA_DEL 0x0020 2025-05-07T19:46:30.4987587Z #define __CUDACC_VER_MINOR__ 6 2025-05-07T19:46:30.4987859Z #define __id_t_defined 2025-05-07T19:46:30.4988138Z #define w_retcode __wait_terminated.__w_retcode 2025-05-07T19:46:30.4988636Z #define _IO_PENDING_OUTPUT_COUNT(_fp) ((_fp)->_IO_write_ptr - (_fp)->_IO_write_base) 2025-05-07T19:46:30.4989090Z #define _GLIBCXX_HAVE_MODFF 1 2025-05-07T19:46:30.4989386Z #define _GLIBCXX_HAVE_MODFL 1 2025-05-07T19:46:30.4989650Z #define __DECIMAL_DIG__ 21 2025-05-07T19:46:30.4989936Z #define _POSIX2_RE_DUP_MAX 255 2025-05-07T19:46:30.4990336Z #define __USE_FORTIFY_LEVEL 0 2025-05-07T19:46:30.4990606Z #define __STDC_IEC_559_COMPLEX__ 1 2025-05-07T19:46:30.4990887Z #define SING 2 2025-05-07T19:46:30.4991104Z #define STA_FREQHOLD 0x0080 2025-05-07T19:46:30.4991429Z #define __SM_32_ATOMIC_FUNCTIONS_HPP__ 2025-05-07T19:46:30.4991768Z #define cudaStreamDefault 0x00 2025-05-07T19:46:30.4992195Z #define __FLT64_EPSILON__ 2.22044604925031308084726333618164062e-16F64 2025-05-07T19:46:30.4992623Z #define _GLIBCXX_HAVE_HYPOTL 1 2025-05-07T19:46:30.4992964Z #define _GLIBCXX_HAVE_SYS_UIO_H 1 2025-05-07T19:46:30.4993275Z #define __gnu_linux__ 1 2025-05-07T19:46:30.4993582Z #define __INT16_MAX__ 0x7fff 2025-05-07T19:46:30.4994082Z #define _LARGEFILE_SOURCE 1 2025-05-07T19:46:30.4994352Z #define MAX_INPUT 255 2025-05-07T19:46:30.4994645Z #define __FLT64_MIN_EXP__ (-1021) 2025-05-07T19:46:30.4994987Z #define __isalpha_l(c,l) __isctype_l((c), _ISalpha, (l)) 2025-05-07T19:46:30.4995404Z #define __glibcxx_requires_heap(_First,_Last) 2025-05-07T19:46:30.4995744Z #define _GLIBCXX_CPU_DEFINES 1 2025-05-07T19:46:30.4996051Z 
#define _GLIBCXX_HAVE_POLL_H 1 2025-05-07T19:46:30.4996462Z #define __attribute_warn_unused_result__ __attribute__ ((__warn_unused_result__)) 2025-05-07T19:46:30.4996934Z #define _IO_SHOWPOS 02000 2025-05-07T19:46:30.4997301Z #define _GLIBCXX_HAVE_SYMVER_SYMBOL_RENAMING_RUNTIME_SUPPORT 1 2025-05-07T19:46:30.4997684Z #define _Mfloat_ float 2025-05-07T19:46:30.4997985Z #define __glibcxx_requires_cond(_Cond,_Msg) 2025-05-07T19:46:30.4998308Z #define __FLT64X_MIN_10_EXP__ (-4931) 2025-05-07T19:46:30.4998638Z #define DELAYTIMER_MAX 2147483647 2025-05-07T19:46:30.4999141Z #define __glibcxx_max_b(T,B) (__glibcxx_signed_b (T,B) ? (((((T)1 << (__glibcxx_digits_b (T,B) - 1)) - 1) << 1) + 1) : ~(T)0) 2025-05-07T19:46:30.4999690Z #define __LDBL_HAS_QUIET_NAN__ 1 2025-05-07T19:46:30.4999979Z #define _GLIBCXX98_USE_C99_STDIO 1 2025-05-07T19:46:30.5000340Z #define cudaKernelNodeAttrID cudaLaunchAttributeID 2025-05-07T19:46:30.5000742Z #define __glibcxx_class_requires2(_a,_b,_c) 2025-05-07T19:46:30.5001053Z #define __USE_ISOC11 1 2025-05-07T19:46:30.5001321Z #define _BSD_SIZE_T_ 2025-05-07T19:46:30.5001566Z #define ADJ_MICRO 0x1000 2025-05-07T19:46:30.5001853Z #define _GLIBCXX_HAVE_FABSF 1 2025-05-07T19:46:30.5002213Z #define _GLIBCXX_HAVE_FABSL 1 2025-05-07T19:46:30.5002559Z #define _PSTL_PRAGMA_SIMD _PSTL_PRAGMA(omp simd) 2025-05-07T19:46:30.5002894Z #define __FLT64_MANT_DIG__ 53 2025-05-07T19:46:30.5003246Z #define __attribute_const__ __attribute__ ((__const__)) 2025-05-07T19:46:30.5003596Z #define __THROW throw () 2025-05-07T19:46:30.5003897Z #define __cudaGet_gridDim() gridDim 2025-05-07T19:46:30.5004236Z #define __SM_60_ATOMIC_FUNCTIONS_HPP__ 2025-05-07T19:46:30.5004609Z #define __glibcxx_requires_heap_pred(_First,_Last,_Pred) 2025-05-07T19:46:30.5005005Z #define htobe32(x) __bswap_32 (x) 2025-05-07T19:46:30.5005292Z #define _GLIBCXX_HAVE_POWL 1 2025-05-07T19:46:30.5005592Z #define __FLT64X_MANT_DIG__ 64 2025-05-07T19:46:30.5005883Z #define __GLIBC_HAVE_LONG_LONG 1 
2025-05-07T19:46:30.5006188Z #define L_tmpnam 20 2025-05-07T19:46:30.5006425Z #define ___int_wchar_t_h 2025-05-07T19:46:30.5006811Z #define WIFCONTINUED(status) __WIFCONTINUED (__WAIT_INT (status)) 2025-05-07T19:46:30.5007250Z #define isascii(c) __isascii (c) 2025-05-07T19:46:30.5007525Z #define _T_PTRDIFF 2025-05-07T19:46:30.5007874Z #define _GLIBCXX_MOVE3(_Tp,_Up,_Vp) std::move(_Tp, _Up, _Vp) 2025-05-07T19:46:30.5008249Z #define toascii(c) __toascii (c) 2025-05-07T19:46:30.5008551Z #define __GNUC__ 11 2025-05-07T19:46:30.5008814Z #define __SYSCALL_ULONG_TYPE __ULONGWORD_TYPE 2025-05-07T19:46:30.5009155Z #define __GXX_RTTI 1 2025-05-07T19:46:30.5009389Z #define __pie__ 2 2025-05-07T19:46:30.5009640Z #define __MMX__ 1 2025-05-07T19:46:30.5009870Z #define __cudaCDP2Malloc 2025-05-07T19:46:30.5010164Z #define __timespec_defined 1 2025-05-07T19:46:30.5010426Z #define L_ctermid 9 2025-05-07T19:46:30.5010695Z #define __OFF64_T_TYPE __SQUAD_TYPE 2025-05-07T19:46:30.5011101Z #define __cudaCDP2GetParameterBufferV2 2025-05-07T19:46:30.5011498Z #define offsetof(TYPE,MEMBER) __builtin_offsetof (TYPE, MEMBER) 2025-05-07T19:46:30.5011916Z #define _BITS_POSIX2_LIM_H 1 2025-05-07T19:46:30.5012199Z #define _GLIBCXX98_USE_C99_STDLIB 1 2025-05-07T19:46:30.5012539Z #define cudaMemAttachGlobal 0x01 2025-05-07T19:46:30.5012856Z #define FD_SET(fd,fdsetp) __FD_SET (fd, fdsetp) 2025-05-07T19:46:30.5013215Z #define __FLT_HAS_DENORM__ 1 2025-05-07T19:46:30.5013490Z #define __SIZEOF_LONG_DOUBLE__ 16 2025-05-07T19:46:30.5013977Z #define _GLIBCXX_NATIVE_THREAD_ID (__gthread_active_p() ? __gthread_self() : (__gthread_t)1) 2025-05-07T19:46:30.5014788Z #define assert_perror(errnum) (!(errnum) ? 
__ASSERT_VOID_CAST (0) : __assert_perror_fail ((errnum), __FILE__, __LINE__, __ASSERT_FUNCTION)) 2025-05-07T19:46:30.5015421Z #define _IO_HAVE_ST_BLKSIZE _G_HAVE_ST_BLKSIZE 2025-05-07T19:46:30.5015767Z #define __USE_SVID 1 2025-05-07T19:46:30.5016032Z #define __constant__ __location__(constant) 2025-05-07T19:46:30.5016388Z #define _GLIBCXX_HAVE_POSIX_MEMALIGN 1 2025-05-07T19:46:30.5016695Z #define __device__ __location__(device) 2025-05-07T19:46:30.5017057Z #define _GLIBCXX_HAVE_EXCEPTION_PTR_SINCE_GCC46 1 2025-05-07T19:46:30.5017422Z #define _GLIBCXX_RES_LIMITS 1 2025-05-07T19:46:30.5017702Z #define M_1_PI 0.31830988618379067154 2025-05-07T19:46:30.5018022Z #define CUDART_DEVICE __device__ 2025-05-07T19:46:30.5018381Z #define __LDBL_REDIR1_NTH(name,proto,alias) name proto __THROW 2025-05-07T19:46:30.5018792Z #define M_PI_2 1.57079632679489661923 2025-05-07T19:46:30.5019086Z #define __BIGGEST_ALIGNMENT__ 16 2025-05-07T19:46:30.5019488Z #define cudaExternalSemaphoreWaitSkipNvSciBufMemSync 0x02 2025-05-07T19:46:30.5019953Z #define __STDC_UTF_16__ 1 2025-05-07T19:46:30.5020426Z #define LONG_MAX __LONG_MAX__ 2025-05-07T19:46:30.5020840Z #define __glibcxx_digits10_b(T,B) (__glibcxx_digits_b (T,B) * 643L / 2136) 2025-05-07T19:46:30.5021353Z #define _POSIX_THREAD_DESTRUCTOR_ITERATIONS 4 2025-05-07T19:46:30.5021754Z #define _POSIX_HOST_NAME_MAX 255 2025-05-07T19:46:30.5022063Z #define __FLT64_MAX_10_EXP__ 308 2025-05-07T19:46:30.5022391Z #define NGROUPS_MAX 65536 2025-05-07T19:46:30.5022672Z #define _GLIBCXX_NAMESPACE_LDBL 2025-05-07T19:46:30.5022996Z #define __USE_ISOC95 1 2025-05-07T19:46:30.5023313Z #define _TIME_H 1 2025-05-07T19:46:30.5023652Z #define M_LOG10El 0.434294481903251827651128918916605082L 2025-05-07T19:46:30.5024016Z #define __USE_ISOC99 1 2025-05-07T19:46:30.5024416Z #define __ASMNAME(cname) __ASMNAME2 (__USER_LABEL_PREFIX__, cname) 2025-05-07T19:46:30.5024876Z #define HOST_NAME_MAX 64 2025-05-07T19:46:30.5025159Z #define _POSIX_SEM_NSEMS_MAX 256 
2025-05-07T19:46:30.5025488Z #define _IOS_ATEND 4 2025-05-07T19:46:30.5025756Z #define __SM_35_INTRINSICS_H__ 2025-05-07T19:46:30.5026148Z #define WTERMSIG(status) __WTERMSIG (__WAIT_INT (status)) 2025-05-07T19:46:30.5026600Z #define cudaStreamAttrValue cudaLaunchAttributeValue 2025-05-07T19:46:30.5027017Z #define _GLIBCXX_HAVE_S_ISREG 1 2025-05-07T19:46:30.5027341Z #define cudaSurfaceTypeCubemap 0x0C 2025-05-07T19:46:30.5027730Z #define __cpp_delegating_constructors 200604L 2025-05-07T19:46:30.5028094Z #define __FLT32_HAS_INFINITY__ 1 2025-05-07T19:46:30.5028427Z #define _STDIO_H 1 2025-05-07T19:46:30.5028911Z #define __isctype_l(c,type,locale) ((locale)->__ctype_b[(int) (c)] & (unsigned short int) type) 2025-05-07T19:46:30.5029444Z #define _GLIBCXX_PREDEFINED_OPS_H 1 2025-05-07T19:46:30.5029883Z #define __DBL_MAX__ double(1.79769313486231570814527423731704357e+308L) 2025-05-07T19:46:30.5030313Z #define _G_IO_IO_FILE_VERSION 0x20001 2025-05-07T19:46:30.5030679Z #define _POSIX_SIGQUEUE_MAX 32 2025-05-07T19:46:30.5030985Z #define _GLIBCXX_HAVE_GETS 1 2025-05-07T19:46:30.5031322Z #define _GLIBCXX_HAVE_LINUX_TYPES_H 1 2025-05-07T19:46:30.5031647Z #define __cpp_raw_strings 200710L 2025-05-07T19:46:30.5032108Z #define __INT_FAST32_MAX__ 0x7fffffffffffffffL 2025-05-07T19:46:30.5032465Z #define _GLIBCXX_HAVE_VFWSCANF 1 2025-05-07T19:46:30.5032744Z #define __DBL_HAS_INFINITY__ 1 2025-05-07T19:46:30.5033146Z #define __STDCPP_MATH_SPEC_FUNCS__ 201003L 2025-05-07T19:46:30.5033461Z #define _GLIBCXX_STDIO_EOF -1 2025-05-07T19:46:30.5033774Z #define __SIZEOF_PTHREAD_MUTEX_T 40 2025-05-07T19:46:30.5034071Z #define __CHANNEL_DESCRIPTOR_H__ 2025-05-07T19:46:30.5034464Z #define _ISbit(bit) ((bit) < 8 ? 
((1 << (bit)) << 8) : ((1 << (bit)) >> 8)) 2025-05-07T19:46:30.5034843Z #define __SIZEOF_FLOAT__ 4 2025-05-07T19:46:30.5035125Z #define __USE_XOPEN 1 2025-05-07T19:46:30.5035376Z #define __SIZEOF_PTHREAD_RWLOCK_T 56 2025-05-07T19:46:30.5035855Z #define cudaStreamAttributeMemSyncDomain cudaLaunchAttributeMemSyncDomain 2025-05-07T19:46:30.5036334Z #define __USE_XOPEN2K 1 2025-05-07T19:46:30.5036587Z #define _PSTL_UDR_PRESENT 1 2025-05-07T19:46:30.5036888Z #define __HAVE_SPECULATION_SAFE_VALUE 1 2025-05-07T19:46:30.5037193Z #define _GLIBCXX_HAVE_COSF 1 2025-05-07T19:46:30.5037503Z #define __cpp_fold_expressions 201603L 2025-05-07T19:46:30.5038033Z #define cudaWaitExternalSemaphoresAsync __CUDART_API_PTSZ(cudaWaitExternalSemaphoresAsync_v2) 2025-05-07T19:46:30.5038612Z #define NL_LANGMAX _POSIX2_LINE_MAX 2025-05-07T19:46:30.5038901Z #define __DEC32_MIN_EXP__ (-94) 2025-05-07T19:46:30.5039306Z #define __glibcxx_requires_partitioned_upper(_First,_Last,_Value) 2025-05-07T19:46:30.5039751Z #define __DADDR_T_TYPE __S32_TYPE 2025-05-07T19:46:30.5040137Z #define cudaExternalSemaphoreSignalSkipNvSciBufMemSync 0x01 2025-05-07T19:46:30.5040566Z #define __END_NAMESPACE_C99 2025-05-07T19:46:30.5040849Z #define __glibcxx_integral_traps true 2025-05-07T19:46:30.5041172Z #define _POSIX_PATH_MAX 256 2025-05-07T19:46:30.5041441Z #define __INTPTR_WIDTH__ 64 2025-05-07T19:46:30.5041730Z #define __FLT64X_HAS_INFINITY__ 1 2025-05-07T19:46:30.5042009Z #define _ISOC11_SOURCE 1 2025-05-07T19:46:30.5042308Z #define _GLIBCXX_HAVE_LINUX_FUTEX 1 2025-05-07T19:46:30.5042635Z #define __UINT_LEAST32_MAX__ 0xffffffffU 2025-05-07T19:46:30.5042947Z #define _GLIBCXX_HAVE_QUICK_EXIT 1 2025-05-07T19:46:30.5043353Z #define __glibcxx_requires_irreflexive_pred2(_First,_Last,_Pred) 2025-05-07T19:46:30.5043751Z #define LONG_MIN (-LONG_MAX - 1L) 2025-05-07T19:46:30.5044066Z #define _GLIBCXX_HAVE_SINCOSF 1 2025-05-07T19:46:30.5044339Z #define _IO_UNITBUF 020000 2025-05-07T19:46:30.5044629Z #define 
_GLIBCXX_HAVE_SINCOSL 1 2025-05-07T19:46:30.5044953Z #define __FD_SETSIZE 1024 2025-05-07T19:46:30.5045237Z #define getc(_fp) _IO_getc (_fp) 2025-05-07T19:46:30.5045521Z #define be32toh(x) __bswap_32 (x) 2025-05-07T19:46:30.5045898Z #define _GLIBCXX_PACKAGE__GLIBCXX_VERSION "version-unused" 2025-05-07T19:46:30.5046297Z #define __FLT32X_HAS_DENORM__ 1 2025-05-07T19:46:30.5046577Z #define __INT_FAST16_TYPE__ long int 2025-05-07T19:46:30.5046937Z #define isxdigit_l(c,l) __isxdigit_l ((c), (l)) 2025-05-07T19:46:30.5047268Z #define _GLIBCXX_HAVE_GETIPINFO 1 2025-05-07T19:46:30.5047576Z #define __MMX_WITH_SSE__ 1 2025-05-07T19:46:30.5047884Z #define __isalnum_l(c,l) __isctype_l((c), _ISalnum, (l)) 2025-05-07T19:46:30.5048255Z #define _WCHAR_T_DEFINED_ 2025-05-07T19:46:30.5048549Z #define cudaIpcMemLazyEnablePeerAccess 0x01 2025-05-07T19:46:30.5048910Z #define _GLIBCXX_HAVE_AT_QUICK_EXIT 1 2025-05-07T19:46:30.5049237Z #define __INO_T_MATCHES_INO64_T 1 2025-05-07T19:46:30.5049517Z #define __USE_POSIX199506 1 2025-05-07T19:46:30.5049803Z #define _FEATURES_H 1 2025-05-07T19:46:30.5050046Z #define __LDBL_HAS_DENORM__ 1 2025-05-07T19:46:30.5050472Z #define _PSTL_PRAGMA_SIMD_REDUCTION(PRM) _PSTL_PRAGMA(omp simd reduction(PRM)) 2025-05-07T19:46:30.5050901Z #define __stub_getmsg 2025-05-07T19:46:30.5051166Z #define _IO_FIXED 010000 2025-05-07T19:46:30.5051444Z #define __cpp_lib_addressof_constexpr 201603 2025-05-07T19:46:30.5051788Z #define _GLIBCXX11_USE_C99_STDIO 1 2025-05-07T19:46:30.5052064Z #define __stub_setlogin 2025-05-07T19:46:30.5052337Z #define __stub_fattach 2025-05-07T19:46:30.5052612Z #define __cplusplus 201703L 2025-05-07T19:46:30.5052890Z #define __cpp_ref_qualifiers 200710L 2025-05-07T19:46:30.5053203Z #define _STRUCT_TIMEVAL 1 2025-05-07T19:46:30.5053534Z #define INFINITY (__builtin_inff()) 2025-05-07T19:46:30.5053844Z #define _IO_UNBUFFERED 2 2025-05-07T19:46:30.5054336Z #define cudaStreamAttributeSynchronizationPolicy cudaLaunchAttributeSynchronizationPolicy 
2025-05-07T19:46:30.5054897Z #define _IO_INTERNAL 010 2025-05-07T19:46:30.5055153Z #define __DEC32_MIN__ 1E-95DF 2025-05-07T19:46:30.5055522Z #define cudaKernelNodeAttrValue cudaLaunchAttributeValue 2025-05-07T19:46:30.5055895Z #define __dev_t_defined 2025-05-07T19:46:30.5056184Z #define __DEPRECATED 1 2025-05-07T19:46:30.5056454Z #define __S32_TYPE int 2025-05-07T19:46:30.5056712Z #define __cpp_rvalue_references 200610L 2025-05-07T19:46:30.5057040Z #define __DBL_MAX_EXP__ 1024 2025-05-07T19:46:30.5057303Z #define _IO_fpos_t _G_fpos_t 2025-05-07T19:46:30.5057597Z #define __WCHAR_WIDTH__ 32 2025-05-07T19:46:30.5058204Z #define cudaKernelNodeAttributePreferredSharedMemoryCarveout cudaLaunchAttributePreferredSharedMemoryCarveout 2025-05-07T19:46:30.5058888Z #define _G_HAVE_MREMAP 1 2025-05-07T19:46:30.5059212Z #define __FLT32_MAX__ 3.40282346638528859811704183484516925e+38F32 2025-05-07T19:46:30.5059652Z #define OVERFLOW 3 2025-05-07T19:46:30.5059963Z #define __toascii_l(c,l) ((l), __toascii (c)) 2025-05-07T19:46:30.5060484Z #define __DEC128_EPSILON__ 1E-33DL 2025-05-07T19:46:30.5060846Z #define __SM_32_ATOMIC_FUNCTIONS_H__ 2025-05-07T19:46:30.5061232Z #define _GLIBCXX_DEFAULT_ABI_TAG _GLIBCXX_ABI_TAG_CXX11 2025-05-07T19:46:30.5061640Z #define __SSE2_MATH__ 1 2025-05-07T19:46:30.5061917Z #define __ATOMIC_HLE_RELEASE 131072 2025-05-07T19:46:30.5062293Z #define __FSFILCNT_T_TYPE __SYSCALL_ULONG_TYPE 2025-05-07T19:46:30.5062637Z #define _IO_STDIO_H 2025-05-07T19:46:30.5062948Z #define PDP_ENDIAN __PDP_ENDIAN 2025-05-07T19:46:30.5063277Z #define isspace_l(c,l) __isspace_l ((c), (l)) 2025-05-07T19:46:30.5063672Z #define __cudaCDP2Memcpy2DAsync 2025-05-07T19:46:30.5064037Z #define __PTRDIFF_MAX__ 0x7fffffffffffffffL 2025-05-07T19:46:30.5064392Z #define _GLIBCXX_HAVE_STRERROR_R 1 2025-05-07T19:46:30.5064723Z #define __amd64 1 2025-05-07T19:46:30.5064983Z #define _POSIX_TZNAME_MAX 6 2025-05-07T19:46:30.5065302Z #define __cudaCDP2Memset3DAsync 2025-05-07T19:46:30.5065614Z 
#define __SYSCALL_WORDSIZE 64 2025-05-07T19:46:30.5065971Z #define _GLIBCXX_HAVE_ATTRIBUTE_VISIBILITY 1 2025-05-07T19:46:30.5066382Z #define _EXT_TYPE_TRAITS 1 2025-05-07T19:46:30.5066705Z #define _GLIBCXX_HAVE_POSIX_SEMAPHORE 1 2025-05-07T19:46:30.5067065Z #define _POSIX_RE_DUP_MAX 255 2025-05-07T19:46:30.5067354Z #define __STDC_NO_THREADS__ 1 2025-05-07T19:46:30.5067663Z #define __bounded 2025-05-07T19:46:30.5067919Z #define __USECONDS_T_TYPE __U32_TYPE 2025-05-07T19:46:30.5068268Z #define _IO_DELETE_DONT_CLOSE 0x40 2025-05-07T19:46:30.5068580Z #define __BEGIN_NAMESPACE_STD 2025-05-07T19:46:30.5068896Z #define _PTRDIFF_T_DECLARED 2025-05-07T19:46:30.5069195Z #define __OFF_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:46:30.5069567Z #define __W_STOPCODE(sig) ((sig) << 8 | 0x7f) 2025-05-07T19:46:30.5070020Z #define cudaStreamAttributePriority cudaLaunchAttributePriority 2025-05-07T19:46:30.5070489Z #define _GLIBCXX_HAVE_NETDB_H 1 2025-05-07T19:46:30.5070812Z #define __SM_20_INTRINSICS_HPP__ 2025-05-07T19:46:30.5071180Z #define __cpp_lib_has_unique_object_representations 201606 2025-05-07T19:46:30.5071594Z #define STA_PLL 0x0001 2025-05-07T19:46:30.5071864Z #define __ATOMIC_HLE_ACQUIRE 65536 2025-05-07T19:46:30.5072185Z #define __GNUG__ 11 2025-05-07T19:46:30.5072547Z #define _GLIBCXX_USE_GET_NPROCS 1 2025-05-07T19:46:30.5072858Z #define _T_WCHAR 2025-05-07T19:46:30.5073112Z #define __cudaCDP2GetDeviceCount 2025-05-07T19:46:30.5073448Z #define __specialization_static 2025-05-07T19:46:30.5073780Z #define __LONG_LONG_MAX__ 0x7fffffffffffffffLL 2025-05-07T19:46:30.5074152Z #define __SIZEOF_SIZE_T__ 8 2025-05-07T19:46:30.5074466Z #define cudaArraySparse 0x40 2025-05-07T19:46:30.5074758Z #define STA_PPSFREQ 0x0002 2025-05-07T19:46:30.5075053Z #define __GLIBCXX__ 20230528 2025-05-07T19:46:30.5075359Z #define _IO_stdin ((_IO_FILE*)(&_IO_2_1_stdin_)) 2025-05-07T19:46:30.5075776Z #define _WCHAR_T 2025-05-07T19:46:30.5076018Z #define __cudaCDP2Free 2025-05-07T19:46:30.5076762Z 
#define __FD_ZERO(fdsp) do { int __d0, __d1; __asm__ __volatile__ ("cld; rep; " __FD_ZERO_STOS : "=c" (__d0), "=D" (__d1) : "a" (0), "0" (sizeof (fd_set) / sizeof (__fd_mask)), "1" (&__FDS_BITS (fdsp)[0]) : "memory"); } while (0) 2025-05-07T19:46:30.5077563Z #define __cpp_nsdmi 200809L 2025-05-07T19:46:30.5078114Z #define __glibcxx_min_b(T,B) (__glibcxx_signed_b (T,B) ? -__glibcxx_max_b (T,B) - 1 : (T)0) 2025-05-07T19:46:30.5078612Z #define __FLT64X_MIN_EXP__ (-16381) 2025-05-07T19:46:30.5078906Z #define __SIZEOF_WINT_T__ 4 2025-05-07T19:46:30.5079215Z #define cudaArrayCubemap 0x04 2025-05-07T19:46:30.5079567Z #define _PSTL_MONOTONIC_PRESENT (__INTEL_COMPILER >= 1800) 2025-05-07T19:46:30.5079968Z #define _GLIBCXX_UTILITY 1 2025-05-07T19:46:30.5080224Z #define __NO_CTYPE 1 2025-05-07T19:46:30.5080485Z #define __stub_bdflush 2025-05-07T19:46:30.5080851Z #define _GLIBCXX_MAKE_MOVE_ITERATOR(_Iter) std::make_move_iterator(_Iter) 2025-05-07T19:46:30.5081314Z #define __CORRECT_ISO_CPP_STRING_H_PROTO 2025-05-07T19:46:30.5081656Z #define _GLIBCXX_STDC_HEADERS 1 2025-05-07T19:46:30.5082074Z #define __LONG_LONG_WIDTH__ 64 2025-05-07T19:46:30.5082591Z #define __cpp_initializer_lists 200806L 2025-05-07T19:46:30.5082936Z #define _GLIBCXX_HAVE_NETINET_TCP_H 1 2025-05-07T19:46:30.5083300Z #define __U16_TYPE unsigned short int 2025-05-07T19:46:30.5083673Z #define __glibcxx_requires_can_increment(_First,_Size) 2025-05-07T19:46:30.5084096Z #define _GLIBCXX_HAVE_SYS_PARAM_H 1 2025-05-07T19:46:30.5084411Z #define __FLT32_MAX_EXP__ 128 2025-05-07T19:46:30.5084745Z #define cudaHostRegisterIoMemory 0x04 2025-05-07T19:46:30.5085140Z #define __FD_MASK(d) ((__fd_mask) 1 << ((d) % __NFDBITS)) 2025-05-07T19:46:30.5085517Z #define __cpp_lib_is_invocable 201703 2025-05-07T19:46:30.5085857Z #define _IO_STDIO 040000 2025-05-07T19:46:30.5086213Z #define _SIGSET_NWORDS (1024 / (8 * sizeof (unsigned long int))) 2025-05-07T19:46:30.5086663Z #define cudaSurfaceType1DLayered 0xF1 
2025-05-07T19:46:30.5087006Z #define cudaArraySurfaceLoadStore 0x02 2025-05-07T19:46:30.5087351Z #define _PTRDIFF_T 2025-05-07T19:46:30.5087586Z #define _MOVE_H 1 2025-05-07T19:46:30.5087859Z #define __cpp_hex_float 201603L 2025-05-07T19:46:30.5088286Z #define ADJ_TAI 0x0080 2025-05-07T19:46:30.5088572Z #define __ptrvalue 2025-05-07T19:46:30.5088851Z #define _GLIBCXX_HOSTED 1 2025-05-07T19:46:30.5089129Z #define __GXX_ABI_VERSION 1016 2025-05-07T19:46:30.5089475Z #define __WTERMSIG(status) ((status) & 0x7f) 2025-05-07T19:46:30.5089810Z #define MATH_ERREXCEPT 2 2025-05-07T19:46:30.5090118Z #define _GLIBCXX_HAS_GTHREADS 1 2025-05-07T19:46:30.5090429Z #define cudaTextureType2DLayered 0xF2 2025-05-07T19:46:30.5090895Z #define __isleap(year) ((year) % 4 == 0 && ((year) % 100 != 0 || (year) % 400 == 0)) 2025-05-07T19:46:30.5091324Z #define __USE_GNU 1 2025-05-07T19:46:30.5091606Z #define __FLT128_HAS_INFINITY__ 1 2025-05-07T19:46:30.5091948Z #define __FLT_MIN_EXP__ (-125) 2025-05-07T19:46:30.5092248Z #define __GCC_HAVE_DWARF2_CFI_ASM 1 2025-05-07T19:46:30.5092709Z #define __FD_CLR(d,set) ((void) (__FDS_BITS (set)[__FD_ELT (d)] &= ~__FD_MASK (d))) 2025-05-07T19:46:30.5093141Z #define WEXITED 4 2025-05-07T19:46:30.5093409Z #define _IO_NO_READS 4 2025-05-07T19:46:30.5093736Z #define cudaGraphKernelNodePortLaunchCompletion 2 2025-05-07T19:46:30.5094145Z #define M_LOG2E 1.4426950408889634074 2025-05-07T19:46:30.5094457Z #define _POSIX_SYMLINK_MAX 255 2025-05-07T19:46:30.5094812Z #define _GLIBCXX_HAVE_BUILTIN_HAS_UNIQ_OBJ_REP 1 2025-05-07T19:46:30.5095163Z #define __uid_t_defined 2025-05-07T19:46:30.5095471Z #define __FD_ELT(d) ((d) / __NFDBITS) 2025-05-07T19:46:30.5096039Z #define _GLIBCXX_USE_STD_SPEC_FUNCS 1 2025-05-07T19:46:30.5096331Z #define WNOHANG 1 2025-05-07T19:46:30.5096624Z #define alloca(size) __builtin_alloca (size) 2025-05-07T19:46:30.5096948Z #define _GLIBCXX_HAVE_HYPOTF 1 2025-05-07T19:46:30.5097269Z #define cudaEventDefault 0x00 2025-05-07T19:46:30.5097669Z 
#define __maxnreg__(a) __attribute__((maxnreg(a))) 2025-05-07T19:46:30.5098037Z #define NL_SETMAX INT_MAX 2025-05-07T19:46:30.5098292Z #define __x86_64 1 2025-05-07T19:46:30.5098573Z #define __cudaCDP2LaunchDevice 2025-05-07T19:46:30.5098991Z #define __REDIRECT(name,proto,alias) name proto __asm__ (__ASMNAME (#alias)) 2025-05-07T19:46:30.5099527Z #define _GLIBCXX_BEGIN_NAMESPACE_CXX11 namespace __cxx11 { 2025-05-07T19:46:30.5100318Z #define __extern_always_inline extern __always_inline __attribute__ ((__gnu_inline__)) 2025-05-07T19:46:30.5100873Z #define __PTRDIFF_T 2025-05-07T19:46:30.5101271Z #define __exctype_l(name) extern int name (int, __locale_t) __THROW 2025-05-07T19:46:30.5101702Z #define _GLIBCXX_HAVE_FINITEL 1 2025-05-07T19:46:30.5102045Z #define __SM_35_ATOMIC_FUNCTIONS_H__ 2025-05-07T19:46:30.5102369Z #define _Mlong_double_ long double 2025-05-07T19:46:30.5102710Z #define __cpp_lambdas 200907L 2025-05-07T19:46:30.5103000Z #define _IO_DEC 020 2025-05-07T19:46:30.5103290Z #define _GLIBCXX_HAVE_SINHL 1 2025-05-07T19:46:30.5103620Z #define _POSIX_CLOCKRES_MIN 20000000 2025-05-07T19:46:30.5103941Z #define __INT_FAST64_TYPE__ long int 2025-05-07T19:46:30.5104290Z #define ADJ_TIMECONST 0x0020 2025-05-07T19:46:30.5104586Z #define _GLIBCXX_HAVE_SQRTL 1 2025-05-07T19:46:30.5104938Z #define __cudaCDP2DeviceGetSharedMemConfig 2025-05-07T19:46:30.5105294Z #define _GLIBCXX_HAVE_STDALIGN_H 1 2025-05-07T19:46:30.5105621Z #define _ANSI_STDDEF_H 2025-05-07T19:46:30.5105917Z #define _GLIBCXX_MOVE(__val) std::move(__val) 2025-05-07T19:46:30.5106286Z #define _GLIBCXX_HAVE_STRERROR_L 1 2025-05-07T19:46:30.5106710Z #define __FLT64_DENORM_MIN__ 4.94065645841246544176568792868221372e-324F64 2025-05-07T19:46:30.5107136Z #define _GLIBCXX_USE_DEV_RANDOM 1 2025-05-07T19:46:30.5107473Z #define _STL_ITERATOR_BASE_TYPES_H 1 2025-05-07T19:46:30.5107793Z #define __cpp_template_auto 201606L 2025-05-07T19:46:30.5108213Z #define __DBL_MIN__ 
double(2.22507385850720138309023271733240406e-308L) 2025-05-07T19:46:30.5108626Z #define _GLIBCXX_HAVE_SYS_SEM_H 1 2025-05-07T19:46:30.5108952Z #define __key_t_defined 2025-05-07T19:46:30.5109232Z #define _IO_MAGIC_MASK 0xFFFF0000 2025-05-07T19:46:30.5109664Z #define __cluster_dims__(...) __attribute__((cluster_dims(__VA_ARGS__))) 2025-05-07T19:46:30.5110281Z #define __FLT128_EPSILON__ 1.92592994438723585305597794258492732e-34F128 2025-05-07T19:46:30.5110693Z #define __GNUC_VA_LIST 2025-05-07T19:46:30.5111100Z #define __FLT64X_NORM_MAX__ 1.18973149535723176502126385303097021e+4932F64x 2025-05-07T19:46:30.5111532Z #define __SIZEOF_POINTER__ 8 2025-05-07T19:46:30.5111858Z #define CLOCK_REALTIME_COARSE 5 2025-05-07T19:46:30.5112170Z #define _GLIBCXX14_CONSTEXPR constexpr 2025-05-07T19:46:30.5112612Z #define __USE_XOPEN2KXSI 1 2025-05-07T19:46:30.5112873Z #define __WCOREFLAG 0x80 2025-05-07T19:46:30.5113161Z #define M_2_SQRTPI 1.12837916709551257390 2025-05-07T19:46:30.5113474Z #define cudaEventDisableTiming 0x02 2025-05-07T19:46:30.5113792Z #define __LP64__ 1 2025-05-07T19:46:30.5114082Z #define __isascii_l(c,l) ((l), __isascii (c)) 2025-05-07T19:46:30.5114412Z #define cudaStreamNonBlocking 0x01 2025-05-07T19:46:30.5114735Z #define _IO_off64_t __off64_t 2025-05-07T19:46:30.5115007Z #define __DBL_HAS_QUIET_NAN__ 1 2025-05-07T19:46:30.5115282Z #define __time_t_defined 1 2025-05-07T19:46:30.5115526Z #define _POSIX_SYMLOOP_MAX 8 2025-05-07T19:46:30.5115871Z #define __FLT32X_EPSILON__ 2.22044604925031308084726333618164062e-16F32x 2025-05-07T19:46:30.5116234Z #define __USE_UNIX98 1 2025-05-07T19:46:30.5116474Z #define __MODE_T_TYPE __U32_TYPE 2025-05-07T19:46:30.5116739Z #define CLOCK_REALTIME_ALARM 8 2025-05-07T19:46:30.5117045Z #define _GLIBCXX_HAVE_STRINGS_H 1 2025-05-07T19:46:30.5117378Z #define __LEAF_ATTR __attribute__ ((__leaf__)) 2025-05-07T19:46:30.5117701Z #define __DECIMAL_BID_FORMAT__ 1 2025-05-07T19:46:30.5117997Z #define SEEK_CUR 1 2025-05-07T19:46:30.5118238Z 
#define __RLIM64_T_TYPE __UQUAD_TYPE 2025-05-07T19:46:30.5118543Z #define _ASSERT_H 1 2025-05-07T19:46:30.5119217Z #define _PSTL_PRAGMA_DECLARE_REDUCTION(NAME,OP) _PSTL_PRAGMA(omp declare reduction(NAME:OP : omp_out(omp_in)) initializer(omp_priv = omp_orig)) 2025-05-07T19:46:30.5119910Z #define _GLIBCXX_USE_DEPRECATED 1 2025-05-07T19:46:30.5120200Z #define CHAR_MAX SCHAR_MAX 2025-05-07T19:46:30.5120499Z #define _GLIBCXX_HAVE_SETENV 1 2025-05-07T19:46:30.5120807Z #define NL_ARGMAX _POSIX_ARG_MAX 2025-05-07T19:46:30.5121095Z #define _GLIBCXX_USE_UTIMENSAT 1 2025-05-07T19:46:30.5121509Z #define __extern_inline extern __inline __attribute__ ((__gnu_inline__)) 2025-05-07T19:46:30.5121942Z #define _GLIBCXX_DEBUG_ONLY(_Statement) 2025-05-07T19:46:30.5122657Z #define _IO_putc_unlocked(_ch,_fp) (_IO_BE ((_fp)->_IO_write_ptr >= (_fp)->_IO_write_end, 0) ? __overflow (_fp, (unsigned char) (_ch)) : (unsigned char) (*(_fp)->_IO_write_ptr++ = (_ch))) 2025-05-07T19:46:30.5123347Z #define _GLIBCXX_HAVE_BUILTIN_LAUNDER 1 2025-05-07T19:46:30.5123677Z #define _IO_BOOLALPHA 0200000 2025-05-07T19:46:30.5124061Z #define _PSTL_CPP17_EXECUTION_POLICIES_PRESENT (_MSC_VER >= 1912) 2025-05-07T19:46:30.5124456Z #define _GLIBCXX_PACKAGE_URL "" 2025-05-07T19:46:30.5124753Z #define __FLT64_MIN_10_EXP__ (-307) 2025-05-07T19:46:30.5125048Z #define cudaArrayDefault 0x00 2025-05-07T19:46:30.5125358Z #define __cudaCDP2LaunchDeviceV2 2025-05-07T19:46:30.5125658Z #define __FDS_BITS(set) ((set)->fds_bits) 2025-05-07T19:46:30.5125981Z #define TLOSS 5 2025-05-07T19:46:30.5126206Z #define __ssize_t_defined 2025-05-07T19:46:30.5126500Z #define __CUDACC_VER_BUILD__ 85 2025-05-07T19:46:30.5126782Z #define _GLIBCXX_HAVE_SYS_SOCKET_H 1 2025-05-07T19:46:30.5127116Z #define ULONG_MAX (LONG_MAX * 2UL + 1UL) 2025-05-07T19:46:30.5127452Z #define __FLT64X_DECIMAL_DIG__ 21 2025-05-07T19:46:30.5127821Z #define _GLIBCXX_NAMESPACE_LDBL_OR_CXX11 _GLIBCXX_NAMESPACE_CXX11 2025-05-07T19:46:30.5128263Z #define _POSIX_HIWAT 
_POSIX_PIPE_BUF 2025-05-07T19:46:30.5128562Z #define __DEC128_MIN__ 1E-6143DL 2025-05-07T19:46:30.5128897Z #define __cudaCDP2EventRecordWithFlags 2025-05-07T19:46:30.5129224Z #define _GLIBCXX_ATOMIC_BUILTINS 1 2025-05-07T19:46:30.5129540Z #define cudaPeerAccessDefault 0x00 2025-05-07T19:46:30.5151361Z #define __REGISTER_PREFIX__ 2025-05-07T19:46:30.5151654Z #define __UINT16_MAX__ 0xffff 2025-05-07T19:46:30.5152174Z #define __glibcxx_requires_sorted_set(_First1,_Last1,_First2) 2025-05-07T19:46:30.5152676Z #define _IOS_NOREPLACE 64 2025-05-07T19:46:30.5152963Z #define __cdecl 2025-05-07T19:46:30.5153216Z #define cudaEventInterprocess 0x04 2025-05-07T19:46:30.5153576Z #define M_SQRT1_2l 0.707106781186547524400844362104849039L 2025-05-07T19:46:30.5153944Z #define LOGIN_NAME_MAX 256 2025-05-07T19:46:30.5154209Z #define _IO_TIED_PUT_GET 0x400 2025-05-07T19:46:30.5154522Z #define X_TLOSS 1.41484755040568800000e+16 2025-05-07T19:46:30.5154818Z #define CUDA_IPC_HANDLE_SIZE 64 2025-05-07T19:46:30.5155088Z #define __LDBL_HAS_INFINITY__ 1 2025-05-07T19:46:30.5155380Z #define __attribute_pure__ __attribute__ ((__pure__)) 2025-05-07T19:46:30.5155730Z #define __TEXTURE_TYPES_H__ 2025-05-07T19:46:30.5156144Z #define __NV_GLIBCXX_VERSION (__GNUC__ * 10000 + __GNUC_MINOR__ * 100 + __GNUC_PATCHLEVEL__) 2025-05-07T19:46:30.5156592Z #define ADJ_NANO 0x2000 2025-05-07T19:46:30.5156913Z #define __FLT32_MIN__ 1.17549435082228750796873653722224568e-38F32 2025-05-07T19:46:30.5157266Z #define __UINT8_TYPE__ unsigned char 2025-05-07T19:46:30.5157589Z #define _GLIBCXX_HAVE_ISWBLANK 1 2025-05-07T19:46:30.5157847Z #define __FLT_DIG__ 6 2025-05-07T19:46:30.5158216Z #define __REDIRECT_LDBL(name,proto,alias) __REDIRECT (name, proto, alias) 2025-05-07T19:46:30.5158604Z #define __NO_INLINE__ 1 2025-05-07T19:46:30.5158934Z #define _PSTL_EARLYEXIT_PRESENT (__INTEL_COMPILER >= 1800) 2025-05-07T19:46:30.5159286Z #define _POSIX_NGROUPS_MAX 8 2025-05-07T19:46:30.5159552Z #define ADJ_STATUS 0x0010 
2025-05-07T19:46:30.5159811Z #define __cudaCDP2MemcpyAsync_ptsz 2025-05-07T19:46:30.5160100Z #define CLOCK_BOOTTIME_ALARM 9 2025-05-07T19:46:30.5160385Z #define LONG_LONG_MAX __LONG_LONG_MAX__ 2025-05-07T19:46:30.5160752Z #define _GLIBCXX_HAVE_OBSOLETE_ISNAN 1 2025-05-07T19:46:30.5161049Z #define __DEC_EVAL_METHOD__ 2 2025-05-07T19:46:30.5161437Z #define cudaStreamGraphFireAndForget (cudaStream_t)0x0200000000000000 2025-05-07T19:46:30.5161879Z #define _GLIBCXX_HAVE_ALIGNED_ALLOC 1 2025-05-07T19:46:30.5162217Z #define __DEC128_MAX__ 9.999999999999999999999999999999999E6144DL 2025-05-07T19:46:30.5162584Z #define CHAR_MIN SCHAR_MIN 2025-05-07T19:46:30.5162848Z #define MAX_CANON 255 2025-05-07T19:46:30.5163088Z #define __FLT_MANT_DIG__ 24 2025-05-07T19:46:30.5163361Z #define __LDBL_DECIMAL_DIG__ 21 2025-05-07T19:46:30.5163611Z #define _GLIBCXX_HAVE_COMPLEX_H 1 2025-05-07T19:46:30.5163917Z #define _PSTL_PRAGMA_VECTOR_UNALIGNED 2025-05-07T19:46:30.5164236Z #define _POSIX_FD_SETSIZE _POSIX_OPEN_MAX 2025-05-07T19:46:30.5164552Z #define _GLIBCXX_HAVE_HYPOT 1 2025-05-07T19:46:30.5164818Z #define __cudaCDP2Memset2DAsync_ptsz 2025-05-07T19:46:30.5165176Z #define _GLIBCXX_TR1_MODIFIED_BESSEL_FUNC_TCC 1 2025-05-07T19:46:30.5165493Z #define __VERSION__ "11.4.0" 2025-05-07T19:46:30.5165783Z #define _GLIBCXX11_USE_C99_STDLIB 1 2025-05-07T19:46:30.5166092Z #define cudaHostRegisterMapped 0x02 2025-05-07T19:46:30.5166368Z #define _GLIBCXX_HAVE_INT64_T 1 2025-05-07T19:46:30.5166666Z #define _GLIBCXX_USE_CONSTEXPR constexpr 2025-05-07T19:46:30.5166976Z #define FD_ZERO(fdsetp) __FD_ZERO (fdsetp) 2025-05-07T19:46:30.5167470Z #define __UINT64_C(c) c ## UL 2025-05-07T19:46:30.5167736Z #define MOD_OFFSET ADJ_OFFSET 2025-05-07T19:46:30.5168020Z #define _SYS_TYPES_H 1 2025-05-07T19:46:30.5168264Z #define AIO_PRIO_DELTA_MAX 20 2025-05-07T19:46:30.5168533Z #define _GLIBCXX_HAVE_TANHF 1 2025-05-07T19:46:30.5168772Z #define _SYS_CDEFS_H 1 2025-05-07T19:46:30.5169036Z #define _GLIBCXX_HAVE_TANHL 1 
2025-05-07T19:46:30.5169345Z #define __cpp_unicode_characters 201411L 2025-05-07T19:46:30.5169649Z #define _IO_ERR_SEEN 0x20 2025-05-07T19:46:30.5169928Z #define _GLIBCXX_USE_DECIMAL_FLOAT 1 2025-05-07T19:46:30.5170228Z #define __cudaCDP2StreamDestroy 2025-05-07T19:46:30.5170535Z #define FP_SUBNORMAL 3 2025-05-07T19:46:30.5170807Z #define cudaOccupancyDefault 0x00 2025-05-07T19:46:30.5171121Z #define _INITIALIZER_LIST 2025-05-07T19:46:30.5171386Z #define _STDC_PREDEF_H 1 2025-05-07T19:46:30.5171681Z #define __CUDA_RUNTIME_API_H__ 2025-05-07T19:46:30.5172042Z #define _GLIBCXX_PACKAGE_BUGREPORT "" 2025-05-07T19:46:30.5172377Z #define _GLIBCXX_HAVE_MODF 1 2025-05-07T19:46:30.5172683Z #define _IO_file_flags _flags 2025-05-07T19:46:30.5172968Z #define __USE_XOPEN2K8 1 2025-05-07T19:46:30.5173269Z #define htobe64(x) __bswap_64 (x) 2025-05-07T19:46:30.5173572Z #define _OLD_STDIO_MAGIC 0xFABC0000 2025-05-07T19:46:30.5173894Z #define HUGE 3.40282347e+38F 2025-05-07T19:46:30.5174185Z #define __cpp_lib_is_null_pointer 201309 2025-05-07T19:46:30.5174598Z #define WEXITSTATUS(status) __WEXITSTATUS (__WAIT_INT (status)) 2025-05-07T19:46:30.5175024Z #define islower_l(c,l) __islower_l ((c), (l)) 2025-05-07T19:46:30.5175369Z #define _GLIBCXX_USE_CXX11_ABI 1 2025-05-07T19:46:30.5175666Z #define _GLIBCXX_HAVE_SYMLINK 1 2025-05-07T19:46:30.5175944Z #define _BSD_SOURCE 1 2025-05-07T19:46:30.5176223Z #define _GLIBCXX_THROW(_EXC) 2025-05-07T19:46:30.5177158Z #define _GLIBCXX_HAS_NESTED_TYPE(_NTYPE) template<typename _Tp, typename = __void_t<>> struct __has_ ##_NTYPE : false_type { }; template<typename _Tp> struct __has_ ##_NTYPE<_Tp, __void_t<typename _Tp::_NTYPE>> : true_type { }; 2025-05-07T19:46:30.5178129Z #define __catch(X) catch(X) 2025-05-07T19:46:30.5178411Z #define __INT_LEAST32_MAX__ 0x7fffffff 2025-05-07T19:46:30.5178733Z #define LINE_MAX _POSIX2_LINE_MAX 2025-05-07T19:46:30.5179024Z #define __TIMER_T_TYPE void * 2025-05-07T19:46:30.5179321Z #define __STRING(x) #x 2025-05-07T19:46:30.5179678Z #define __GCC_ATOMIC_INT_LOCK_FREE 2 
2025-05-07T19:46:30.5179990Z #define _T_PTRDIFF_ 2025-05-07T19:46:30.5180454Z #define _GLIBCXX_USE_NOEXCEPT noexcept 2025-05-07T19:46:30.5180791Z #define cudaEventWaitExternal 0x01 2025-05-07T19:46:30.5181125Z #define __unbounded 2025-05-07T19:46:30.5181462Z #define __DEVICE_ATOMIC_FUNCTIONS_H__ 2025-05-07T19:46:30.5181805Z #define __FLT128_MAX_EXP__ 16384 2025-05-07T19:46:30.5183020Z #define __INO_T_TYPE __SYSCALL_ULONG_TYPE 2025-05-07T19:46:30.5183376Z #define be16toh(x) __bswap_16 (x) 2025-05-07T19:46:30.5183685Z #define __cpp_lib_is_final 201402L 2025-05-07T19:46:30.5184033Z #define _GLIBCXX_BEGIN_NAMESPACE_CONTAINER 2025-05-07T19:46:30.5184411Z #define LONG_LONG_MIN (-LONG_LONG_MAX - 1LL) 2025-05-07T19:46:30.5184741Z #define __MATH_DECLARE_LDOUBLE 1 2025-05-07T19:46:30.5185067Z #define __managed__ __location__(managed) 2025-05-07T19:46:30.5185396Z #define _POSIX2_EXPR_NEST_MAX 32 2025-05-07T19:46:30.5185854Z #define __GNUC_PREREQ(maj,min) ((__GNUC__ << 16) + __GNUC_MINOR__ >= ((maj) << 16) + (min)) 2025-05-07T19:46:30.5186319Z #define _POSIX_STREAM_MAX 8 2025-05-07T19:46:30.5186628Z #define __LIBRARY_TYPES_H__ 2025-05-07T19:46:30.5187036Z #define _GLIBCXX_END_NAMESPACE_LDBL_OR_CXX11 _GLIBCXX_END_NAMESPACE_CXX11 2025-05-07T19:46:30.5187500Z #define __FLT32_MANT_DIG__ 24 2025-05-07T19:46:30.5187806Z #define _SYS_SIZE_T_H 2025-05-07T19:46:30.5188132Z #define _PSTL_VERSION_MINOR ((_PSTL_VERSION % 1000) / 10) 2025-05-07T19:46:30.5188538Z #define _GLIBCXX_STDLIB_H 1 2025-05-07T19:46:30.5188845Z #define isupper_l(c,l) __isupper_l ((c), (l)) 2025-05-07T19:46:30.5189196Z #define _CRTIMP 2025-05-07T19:46:30.5189436Z #define _GLIBCXX_CXX_CONFIG_H 1 2025-05-07T19:46:30.5189781Z #define __FLOAT_WORD_ORDER__ __ORDER_LITTLE_ENDIAN__ 2025-05-07T19:46:30.5190136Z #define STA_PPSJITTER 0x0200 2025-05-07T19:46:30.5190539Z #define _IO_feof_unlocked(__fp) (((__fp)->_flags & _IO_EOF_SEEN) != 0) 2025-05-07T19:46:30.5191018Z #define __SUSECONDS_T_TYPE __SYSCALL_SLONG_TYPE 
2025-05-07T19:46:30.5191364Z #define _GLIBCXX_HAVE_ISINFF 1 2025-05-07T19:46:30.5191690Z #define __glibcxx_requires_subscript(_N) 2025-05-07T19:46:30.5192002Z #define __SIZE_T__ 2025-05-07T19:46:30.5192264Z #define __stub_gtty 2025-05-07T19:46:30.5192514Z #define __pid_t_defined 2025-05-07T19:46:30.5192818Z #define _GLIBCXX_FWDREF(_Tp) _Tp&& 2025-05-07T19:46:30.5193140Z #define __NLINK_T_TYPE __SYSCALL_ULONG_TYPE 2025-05-07T19:46:30.5193503Z #define __glibcxx_function_requires(...) 2025-05-07T19:46:30.5193825Z #define __SM_80_RT_HPP__ 2025-05-07T19:46:30.5194103Z #define __need_clockid_t 2025-05-07T19:46:30.5194628Z #define SSIZE_MAX LONG_MAX 2025-05-07T19:46:30.5194896Z #define _GLIBCXX_HAVE_USELOCALE 1 2025-05-07T19:46:30.5195248Z #define __glibcxx_requires_string_len(_String,_Len) 2025-05-07T19:46:30.5195579Z #define _IO_HEX 0100 2025-05-07T19:46:30.5195873Z #define __NFDBITS (8 * (int) sizeof (__fd_mask)) 2025-05-07T19:46:30.5196216Z #define cudaExternalMemoryDedicated 0x1 2025-05-07T19:46:30.5196559Z #define _GLIBCXX_HAVE_TGMATH_H 1 2025-05-07T19:46:30.5196842Z #define _GLIBCXX11_USE_C99_COMPLEX 1 2025-05-07T19:46:30.5197279Z #define _GLIBCXX17_DEPRECATED_SUGGEST(ALT) _GLIBCXX_DEPRECATED_SUGGEST(ALT) 2025-05-07T19:46:30.5197726Z #define ispunct_l(c,l) __ispunct_l ((c), (l)) 2025-05-07T19:46:30.5198067Z #define __cpp_aggregate_bases 201603L 2025-05-07T19:46:30.5198382Z #define __cudaGet_blockDim() blockDim 2025-05-07T19:46:30.5198660Z #define __cudaCDP2Memcpy3DAsync 2025-05-07T19:46:30.5198768Z #define __cudaCDP2MemcpyAsync 2025-05-07T19:46:30.5198860Z #define __stub_sstk 2025-05-07T19:46:30.5198961Z #define _IO_IN_BACKUP 0x100 2025-05-07T19:46:30.5199119Z #define _GLIBCXX_USE_C99_STDLIB _GLIBCXX11_USE_C99_STDLIB 2025-05-07T19:46:30.5199222Z #define __wur 2025-05-07T19:46:30.5199338Z #define isprint_l(c,l) __isprint_l ((c), (l)) 2025-05-07T19:46:30.5199429Z #define _G_HAVE_MMAP 1 2025-05-07T19:46:30.5199520Z #define _IO_OCT 040 2025-05-07T19:46:30.5199626Z 
#define __FLT128_HAS_DENORM__ 1 2025-05-07T19:46:30.5199721Z #define NL_MSGMAX INT_MAX 2025-05-07T19:46:30.5199820Z #define _GLIBCXX_USE_LFS 1 2025-05-07T19:46:30.5199966Z #define cudaDeviceScheduleBlockingSync 0x04 2025-05-07T19:46:30.5200061Z #define _POSIX_RTSIG_MAX 8 2025-05-07T19:46:30.5200170Z #define _GLIBCXX_NOEXCEPT noexcept 2025-05-07T19:46:30.5200461Z #define __glibcxx_requires_partitioned_lower(_First,_Last,_Value) 2025-05-07T19:46:30.5200575Z #define __FLT32_DECIMAL_DIG__ 9 2025-05-07T19:46:30.5200667Z #define _STL_ALGOBASE_H 1 2025-05-07T19:46:30.5200775Z #define __cudaCDP2MemsetAsync_ptsz 2025-05-07T19:46:30.5200902Z #define __off64_t_defined 2025-05-07T19:46:30.5201001Z #define _GLIBCXX_WEAK_DEFINITION 2025-05-07T19:46:30.5201094Z #define __FLT128_DIG__ 33 2025-05-07T19:46:30.5201206Z #define _GLIBCXX_USE_C99_INTTYPES_TR1 1 2025-05-07T19:46:30.5201331Z #define _GLIBCXX_HAVE_LOCALE_H 1 2025-05-07T19:46:30.5201422Z #define __INT32_C(c) c 2025-05-07T19:46:30.5201523Z #define __DEC64_EPSILON__ 1E-15DD 2025-05-07T19:46:30.5201646Z #define __ORDER_PDP_ENDIAN__ 3412 2025-05-07T19:46:30.5201742Z #define __DEC128_MIN_EXP__ (-6142) 2025-05-07T19:46:30.5201837Z #define __PDP_ENDIAN 3412 2025-05-07T19:46:30.5201930Z #define _ISOC95_SOURCE 1 2025-05-07T19:46:30.5202051Z #define _IO_fpos64_t _G_fpos64_t 2025-05-07T19:46:30.5202182Z #define M_PI_2l 1.570796326794896619231321691639751442L 2025-05-07T19:46:30.5202282Z #define BYTE_ORDER __BYTE_ORDER 2025-05-07T19:46:30.5202403Z #define __SM_90_RT_HPP__ 2025-05-07T19:46:30.5202507Z #define __INT_FAST32_TYPE__ long int 2025-05-07T19:46:30.5202608Z #define __have_pthread_attr_t 1 2025-05-07T19:46:30.5202720Z #define _GLIBCXX_HAVE_LIMIT_DATA 1 2025-05-07T19:46:30.5202984Z #define _GLIBCXX_BEGIN_NAMESPACE_LDBL_OR_CXX11 _GLIBCXX_BEGIN_NAMESPACE_CXX11 2025-05-07T19:46:30.5203086Z #define __cudaCDP2StreamWaitEvent 2025-05-07T19:46:30.5203192Z #define __cudaCDP2EventRecord 2025-05-07T19:46:30.5203323Z #define 
_BITS_TYPESIZES_H 1 2025-05-07T19:46:30.5203419Z #define __GCC_IEC_559_COMPLEX 2 2025-05-07T19:46:30.5203512Z #define htole32(x) (x) 2025-05-07T19:46:30.5203778Z #define __cudaCDP2OccupancyMaxActiveBlocksPerMultiprocessorWithFlags 2025-05-07T19:46:30.5203901Z #define __SYSCALL_SLONG_TYPE __SLONGWORD_TYPE 2025-05-07T19:46:30.5204004Z #define _GLIBCXX_USE_C99_MATH_TR1 1 2025-05-07T19:46:30.5204165Z #define WSTOPSIG(status) __WSTOPSIG (__WAIT_INT (status)) 2025-05-07T19:46:30.5204332Z #define _GLIBCXX_USE_C99_MATH _GLIBCXX11_USE_C99_MATH 2025-05-07T19:46:30.5204457Z #define __UINT_LEAST16_TYPE__ short unsigned int 2025-05-07T19:46:30.5204597Z #define __WIFEXITED(status) (__WTERMSIG(status) == 0) 2025-05-07T19:46:30.5204792Z #define ADJ_OFFSET 0x0001 2025-05-07T19:46:30.5204894Z #define cudaArrayLayered 0x01 2025-05-07T19:46:30.5205062Z #define _PSTL_ICC_18_OMP_SIMD_BROKEN (__INTEL_COMPILER == 1800) 2025-05-07T19:46:30.5205173Z #define cudaEventRecordDefault 0x00 2025-05-07T19:46:30.5205304Z #define _GLIBCXX_HAVE_FMODF 1 2025-05-07T19:46:30.5205406Z #define _PSTL_PRAGMA_MESSAGE(x) 2025-05-07T19:46:30.5205489Z #define unix 1 2025-05-07T19:46:30.5205614Z #define __DBL_HAS_DENORM__ 1 2025-05-07T19:46:30.5205718Z #define _POSIX_CHILD_MAX 25 2025-05-07T19:46:30.5205814Z #define _POSIX_MAX_INPUT 255 2025-05-07T19:46:30.5205932Z #define __cudaCDP2DeviceGetCacheConfig 2025-05-07T19:46:30.5206055Z #define __USE_POSIX 1 2025-05-07T19:46:30.5206156Z #define __FD_ZERO_STOS "stosq" 2025-05-07T19:46:30.5206292Z #define _PSTL_VERSION_MAJOR (_PSTL_VERSION / 1000) 2025-05-07T19:46:30.5206412Z #define __THROWNL throw () 2025-05-07T19:46:30.5206509Z #define __cpp_rtti 199711L 2025-05-07T19:46:30.5206620Z #define __SIZE_TYPE__ long unsigned int 2025-05-07T19:46:30.5206712Z #define __PMT(args) args 2025-05-07T19:46:30.5206850Z #define __UINT64_MAX__ 0xffffffffffffffffUL 2025-05-07T19:46:30.5206997Z #define __va_arg_pack_len() __builtin_va_arg_pack_len () 2025-05-07T19:46:30.5207114Z 
#define __ULONGWORD_TYPE unsigned long int 2025-05-07T19:46:30.5207232Z #define _SIZE_T_DECLARED 2025-05-07T19:46:30.5207335Z #define _PSTL_STRING_AUX(x) #x 2025-05-07T19:46:30.5207423Z #define __FLT_IS_IEC_60559__ 2 2025-05-07T19:46:30.5207855Z #define _PSTL_CPP14_MAKE_REVERSE_ITERATOR_PRESENT (_MSC_VER >= 1900 || __cplusplus >= 201402L || __cpp_lib_make_reverse_iterator == 201402) 2025-05-07T19:46:30.5207962Z #define _GLIBCXX_HAVE_LIMIT_AS 1 2025-05-07T19:46:30.5208105Z #define XATTR_LIST_MAX 65536 2025-05-07T19:46:30.5208199Z #define __CUDACC_VER_MAJOR__ 12 2025-05-07T19:46:30.5208371Z #define __GNUC_WIDE_EXECUTION_CHARSET_NAME "UTF-32LE" 2025-05-07T19:46:30.5208458Z #define _WCHAR_T_H 2025-05-07T19:46:30.5208551Z #define __FLT64X_DIG__ 18 2025-05-07T19:46:30.5208663Z #define _IO_SHOWBASE 0200 2025-05-07T19:46:30.5208757Z #define _POSIX_QLIMIT 1 2025-05-07T19:46:30.5208862Z #define __INT8_TYPE__ signed char 2025-05-07T19:46:30.5208952Z #define __SURFACE_TYPES_H__ 2025-05-07T19:46:30.5209065Z #define __CUDA_ARCH__ 520 2025-05-07T19:46:30.5209183Z #define __cpp_digit_separators 201309L 2025-05-07T19:46:30.5209265Z #define __ELF__ 1 2025-05-07T19:46:30.5209388Z #define CLOCK_THREAD_CPUTIME_ID 3 2025-05-07T19:46:30.5209490Z #define __GCC_ASM_FLAG_OUTPUTS__ 1 2025-05-07T19:46:30.5209579Z #define STA_INS 0x0010 2025-05-07T19:46:30.5209679Z #define __UINT32_TYPE__ unsigned int 2025-05-07T19:46:30.5209881Z #define _toupper(c) ((int) (*__ctype_toupper_loc ())[(int) (c)]) 2025-05-07T19:46:30.5209981Z #define _BITS_BYTESWAP_H 1 2025-05-07T19:46:30.5210085Z #define __ID_T_TYPE __U32_TYPE 2025-05-07T19:46:30.5210220Z #define __TIME_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:46:30.5210331Z #define __DEVICE_DOUBLE_FUNCTIONS_HPP__ 2025-05-07T19:46:30.5210442Z #define _GLIBCXX_HAVE_MBSTATE_T 1 2025-05-07T19:46:30.5210539Z #define __cpp_lib_logical_traits 201510 2025-05-07T19:46:30.5210663Z #define ADJ_OFFSET_SS_READ 0xa001 2025-05-07T19:46:30.5210821Z #define __warnattr(msg) 
__attribute__((__warning__ (msg))) 2025-05-07T19:46:30.5210979Z #define _PSTL_PRAGMA_LOCATION " [Parallel STL message]: " 2025-05-07T19:46:30.5211102Z #define _IO_funlockfile(_fp) 2025-05-07T19:46:30.5211424Z #define cudaKernelNodeAttributeAccessPolicyWindow cudaLaunchAttributeAccessPolicyWindow 2025-05-07T19:46:30.5211557Z #define M_2_PIl 0.636619772367581343075535053490057448L 2025-05-07T19:46:30.5211666Z #define __DRIVER_TYPES_H__ 2025-05-07T19:46:30.5211760Z #define __FLT_RADIX__ 2 2025-05-07T19:46:30.5211869Z #define __INT_LEAST16_TYPE__ short int 2025-05-07T19:46:30.5212045Z #define __LDBL_EPSILON__ 1.08420217248550443400745280086994171e-19L 2025-05-07T19:46:30.5212152Z #define __UINTMAX_C(c) c ## UL 2025-05-07T19:46:30.5212249Z #define _GLIBCXX_USE_LSTAT 1 2025-05-07T19:46:30.5212405Z #define minor(dev) gnu_dev_minor (dev) 2025-05-07T19:46:30.5212522Z #define _POSIX_C_SOURCE 200809L 2025-05-07T19:46:30.5212624Z #define _GLIBCXX_HAVE_DIRENT_H 1 2025-05-07T19:46:30.5212728Z #define __GLIBCXX_BITSIZE_INT_N_0 128 2025-05-07T19:46:30.5212815Z #define WORD_BIT 32 2025-05-07T19:46:30.5212930Z #define _IO_USER_BUF 1 2025-05-07T19:46:30.5213027Z #define __VECTOR_TYPES_H__ 2025-05-07T19:46:30.5213123Z #define __SM_20_ATOMIC_FUNCTIONS_HPP__ 2025-05-07T19:46:30.5213246Z #define cudaHostAllocPortable 0x01 2025-05-07T19:46:30.5213344Z #define PTHREAD_STACK_MIN 16384 2025-05-07T19:46:30.5213446Z #define __long_double_t long double 2025-05-07T19:46:30.5213545Z #define _GLIBCXX_HAVE_ISINF 1 2025-05-07T19:46:30.5213660Z #define _POSIX_ARG_MAX 4096 2025-05-07T19:46:30.5214061Z #define cudaKernelNodeAttributeDeviceUpdatableKernelNode cudaLaunchAttributeDeviceUpdatableKernelNode 2025-05-07T19:46:30.5214147Z #define __k8 1 2025-05-07T19:46:30.5214366Z #define _GLIBCXX_NO_OBSOLETE_ISINF_ISNAN_DYNAMIC __GLIBC_PREREQ(2,23) 2025-05-07T19:46:30.5214537Z #define __FLT32X_MIN__ 2.22507385850720138309023271733240406e-308F32x 2025-05-07T19:46:30.5214657Z #define __LDBL_REDIR(name,proto) 
name proto 2025-05-07T19:46:30.5214789Z #define __SIG_ATOMIC_MAX__ 0x7fffffff 2025-05-07T19:46:30.5214880Z #define __SM_30_INTRINSICS_HPP__ 2025-05-07T19:46:30.5214980Z #define _GLIBCXX_EXTERN_TEMPLATE 1 2025-05-07T19:46:30.5215083Z #define __blksize_t_defined 2025-05-07T19:46:30.5215204Z #define _IO_SHOWPOINT 0400 2025-05-07T19:46:30.5215303Z #define _GLIBCXX_HAVE_LIMIT_RSS 1 2025-05-07T19:46:30.5215419Z #define cudaDeviceLmemResizeToMax 0x10 2025-05-07T19:46:30.5215546Z #define _GLIBCXX_X86_RDRAND 1 2025-05-07T19:46:30.5215653Z #define __GCC_ATOMIC_WCHAR_T_LOCK_FREE 2 2025-05-07T19:46:30.5215798Z #define _IO_IS_FILEBUF 0x2000 2025-05-07T19:46:30.5215894Z #define _GLIBCXX_USE_DUAL_ABI 1 2025-05-07T19:46:30.5216187Z #define __bswap_constant_16(x) ((unsigned short int) ((((x) >> 8) & 0xff) | (((x) & 0xff) << 8))) 2025-05-07T19:46:30.5216533Z #define cudaSignalExternalSemaphoresAsync __CUDART_API_PTSZ(cudaSignalExternalSemaphoresAsync_v2) 2025-05-07T19:46:30.5216634Z #define UCHAR_MAX (SCHAR_MAX * 2 + 1) 2025-05-07T19:46:30.5216757Z #define __SIZEOF_PTRDIFF_T__ 8 2025-05-07T19:46:30.5216844Z #define SEEK_SET 0 2025-05-07T19:46:30.5216945Z #define _GLIBCXX_TR1_GAMMA_TCC 1 2025-05-07T19:46:30.5217044Z #define __CUDA_API_VER_MINOR__ 6 2025-05-07T19:46:30.5217266Z #define _GLIBCXX_VISIBILITY(V) __attribute__ ((__visibility__ (#V))) 2025-05-07T19:46:30.5217361Z #define _GLIBCXX20_DEPRECATED(MSG) 2025-05-07T19:46:30.5217462Z #define __cudaCDP2GetLastError 2025-05-07T19:46:30.5217584Z #define _GLIBCXX_HAVE_COSL 1 2025-05-07T19:46:30.5217680Z #define _MATH_H_MATHDEF 1 2025-05-07T19:46:30.5218014Z #define __bswap_constant_32(x) ((((x) & 0xff000000) >> 24) | (((x) & 0x00ff0000) >> 8) | (((x) & 0x0000ff00) << 8) | (((x) & 0x000000ff) << 24)) 2025-05-07T19:46:30.5218140Z #define _GLIBCXX_USE_FLOAT128 1 2025-05-07T19:46:30.5218250Z #define _IO_FLAGS2_NOTCANCEL 2 2025-05-07T19:46:30.5218343Z #define __stub_sigreturn 2025-05-07T19:46:30.5218586Z #define __errordecl(name,msg) extern 
void name (void) __attribute__((__error__ (msg))) 2025-05-07T19:46:30.5218711Z #define _GLIBCXX_HAVE_UTIME_H 1 2025-05-07T19:46:30.5218809Z #define __HOST_CONFIG_H__ 2025-05-07T19:46:30.5218909Z #define _XOPEN_SOURCE_EXTENDED 1 2025-05-07T19:46:30.5219014Z #define CLOCK_TAI 11 2025-05-07T19:46:30.5219124Z #define _GLIBCXX_END_NAMESPACE_VERSION 2025-05-07T19:46:30.5219223Z #define __restrict_arr 2025-05-07T19:46:30.5219340Z #define _PSTL_PRAGMA_MESSAGE_POLICIES(x) 2025-05-07T19:46:30.5219515Z #define __glibcxx_requires_valid_range(_First,_Last) 2025-05-07T19:46:30.5220331Z #define strndupa(s,n) (__extension__ ({ const char *__old = (s); size_t __len = strnlen (__old, (n)); char *__new = (char *) __builtin_alloca (__len + 1); __new[__len] = '\0'; (char *) memcpy (__new, __old, __len); })) 2025-05-07T19:46:30.5220540Z #define __attribute_artificial__ __attribute__ ((__artificial__)) 2025-05-07T19:46:30.5220721Z #define __USE_MISC 1 2025-05-07T19:46:30.5220840Z #define __UWORD_TYPE unsigned long int 2025-05-07T19:46:30.5220949Z #define _EXCEPTION_DEFINES_H 1 2025-05-07T19:46:30.5221057Z #define _GCC_LIMITS_H_ 2025-05-07T19:46:30.5221154Z #define __LDBL_DIG__ 18 2025-05-07T19:46:30.5221258Z #define __BIT_TYPES_DEFINED__ 1 2025-05-07T19:46:30.5221370Z #define __malloc_and_calloc_defined 2025-05-07T19:46:30.5221488Z #define __FLT64_IS_IEC_60559__ 2 2025-05-07T19:46:30.5221601Z #define _GLIBCXX_HAVE_SYS_SYSINFO_H 1 2025-05-07T19:46:30.5221697Z #define __x86_64__ 1 2025-05-07T19:46:30.5221800Z #define _SIZE_T_ 2025-05-07T19:46:30.5222825Z #define __bswap_constant_64(x) (__extension__ ((((x) & 0xff00000000000000ull) >> 56) | (((x) & 0x00ff000000000000ull) >> 40) | (((x) & 0x0000ff0000000000ull) >> 24) | (((x) & 0x000000ff00000000ull) >> 8) | (((x) & 0x00000000ff000000ull) << 8) | (((x) & 0x0000000000ff0000ull) << 24) | (((x) & 0x000000000000ff00ull) << 40) | (((x) & 0x00000000000000ffull) << 56))) 2025-05-07T19:46:30.5222949Z #define _POSIX2_COLL_WEIGHTS_MAX 2 
2025-05-07T19:46:30.5223079Z #define __FLT32X_MIN_EXP__ (-1021) 2025-05-07T19:46:30.5223206Z #define __PTHREAD_RWLOCK_INT_FLAGS_SHARED 1 2025-05-07T19:46:30.5223336Z #define __DEC32_SUBNORMAL_MIN__ 0.000001E-95DF 2025-05-07T19:46:30.5223444Z #define _IO_iconv_t _G_iconv_t 2025-05-07T19:46:30.5223578Z #define _GLIBCXX_FLOAT_IS_IEEE_BINARY32 1 2025-05-07T19:46:30.5223704Z #define __cpp_lib_make_reverse_iterator 201402 2025-05-07T19:46:30.5223854Z #define _GLIBCXX_SYNCHRONIZATION_HAPPENS_BEFORE(A) 2025-05-07T19:46:30.5223976Z #define _GLIBCXX_HAVE_DLFCN_H 1 2025-05-07T19:46:30.5224492Z #define strdupa(s) (__extension__ ({ const char *__old = (s); size_t __len = strlen (__old) + 1; char *__new = (char *) __builtin_alloca (__len); (char *) memcpy (__new, __old, __len); })) 2025-05-07T19:46:30.5225984Z #define __no_return__ __attribute__((noreturn)) 2025-05-07T19:46:30.5226171Z #define __device_builtin__ __location__(device_builtin) 2025-05-07T19:46:30.5226280Z #define _PSTL_HIDE_FROM_ABI_POP 2025-05-07T19:46:30.5226385Z #define _GLIBCXX_HAVE_ACOSF 1 2025-05-07T19:46:30.5226487Z #define STA_FLL 0x0008 2025-05-07T19:46:30.5226677Z #define _GLIBCXX_HAVE_BUILTIN_IS_CONSTANT_EVALUATED 1 2025-05-07T19:46:30.5226779Z #define _GLIBCXX_END_EXTERN_C } 2025-05-07T19:46:30.5226910Z #define __INT_FAST16_MAX__ 0x7fffffffffffffffL 2025-05-07T19:46:30.5227051Z #define __cpp_lib_integer_sequence 201304 2025-05-07T19:46:30.5227142Z #define __stub_revoke 2025-05-07T19:46:30.5227240Z #define __timer_t_defined 1 2025-05-07T19:46:30.5227388Z #define _GLIBCXX11_DEPRECATED _GLIBCXX_DEPRECATED 2025-05-07T19:46:30.5227509Z #define INT_MAX __INT_MAX__ 2025-05-07T19:46:30.5227627Z #define ULLONG_MAX (LLONG_MAX * 2ULL + 1) 2025-05-07T19:46:30.5227736Z #define _GLIBCXX_END_NAMESPACE_CXX11 } 2025-05-07T19:46:30.5227868Z #define _GLIBCXX_ICONV_CONST 2025-05-07T19:46:30.5227973Z #define major(dev) gnu_dev_major (dev) 2025-05-07T19:46:30.5228085Z #define cudaArrayTextureGather 0x08 
2025-05-07T19:46:30.5228221Z #define _GLIBCXX_LT_OBJDIR ".libs/" 2025-05-07T19:46:30.5228381Z #define __inline_hint__ __attribute__((nv_inline_hint)) 2025-05-07T19:46:30.5228477Z #define __NV_LEGACY_LAUNCH 1 2025-05-07T19:46:30.5228578Z #define _IO_off_t __off_t 2025-05-07T19:46:30.5228701Z #define __FLT64_DIG__ 15 2025-05-07T19:46:30.5228940Z #define PTHREAD_DESTRUCTOR_ITERATIONS _POSIX_THREAD_DESTRUCTOR_ITERATIONS 2025-05-07T19:46:30.5229037Z #define _POSIX2_LINE_MAX 2048 2025-05-07T19:46:30.5229203Z #define __UINT_FAST32_MAX__ 0xffffffffffffffffUL 2025-05-07T19:46:30.5229337Z #define __UINT_LEAST64_TYPE__ long unsigned int 2025-05-07T19:46:30.5229430Z #define ADJ_FREQUENCY 0x0002 2025-05-07T19:46:30.5229548Z #define __CUDART_API_PTDS(api) api 2025-05-07T19:46:30.5229664Z #define NULL __null 2025-05-07T19:46:30.5229807Z #define cudaStreamPerThread ((cudaStream_t)0x2) 2025-05-07T19:46:30.5229917Z #define _GLIBCXX_CONSTEXPR constexpr 2025-05-07T19:46:30.5230041Z #define __U64_TYPE unsigned long int 2025-05-07T19:46:30.5230194Z #define __FLT_HAS_QUIET_NAN__ 1 2025-05-07T19:46:30.5230295Z #define __FLT_MAX_10_EXP__ 38 2025-05-07T19:46:30.5230389Z #define FP_ZERO 2 2025-05-07T19:46:30.5230515Z #define _GLIBCXX_HAVE_FLOORL 1 2025-05-07T19:46:30.5230684Z #define __isgraph_l(c,l) __isctype_l((c), _ISgraph, (l)) 2025-05-07T19:46:30.5230796Z #define __LONG_MAX__ 0x7fffffffffffffffL 2025-05-07T19:46:30.5230915Z #define __WCHAR_T__ 2025-05-07T19:46:30.5231024Z #define __FLT64X_HAS_DENORM__ 1 2025-05-07T19:46:30.5231242Z #define __DEC128_SUBNORMAL_MIN__ 0.000000000000000000000000000000001E-6143DL 2025-05-07T19:46:30.5231408Z #define _GLIBCXX_NORETURN __attribute__ ((__noreturn__)) 2025-05-07T19:46:30.5231539Z #define __FLT_HAS_INFINITY__ 1 2025-05-07T19:46:30.5231677Z #define __GNUC_EXECUTION_CHARSET_NAME "UTF-8" 2025-05-07T19:46:30.5231811Z #define _GLIBCXX20_DEPRECATED_SUGGEST(ALT) 2025-05-07T19:46:30.5231975Z #define __WSTOPSIG(status) __WEXITSTATUS(status) 
2025-05-07T19:46:30.5232114Z #define cudaSurfaceTypeCubemapLayered 0xFC 2025-05-07T19:46:30.5232222Z #define _BSD_PTRDIFF_T_ 2025-05-07T19:46:30.5232449Z #define _SIGSET_H_types 1 2025-05-07T19:46:30.5232561Z #define cudaTextureType1DLayered 0xF1 2025-05-07T19:46:30.5232670Z #define __cpp_unicode_literals 200710L 2025-05-07T19:46:30.5232817Z #define __isdigit_l(c,l) __isctype_l((c), _ISdigit, (l)) 2025-05-07T19:46:30.5232931Z #define __LONG_LONG_PAIR(HI,LO) LO, HI 2025-05-07T19:46:30.5233054Z #define __UINT_FAST16_TYPE__ long unsigned int 2025-05-07T19:46:30.5233189Z #define __bos0(ptr) __builtin_object_size (ptr, 0) 2025-05-07T19:46:30.5233318Z #define __DEC64_MAX__ 9.999999999999999E384DD 2025-05-07T19:46:30.5233441Z #define M_1_PIl 0.318309886183790671537767526745028724L 2025-05-07T19:46:30.5233671Z #define WIFSTOPPED(status) __WIFSTOPPED (__WAIT_INT (status)) 2025-05-07T19:46:30.5233774Z #define __INT_FAST32_WIDTH__ 64 2025-05-07T19:46:30.5233894Z #define _POSIX2_CHARCLASS_NAME_MAX 14 2025-05-07T19:46:30.5233996Z #define _GLIBCXX_BITS_STD_ABS_H 2025-05-07T19:46:30.5234098Z #define STA_MODE 0x4000 2025-05-07T19:46:30.5234232Z #define __CHAR16_TYPE__ short unsigned int 2025-05-07T19:46:30.5234334Z #define __PRAGMA_REDEFINE_EXTNAME 1 2025-05-07T19:46:30.5234454Z #define __glibcxx_signed_b(T,B) ((T)(-1) < 0) 2025-05-07T19:46:30.5234561Z #define __USING_NAMESPACE_C99(name) 2025-05-07T19:46:30.5234679Z #define BIG_ENDIAN __BIG_ENDIAN 2025-05-07T19:46:30.5234787Z #define __cudaCDP2EventRecord_ptsz 2025-05-07T19:46:30.5234884Z #define _GLIBCXX_HAVE_SINL 1 2025-05-07T19:46:30.5235022Z #define EXPR_NEST_MAX _POSIX2_EXPR_NEST_MAX 2025-05-07T19:46:30.5235119Z #define __SIZE_WIDTH__ 64 2025-05-07T19:46:30.5235239Z #define __BLKSIZE_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:46:30.5235334Z #define __SEG_FS 1 2025-05-07T19:46:30.5235446Z #define _IO_size_t size_t 2025-05-07T19:46:30.5235545Z #define __INT_LEAST16_MAX__ 0x7fff 2025-05-07T19:46:30.5235646Z #define INT_MIN 
(-INT_MAX - 1) 2025-05-07T19:46:30.5235748Z #define __stub_lchmod 2025-05-07T19:46:30.5235840Z #define __DEC64_MANT_DIG__ 16 2025-05-07T19:46:30.5235947Z #define __INT64_MAX__ 0x7fffffffffffffffL 2025-05-07T19:46:30.5236079Z #define _GLIBCXX_MANGLE_SIZE_T m 2025-05-07T19:46:30.5236163Z #define __SEG_GS 1 2025-05-07T19:46:30.5236346Z #define __FLT32_DENORM_MIN__ 1.40129846432481707092372958328991613e-45F32 2025-05-07T19:46:30.5236439Z #define _IOS_APPEND 8 2025-05-07T19:46:30.5236564Z #define __SIG_ATOMIC_WIDTH__ 32 2025-05-07T19:46:30.5236659Z #define _GLIBCXX_RELEASE 11 2025-05-07T19:46:30.5236759Z #define _GLIBCXX98_USE_C99_WCHAR 1 2025-05-07T19:46:30.5236886Z #define _IO_IS_APPENDING 0x1000 2025-05-07T19:46:30.5236983Z #define __INT_LEAST64_TYPE__ long int 2025-05-07T19:46:30.5237076Z #define htole16(x) (x) 2025-05-07T19:46:30.5237192Z #define __TEXTURE_INDIRECT_FUNCTIONS_H__ 2025-05-07T19:46:30.5237311Z #define _GLIBCXX_HAVE_FCNTL_H 1 2025-05-07T19:46:30.5237405Z #define __INT16_TYPE__ short int 2025-05-07T19:46:30.5237512Z #define __INT_LEAST8_TYPE__ signed char 2025-05-07T19:46:30.5237715Z #define __glibcxx_class_requires(_a,_b) 2025-05-07T19:46:30.5237827Z #define __cpp_structured_bindings 201606L 2025-05-07T19:46:30.5237945Z #define __align__(n) __attribute__((aligned(n))) 2025-05-07T19:46:30.5238039Z #define __SIZEOF_INT__ 4 2025-05-07T19:46:30.5238157Z #define __WCLONE 0x80000000 2025-05-07T19:46:30.5238245Z #define __DEC32_MAX_EXP__ 97 2025-05-07T19:46:30.5238332Z #define SEEK_HOLE 4 2025-05-07T19:46:30.5238451Z #define TIMER_ABSTIME 1 2025-05-07T19:46:30.5238543Z #define __INT_FAST8_MAX__ 0x7f 2025-05-07T19:46:30.5238634Z #define __CUDA_MATH_CRTIMP 2025-05-07T19:46:30.5238807Z #define __FLT128_MAX__ 1.18973149535723176508575932662800702e+4932F128 2025-05-07T19:46:30.5238950Z #define __INTPTR_MAX__ 0x7fffffffffffffffL 2025-05-07T19:46:30.5239047Z #define __DRIVER_FUNCTIONS_H__ 2025-05-07T19:46:30.5239157Z #define __cpp_sized_deallocation 201309L 
2025-05-07T19:46:30.5239284Z #define __MATH_FUNCTIONS_HPP__ 2025-05-07T19:46:30.5239402Z #define __cpp_guaranteed_copy_elision 201606L 2025-05-07T19:46:30.5239495Z #define _LINUX_LIMITS_H 2025-05-07T19:46:30.5239610Z #define linux 1 2025-05-07T19:46:30.5239714Z #define MOD_MICRO ADJ_MICRO 2025-05-07T19:46:30.5239827Z #define _GLIBCXX_DEBUG_ASSERT(_Condition) 2025-05-07T19:46:30.5239936Z #define _GLIBCXX_HAVE_VSWSCANF 1 2025-05-07T19:46:30.5240059Z #define _GLIBCXX_HAVE_ISNAN 1 2025-05-07T19:46:30.5240174Z #define _XOPEN_IOV_MAX _POSIX_UIO_MAXIOV 2025-05-07T19:46:30.5240329Z #define __cudart_builtin__ __location__(cudart_builtin) 2025-05-07T19:46:30.5240452Z #define __cpp_lib_hypot 201603 2025-05-07T19:46:30.5240542Z #define __FLT64_HAS_QUIET_NAN__ 1 2025-05-07T19:46:30.5240631Z #define _GLIBCXX_HAVE_WCTYPE_H 1 2025-05-07T19:46:30.5240711Z #define MOD_NANO ADJ_NANO 2025-05-07T19:46:30.5240853Z #define htole64(x) (x) 2025-05-07T19:46:30.5240946Z #define FP_ILOGBNAN (-2147483647 - 1) 2025-05-07T19:46:30.5241060Z #define _IO_stdout ((_IO_FILE*)(&_IO_2_1_stdout_)) 2025-05-07T19:46:30.5241158Z #define _IO_UPPERCASE 01000 2025-05-07T19:46:30.5241644Z #define cudaKernelNodeAttributeClusterSchedulingPolicyPreference cudaLaunchAttributeClusterSchedulingPolicyPreference 2025-05-07T19:46:30.5241727Z #define __USE_POSIX2 1 2025-05-07T19:46:30.5241818Z #define MOD_ESTERROR ADJ_ESTERROR 2025-05-07T19:46:30.5241913Z #define __WALL 0x40000000 2025-05-07T19:46:30.5242004Z #define _GLIBCXX_HAVE_LDEXPF 1 2025-05-07T19:46:30.5242082Z #define _XLOCALE_H 1 2025-05-07T19:46:30.5242181Z #define _GLIBCXX_USE_TMPNAM 1 2025-05-07T19:46:30.5242271Z #define __FLT32_MIN_10_EXP__ (-37) 2025-05-07T19:46:30.5242361Z #define __KEY_T_TYPE __S32_TYPE 2025-05-07T19:46:30.5242457Z #define __cudaGet_threadIdx() threadIdx 2025-05-07T19:46:30.5242555Z #define __EXCEPTIONS 1 2025-05-07T19:46:30.5242650Z #define __CUDART_API_PTSZ(api) api 2025-05-07T19:46:30.5242841Z #define __launch_bounds__(...) 
__annotate__(launch_bounds(__VA_ARGS__)) 2025-05-07T19:46:30.5242934Z #define __WORDSIZE 64 2025-05-07T19:46:30.5243019Z #define CLOCK_MONOTONIC 1 2025-05-07T19:46:30.5243106Z #define _STL_RELOPS_H 1 2025-05-07T19:46:30.5243197Z #define __PTRDIFF_WIDTH__ 64 2025-05-07T19:46:30.5243302Z #define __BEGIN_DECLS extern "C" { 2025-05-07T19:46:30.5243393Z #define _GLIBCXX_HAVE_SYS_IPC_H 1 2025-05-07T19:46:30.5243476Z #define __LDBL_MANT_DIG__ 64 2025-05-07T19:46:30.5243577Z #define _GLIBCXX_HAVE_TRUNCATE 1 2025-05-07T19:46:30.5243869Z #define cudaKernelNodeAttributeClusterDimension cudaLaunchAttributeClusterDimension 2025-05-07T19:46:30.5244099Z #define _PSTL_GCC_VERSION (__GNUC__ * 10000 + __GNUC_MINOR__ * 100 + __GNUC_PATCHLEVEL__) 2025-05-07T19:46:30.5244228Z #define _GLIBCXX_NAMESPACE_CXX11 __cxx11:: 2025-05-07T19:46:30.5244316Z #define _GLIBCXX_NUMERIC_LIMITS 1 2025-05-07T19:46:30.5244413Z #define __cpp_range_based_for 201603L 2025-05-07T19:46:30.5244523Z #define __cpp_lib_exchange_function 201304 2025-05-07T19:46:30.5244630Z #define _GLIBCXX_HAVE_INTTYPES_H 1 2025-05-07T19:46:30.5244735Z #define _GLIBCXX_DARWIN_USE_64_BIT_INODE 1 2025-05-07T19:46:30.5244905Z #define cudaCooperativeLaunchMultiDeviceNoPostSync 0x02 2025-05-07T19:46:30.5245054Z #define __FLT64_HAS_INFINITY__ 1 2025-05-07T19:46:30.5245143Z #define _GLIBCXX_CSTDLIB 1 2025-05-07T19:46:30.5245240Z #define _GLIBCXX_DEBUG_MACRO_SWITCH_H 1 2025-05-07T19:46:30.5245411Z #define __FLT64X_MAX__ 1.18973149535723176502126385303097021e+4932F64x 2025-05-07T19:46:30.5245536Z #define __STDCPP_DEFAULT_NEW_ALIGNMENT__ 16 2025-05-07T19:46:30.5245622Z #define _STRING_H 1 2025-05-07T19:46:30.5245735Z #define _BITS_PTHREADTYPES_H 1 2025-05-07T19:46:30.5245856Z #define _GCC_MAX_ALIGN_T 2025-05-07T19:46:30.5245958Z #define __SM_32_INTRINSICS_HPP__ 2025-05-07T19:46:30.5246103Z #define __SIG_ATOMIC_MIN__ (-__SIG_ATOMIC_MAX__ - 1) 2025-05-07T19:46:30.5246194Z #define __code_model_small__ 1 2025-05-07T19:46:30.5246289Z #define 
_PSTL_CONFIG_H 2025-05-07T19:46:30.5246381Z #define __GCC_ATOMIC_LONG_LOCK_FREE 2 2025-05-07T19:46:30.5246495Z #define __cpp_nontype_template_args 201411L 2025-05-07T19:46:30.5246625Z #define __SM_20_INTRINSICS_H__ 2025-05-07T19:46:30.5246733Z #define cudaCpuDeviceId ((int)-1) 2025-05-07T19:46:30.5247071Z #define assert(expr) ((expr) ? __ASSERT_VOID_CAST (0) : __assert_fail (__STRING(expr), __FILE__, __LINE__, __ASSERT_FUNCTION)) 2025-05-07T19:46:30.5247195Z #define __DEC32_MANT_DIG__ 7 2025-05-07T19:46:30.5247289Z #define le64toh(x) (x) 2025-05-07T19:46:30.5247376Z #define FILENAME_MAX 4096 2025-05-07T19:46:30.5247527Z #define __iscntrl_l(c,l) __isctype_l((c), _IScntrl, (l)) 2025-05-07T19:46:30.5247667Z #define __cpp_return_type_deduction 201304L 2025-05-07T19:46:30.5247748Z #define L_cuserid 9 2025-05-07T19:46:30.5247836Z #define __ino_t_defined 2025-05-07T19:46:30.5247950Z #define __k8__ 1 2025-05-07T19:46:30.5248047Z #define __INTPTR_TYPE__ long int 2025-05-07T19:46:30.5248201Z #define __UINT16_TYPE__ short unsigned int 2025-05-07T19:46:30.5248294Z #define __int8_t_defined 2025-05-07T19:46:30.5248421Z #define __WCHAR_TYPE__ int 2025-05-07T19:46:30.5248520Z #define __CLOCKID_T_TYPE __S32_TYPE 2025-05-07T19:46:30.5248633Z #define cudaHostRegisterPortable 0x01 2025-05-07T19:46:30.5248762Z #define __SLONGWORD_TYPE long int 2025-05-07T19:46:30.5248848Z #define _IOS_TRUNC 16 2025-05-07T19:46:30.5248961Z #define _GLIBCXX_PACKAGE_TARNAME "libstdc++" 2025-05-07T19:46:30.5249110Z #define __isblank_l(c,l) __isctype_l((c), _ISblank, (l)) 2025-05-07T19:46:30.5249229Z #define __HAVE_COLUMN 2025-05-07T19:46:30.5249311Z #define __stub_fdetach 2025-05-07T19:46:30.5249723Z #define __CUDACC_VER__ "__CUDACC_VER__ is no longer supported. Use __CUDACC_VER_MAJOR__, __CUDACC_VER_MINOR__, and __CUDACC_VER_BUILD__ instead." 
2025-05-07T19:46:30.5249814Z #define __pic__ 2 2025-05-07T19:46:30.5249939Z #define __UINTPTR_MAX__ 0xffffffffffffffffUL 2025-05-07T19:46:30.5250025Z #define CLOCKS_PER_SEC 1000000l 2025-05-07T19:46:30.5250115Z #define __INT_FAST64_WIDTH__ 64 2025-05-07T19:46:30.5250223Z #define _GLIBCXX_HAVE_SOCKATMARK 1 2025-05-07T19:46:30.5250300Z #define __stub_chflags 2025-05-07T19:46:30.5250384Z #define CLOCK_BOOTTIME 7 2025-05-07T19:46:30.5250477Z #define __need_IOV_MAX 2025-05-07T19:46:30.5250580Z #define putc(_ch,_fp) _IO_putc (_ch, _fp) 2025-05-07T19:46:30.5250675Z #define __UQUAD_TYPE unsigned long int 2025-05-07T19:46:30.5250777Z #define __cpp_decltype 200707L 2025-05-07T19:46:30.5250872Z #define __BYTE_ORDER __LITTLE_ENDIAN 2025-05-07T19:46:30.5250957Z #define _GLIBCXX_USE_C99 1 2025-05-07T19:46:30.5251057Z #define _GLIBCXX_TR1_BETA_FUNCTION_TCC 1 2025-05-07T19:46:30.5251151Z #define TTY_NAME_MAX 32 2025-05-07T19:46:30.5251309Z #define _GLIBCXX_FORWARD(_Tp,__val) std::forward<_Tp>(__val) 2025-05-07T19:46:30.5251423Z #define __INT_FAST64_MAX__ 0x7fffffffffffffffL 2025-05-07T19:46:30.5251593Z #define _PSTL_ASSERT(_Condition) __glibcxx_assert(_Condition) 2025-05-07T19:46:30.5251701Z #define __GCC_ATOMIC_TEST_AND_SET_TRUEVAL 1 2025-05-07T19:46:30.5251792Z #define __LITTLE_ENDIAN 1234 2025-05-07T19:46:30.5251881Z #define STA_PPSTIME 0x0004 2025-05-07T19:46:30.5251968Z #define __import__ 2025-05-07T19:46:30.5252052Z #define BUFSIZ _IO_BUFSIZ 2025-05-07T19:46:30.5252226Z #define M_SQRT2l 1.414213562373095048801688724209698079L 2025-05-07T19:46:30.5252335Z #define __export__ 2025-05-07T19:46:30.5252457Z #define __FSID_T_TYPE struct { int __val[2]; } 2025-05-07T19:46:30.5252560Z #define cudaMemAttachHost 0x02 2025-05-07T19:46:30.5252714Z #define __FLT_NORM_MAX__ 3.40282346638528859811704183484516925e+38F 2025-05-07T19:46:30.5252832Z #define _GLIBCXX_HAVE_ICONV 1 2025-05-07T19:46:30.5252924Z #define _GLIBCXX_SYMVER 1 2025-05-07T19:46:30.5253028Z #define __FLT64X_MAX_EXP__ 16384 
2025-05-07T19:46:30.5253139Z #define _WCHAR_T_DECLARED 2025-05-07T19:46:30.5253267Z #define __UINT_FAST64_TYPE__ long unsigned int 2025-05-07T19:46:30.5253392Z #define isalpha_l(c,l) __isalpha_l ((c), (l)) 2025-05-07T19:46:30.5253498Z #define __cpp_inline_variables 201606L 2025-05-07T19:46:30.5253612Z #define WNOWAIT 0x01000000 2025-05-07T19:46:30.5253698Z #define PLOSS 6 2025-05-07T19:46:30.5253796Z #define M_LN10 2.30258509299404568402 2025-05-07T19:46:30.5254088Z #define _PSTL_UDS_PRESENT (__INTEL_COMPILER >= 1900 && __INTEL_COMPILER_BUILD_DATE >= 20180626) 2025-05-07T19:46:30.5254180Z #define EXIT_SUCCESS 0 2025-05-07T19:46:30.5254284Z #define __LDBL_REDIR_DECL(name) 2025-05-07T19:46:30.5254387Z #define _GLIBCXX_HAVE_STRTOF 1 2025-05-07T19:46:30.5254502Z #define MOD_FREQUENCY ADJ_FREQUENCY 2025-05-07T19:46:30.5254600Z #define __thread__ __thread 2025-05-07T19:46:30.5254702Z #define _GLIBCXX_HAVE_MEMORY_H 1 2025-05-07T19:46:30.5254810Z #define __INT_MAX__ 0x7fffffff 2025-05-07T19:46:30.5254918Z #define __SIZEOF_PTHREAD_BARRIER_T 32 2025-05-07T19:46:30.5255151Z #define __glibcxx_requires_partitioned_upper_pred(_First,_Last,_Value,_Pred) 2025-05-07T19:46:30.5255295Z #define __cudaCDP2StreamWaitEvent_ptsz 2025-05-07T19:46:30.5255441Z #define _GLIBCXX_HAVE_SINF 1 2025-05-07T19:46:30.5255527Z #define __linux__ 1 2025-05-07T19:46:30.5255628Z #define STA_PPSSIGNAL 0x0100 2025-05-07T19:46:30.5255780Z #define M_LN2l 0.693147180559945309417232121458176568L 2025-05-07T19:46:30.5255873Z #define __S16_TYPE short int 2025-05-07T19:46:30.5256235Z #define __glibcxx_constexpr_assert(cond) if (__builtin_is_constant_evaluated() && !bool(cond)) __builtin_unreachable() 2025-05-07T19:46:30.5256371Z #define __NVCC_DIAG_PRAGMA_SUPPORT__ 1 2025-05-07T19:46:30.5256561Z #define __bos(ptr) __builtin_object_size (ptr, __USE_FORTIFY_LEVEL > 1) 2025-05-07T19:46:30.5256661Z #define __COMMON_FUNCTIONS_H__ 2025-05-07T19:46:30.5256759Z #define UINT_MAX (INT_MAX * 2U + 1U) 
2025-05-07T19:46:30.5256869Z #define _T_SIZE_ 2025-05-07T19:46:30.5256965Z #define LLONG_MAX __LONG_LONG_MAX__ 2025-05-07T19:46:30.5257086Z #define __cudaCDP2StreamCreateWithFlags 2025-05-07T19:46:30.5257209Z #define _PSTL_VERSION 12000 2025-05-07T19:46:30.5257326Z #define __noinline__ __attribute__((noinline)) 2025-05-07T19:46:30.5257427Z #define __WNOTHREAD 0x20000000 2025-05-07T19:46:30.5257527Z #define _G_va_list __gnuc_va_list 2025-05-07T19:46:30.5257677Z #define M_PI_4l 0.785398163397448309615660845819875721L 2025-05-07T19:46:30.5257766Z #define _IOS_INPUT 1 2025-05-07T19:46:30.5257863Z #define __USE_LARGEFILE64 1 2025-05-07T19:46:30.5257997Z #define _GLIBCXX_TR1_EXP_INTEGRAL_TCC 1 2025-05-07T19:46:30.5258092Z #define __INT64_TYPE__ long int 2025-05-07T19:46:30.5258186Z #define _POSIX_SSIZE_MAX 32767 2025-05-07T19:46:30.5258311Z #define __shared__ __location__(shared) 2025-05-07T19:46:30.5258401Z #define __FLT_MAX_EXP__ 128 2025-05-07T19:46:30.5258550Z #define __glibc_unlikely(cond) __builtin_expect((cond), 0) 2025-05-07T19:46:30.5258639Z #define __gid_t_defined 2025-05-07T19:46:30.5258779Z #define _GLIBCXX_USE_SC_NPROCESSORS_ONLN 1 2025-05-07T19:46:30.5258874Z #define __ORDER_BIG_ENDIAN__ 4321 2025-05-07T19:46:30.5259079Z #define __glibcxx_requires_can_increment_range(_First1,_Last1,_First2) 2025-05-07T19:46:30.5259208Z #define _GLIBCXX17_INLINE inline 2025-05-07T19:46:30.5259304Z #define __DBL_MANT_DIG__ 53 2025-05-07T19:46:30.5259388Z #define ___int_size_t_h 2025-05-07T19:46:30.5259498Z #define __FSBLKCNT64_T_TYPE __UQUAD_TYPE 2025-05-07T19:46:30.5259782Z #define __cpp_inheriting_constructors 201511L 2025-05-07T19:46:30.5259949Z #define __WIFCONTINUED(status) ((status) == __W_CONTINUED) 2025-05-07T19:46:30.5260219Z #define CUDA_DOUBLE_MATH_FUNCTIONS 1 2025-05-07T19:46:30.5260352Z #define _GLIBCXX_HAVE_FENV_H 1 2025-05-07T19:46:30.5260461Z #define _GLIBCXX_HAVE_STDBOOL_H 1 2025-05-07T19:46:30.5260561Z #define __SIZEOF_FLOAT128__ 16 
2025-05-07T19:46:30.5260695Z #define __INT_LEAST64_MAX__ 0x7fffffffffffffffL 2025-05-07T19:46:30.5260842Z #define _GLIBCXX_TR1_HYPERGEOMETRIC_TCC 1 2025-05-07T19:46:30.5260977Z #define _GLIBCXX_DEBUG_PEDASSERT(_Condition) 2025-05-07T19:46:30.5261071Z #define __clock_t_defined 1 2025-05-07T19:46:30.5261215Z #define _POSIX_SEM_VALUE_MAX 32767 2025-05-07T19:46:30.5261339Z #define __cudaCDP2RuntimeGetVersion 2025-05-07T19:46:30.5261441Z #define __GLIBC_MINOR__ 17 2025-05-07T19:46:30.5261540Z #define __DEC64_MIN__ 1E-383DD 2025-05-07T19:46:30.5261672Z #define __WINT_TYPE__ unsigned int 2025-05-07T19:46:30.5261797Z #define __UINT_LEAST32_TYPE__ unsigned int 2025-05-07T19:46:30.5261893Z #define __SIZEOF_SHORT__ 2 2025-05-07T19:46:30.5262099Z #define __FLT32_NORM_MAX__ 3.40282346638528859811704183484516925e+38F32 2025-05-07T19:46:30.5262189Z #define __SSE__ 1 2025-05-07T19:46:30.5262296Z #define SEM_VALUE_MAX (2147483647) 2025-05-07T19:46:30.5262397Z #define M_SQRT1_2 0.70710678118654752440 2025-05-07T19:46:30.5262508Z #define _CTYPE_H 1 2025-05-07T19:46:30.5262609Z #define __sigset_t_defined 2025-05-07T19:46:30.5262713Z #define __LDBL_MIN_EXP__ (-16381) 2025-05-07T19:46:30.5262832Z #define _GLIBCXX_HAVE_LOGF 1 2025-05-07T19:46:30.5262924Z #define MOD_TAI ADJ_TAI 2025-05-07T19:46:30.5263032Z #define _IO_va_list __gnuc_va_list 2025-05-07T19:46:30.5263211Z #define _GLIBCXX_HAVE_LOGL 1 2025-05-07T19:46:30.5263323Z #define __SM_70_RT_H__ 2025-05-07T19:46:30.5263427Z #define _GLIBCXX_HAVE_WRITEV 1 2025-05-07T19:46:30.5263543Z #define cudaEventWaitDefault 0x00 2025-05-07T19:46:30.5263666Z #define _GLIBCXX_HAVE_EXPL 1 2025-05-07T19:46:30.5263843Z #define __FLT64_MAX__ 1.79769313486231570814527423731704357e+308F64 2025-05-07T19:46:30.5263944Z #define _POSIX_MAX_CANON 255 2025-05-07T19:46:30.5264090Z #define _GLIBCXX_NOEXCEPT_PARM , bool _NE 2025-05-07T19:46:30.5264200Z #define FD_SETSIZE __FD_SETSIZE 2025-05-07T19:46:30.5264301Z #define _GLIBCXX_TXN_SAFE 
2025-05-07T19:46:30.5264389Z #define __amd64__ 1 2025-05-07T19:46:30.5264510Z #define __WINT_WIDTH__ 32 2025-05-07T19:46:30.5264628Z #define __CUDA_DEVICE_RUNTIME_API_H__ 2025-05-07T19:46:30.5264926Z #define __REDIRECT_NTHNL(name,proto,alias) name proto __THROWNL __asm__ (__ASMNAME (#alias)) 2025-05-07T19:46:30.5265043Z #define _GLIBCXX_STDIO_SEEK_CUR 1 2025-05-07T19:46:30.5265131Z #define EOF (-1) 2025-05-07T19:46:30.5265244Z #define __WAIT_STATUS_DEFN void * 2025-05-07T19:46:30.5265348Z #define __USE_POSIX199309 1 2025-05-07T19:46:30.5265474Z #define __INT_LEAST64_WIDTH__ 64 2025-05-07T19:46:30.5265574Z #define __LDBL_MAX_EXP__ 16384 2025-05-07T19:46:30.5265679Z #define __FLT32X_MAX_10_EXP__ 308 2025-05-07T19:46:30.5265817Z #define LLONG_MIN (-LLONG_MAX-1) 2025-05-07T19:46:30.5265939Z #define cudaSurfaceType2DLayered 0xF2 2025-05-07T19:46:30.5266043Z #define ____mbstate_t_defined 1 2025-05-07T19:46:30.5266144Z #define STA_NANO 0x2000 2025-05-07T19:46:30.5266272Z #define _GLIBCXX_HAVE_LOG10F 1 2025-05-07T19:46:30.5266380Z #define _GLIBCXX_HAVE_LOG10L 1 2025-05-07T19:46:30.5266482Z #define _IO_LINKED 0x80 2025-05-07T19:46:30.5266591Z #define __cpp_lib_launder 201606 2025-05-07T19:46:30.5266692Z #define __SIZEOF_INT128__ 16 2025-05-07T19:46:30.5266821Z #define __PTHREAD_MUTEX_HAVE_PREV 1 2025-05-07T19:46:30.5266918Z #define __FLT64X_IS_IEC_60559__ 2 2025-05-07T19:46:30.5267024Z #define _GLIBCXX_TYPE_TRAITS 1 2025-05-07T19:46:30.5267200Z #define cudaGraphKernelNodePortProgrammatic 1 2025-05-07T19:46:30.5267313Z #define __DEVICE_ATOMIC_FUNCTIONS_HPP__ 2025-05-07T19:46:30.5267426Z #define __BLKCNT64_T_TYPE __SQUAD_TYPE 2025-05-07T19:46:30.5267527Z #define __LDBL_MAX_10_EXP__ 4932 2025-05-07T19:46:30.5267705Z #define __W_CONTINUED 0xffff 2025-05-07T19:46:30.5267801Z #define __ATOMIC_RELAXED 0 2025-05-07T19:46:30.5267937Z #define w_coredump __wait_terminated.__w_coredump 2025-05-07T19:46:30.5268086Z #define __FSBLKCNT_T_TYPE __SYSCALL_ULONG_TYPE 
2025-05-07T19:46:30.5268290Z #define __cudaCDP2OccupancyMaxActiveBlocksPerMultiprocessor 2025-05-07T19:46:30.5268490Z #define __DBL_EPSILON__ double(2.22044604925031308084726333618164062e-16L) 2025-05-07T19:46:30.5268584Z #define __stub_stty 2025-05-07T19:46:30.5268783Z #define _tolower(c) ((int) (*__ctype_tolower_loc ())[(int) (c)]) 2025-05-07T19:46:30.5268876Z #define le16toh(x) (x) 2025-05-07T19:46:30.5268989Z #define BC_SCALE_MAX _POSIX2_BC_SCALE_MAX 2025-05-07T19:46:30.5269202Z #define __FLT128_MIN__ 3.36210314311209350626267781732175260e-4932F128 2025-05-07T19:46:30.5269285Z #define _SIZET_ 2025-05-07T19:46:30.5269384Z #define XATTR_NAME_MAX 255 2025-05-07T19:46:30.5269482Z #define _SVID_SOURCE 1 2025-05-07T19:46:30.5269584Z #define _LP64 1 2025-05-07T19:46:30.5269687Z #define _LIBC_LIMITS_H_ 1 2025-05-07T19:46:30.5269949Z #define __REDIRECT_NTH_LDBL(name,proto,alias) __REDIRECT_NTH (name, proto, alias) 2025-05-07T19:46:30.5270094Z #define _GLIBCXX_TR1_BESSEL_FUNCTION_TCC 1 2025-05-07T19:46:30.5270183Z #define __UINT8_C(c) c 2025-05-07T19:46:30.5270280Z #define _GLIBCXX_HAVE_CEILF 1 2025-05-07T19:46:30.5270385Z #define _GLIBCXX_HAVE_CEILL 1 2025-05-07T19:46:30.5270527Z #define __cudaCDP2Memset3DAsync_ptsz 2025-05-07T19:46:30.5270627Z #define __CUDA_ARCH_LIST__ 520 2025-05-07T19:46:30.5270721Z #define __FLT64_MAX_EXP__ 1024 2025-05-07T19:46:30.5270847Z #define MOD_MAXERROR ADJ_MAXERROR 2025-05-07T19:46:30.5270933Z #define CUDARTAPI 2025-05-07T19:46:30.5271017Z #define IOV_MAX 1024 2025-05-07T19:46:30.5271230Z #define __glibcxx_requires_irreflexive2(_First,_Last) 2025-05-07T19:46:30.5271367Z #define __INT_LEAST32_TYPE__ int 2025-05-07T19:46:30.5271470Z #define cudaMemAttachSingle 0x04 2025-05-07T19:46:30.5271556Z #define __wchar_t__ 2025-05-07T19:46:30.5271691Z #define __cpp_lib_is_aggregate 201703 2025-05-07T19:46:30.5271778Z #define SEEK_END 2 2025-05-07T19:46:30.5271867Z #define __SIZEOF_WCHAR_T__ 4 2025-05-07T19:46:30.5272085Z #define 
_GLIBCXX_USE_TBB_PAR_BACKEND __has_include() 2025-05-07T19:46:30.5272190Z #define _IO_ftrylockfile(_fp) 2025-05-07T19:46:30.5272444Z #define _GLIBCXX_USE_C99_WCHAR _GLIBCXX11_USE_C99_WCHAR 2025-05-07T19:46:30.5272533Z #define ____FILE_defined 1 2025-05-07T19:46:30.5272678Z #define _GLIBCXX_HAVE_BUILTIN_IS_AGGREGATE 1 2025-05-07T19:46:30.5272781Z #define __GNUC_PATCHLEVEL__ 0 2025-05-07T19:46:30.5272867Z #define _ISOC99_SOURCE 1 2025-05-07T19:46:30.5272981Z #define __VECTOR_FUNCTIONS_H__ 2025-05-07T19:46:30.5273235Z #define __REDIRECT_NTH(name,proto,alias) name proto __THROW __asm__ (__ASMNAME (#alias)) 2025-05-07T19:46:30.5273363Z #define _PSTL_USE_NONTEMPORAL_STORES_IF_ALLOWED 2025-05-07T19:46:30.5273445Z #define _IO_RIGHT 04 2025-05-07T19:46:30.5273567Z #define __END_NAMESPACE_STD 2025-05-07T19:46:30.5273763Z #define __FLT128_NORM_MAX__ 1.18973149535723176508575932662800702e+4932F128 2025-05-07T19:46:30.5273851Z #define _GLIBCXX_STD_C std 2025-05-07T19:46:30.5273976Z #define cudaInitDeviceFlagsAreValid 0x01 2025-05-07T19:46:30.5274071Z #define _LARGEFILE64_SOURCE 1 2025-05-07T19:46:30.5274178Z #define _GLIBCXX_USE_C99_STDINT_TR1 1 2025-05-07T19:46:30.5274256Z #define _STDDEF_H_ 2025-05-07T19:46:30.5274454Z #define __FLT64_NORM_MAX__ 1.79769313486231570814527423731704357e+308F64 2025-05-07T19:46:30.5274544Z #define __FLT128_HAS_QUIET_NAN__ 1 2025-05-07T19:46:30.5274658Z #define isalnum_l(c,l) __isalnum_l ((c), (l)) 2025-05-07T19:46:30.5274876Z #define __FD_ISSET(d,set) ((__FDS_BITS (set)[__FD_ELT (d)] & __FD_MASK (d)) != 0) 2025-05-07T19:46:30.5274993Z #define __INTMAX_MAX__ 0x7fffffffffffffffL 2025-05-07T19:46:30.5275137Z #define __glibcxx_requires_irreflexive(_First,_Last) 2025-05-07T19:46:30.5275270Z #define cudaGraphKernelNodePortDefault 0 2025-05-07T19:46:30.5275370Z #define __INT_FAST8_TYPE__ signed char 2025-05-07T19:46:30.5275525Z #define __cudaCDP2Memcpy3DAsync_ptsz 2025-05-07T19:46:30.5275621Z #define __PID_T_TYPE __S32_TYPE 2025-05-07T19:46:30.5275749Z 
#define __cpp_namespace_attributes 201411L 2025-05-07T19:46:30.5275849Z #define CHARCLASS_NAME_MAX 2048 2025-05-07T19:46:30.5275947Z #define _GLIBCXX_HAVE_TANF 1 2025-05-07T19:46:30.5276053Z #define _GLIBCXX_USE_ST_MTIM 1 2025-05-07T19:46:30.5276216Z #define __FLT64X_MIN__ 3.36210314311209350626267781732175260e-4932F64x 2025-05-07T19:46:30.5276312Z #define __CUDA_RUNTIME_H__ 2025-05-07T19:46:30.5276493Z #define WIFSIGNALED(status) __WIFSIGNALED (__WAIT_INT (status)) 2025-05-07T19:46:30.5276612Z #define _GLIBCXX_HAVE_STDLIB_H 1 2025-05-07T19:46:30.5276701Z #define __STDCPP_THREADS__ 1 2025-05-07T19:46:30.5276851Z #define M_2_SQRTPIl 1.128379167095512573896158903121545172L 2025-05-07T19:46:30.5276954Z #define __GNUC_STDC_INLINE__ 1 2025-05-07T19:46:30.5277043Z #define _POSIX_UIO_MAXIOV 16 2025-05-07T19:46:30.5277148Z #define _PSTL_PAR_BACKEND_SERIAL 2025-05-07T19:46:30.5277255Z #define P_tmpdir "/tmp" 2025-05-07T19:46:30.5277387Z #define __ASSERT_FUNCTION __PRETTY_FUNCTION__ 2025-05-07T19:46:30.5277482Z #define __FLT64_HAS_DENORM__ 1 2025-05-07T19:46:30.5277581Z #define __WORDSIZE_TIME64_COMPAT32 1 2025-05-07T19:46:30.5277755Z #define _GLIBCXX_DEPRECATED __attribute__ ((__deprecated__)) 2025-05-07T19:46:30.5277917Z #define __FLT32_EPSILON__ 1.19209289550781250000000000000000000e-7F32 2025-05-07T19:46:30.5278018Z #define _PSTL_HIDE_FROM_ABI_PUSH 2025-05-07T19:46:30.5278153Z #define cudaStreamLegacy ((cudaStream_t)0x1) 2025-05-07T19:46:30.5278263Z #define _IO_cleanup_region_start(_fct,_fp) 2025-05-07T19:46:30.5278358Z #define __location__(a) __annotate__(a) 2025-05-07T19:46:30.5278635Z #define __device_builtin_surface_type__ __location__(device_builtin_surface_type) 2025-05-07T19:46:30.5278751Z #define _POSIX2_BC_BASE_MAX 99 2025-05-07T19:46:30.5278859Z #define __cudaCDP2DeviceGetAttribute 2025-05-07T19:46:30.5278958Z #define __DBL_DECIMAL_DIG__ 17 2025-05-07T19:46:30.5279080Z #define __STDC_UTF_32__ 1 2025-05-07T19:46:30.5279168Z #define __INT_FAST8_WIDTH__ 8 
2025-05-07T19:46:30.5279262Z #define NAN (__builtin_nanf ("")) 2025-05-07T19:46:30.5279364Z #define _POSIX_MQ_PRIO_MAX 32 2025-05-07T19:46:30.5279461Z #define __FXSR__ 1 2025-05-07T19:46:30.5279534Z #define _SIZE_T 2025-05-07T19:46:30.5279625Z #define _GLIBCXX_USE_GETTIMEOFDAY 1 2025-05-07T19:46:30.5279743Z #define cudaHostRegisterReadOnly 0x08 2025-05-07T19:46:30.5279901Z #define __FLT32X_MAX__ 1.79769313486231570814527423731704357e+308F32x 2025-05-07T19:46:30.5280038Z #define __WIFSTOPPED(status) (((status) & 0xff) == 0x7f) 2025-05-07T19:46:30.5280122Z #define _IO_ssize_t __ssize_t 2025-05-07T19:46:30.5280222Z #define __ULONG32_TYPE unsigned int 2025-05-07T19:46:30.5280400Z #define __DBL_NORM_MAX__ double(1.79769313486231570814527423731704357e+308L) 2025-05-07T19:46:30.5280593Z #define cudaStreamGraphTailLaunch (cudaStream_t)0x0100000000000000 2025-05-07T19:46:30.5280696Z #define _GXX_NULLPTR_T 2025-05-07T19:46:30.5280818Z #define __glibcxx_class_requires3(_a,_b,_c,_d) 2025-05-07T19:46:30.5280904Z #define FOPEN_MAX 16 2025-05-07T19:46:30.5281017Z #define __BIG_ENDIAN 4321 2025-05-07T19:46:30.5281135Z #define __BYTE_ORDER__ __ORDER_LITTLE_ENDIAN__ 2025-05-07T19:46:30.5281225Z #define __suseconds_t_defined 2025-05-07T19:46:30.5281307Z #define __off_t_defined 2025-05-07T19:46:30.5281419Z #define stderr stderr 2025-05-07T19:46:30.5281516Z #define M_LOG10E 0.43429448190325182765 2025-05-07T19:46:30.5281624Z #define __glibcxx_requires_string(_String) 2025-05-07T19:46:30.5281741Z #define _GLIBCXX_HAVE_LDEXPL 1 2025-05-07T19:46:30.5281830Z #define __INTMAX_WIDTH__ 64 2025-05-07T19:46:30.5282590Z #define _PSTL_CPP14_2RANGE_MISMATCH_EQUAL_PRESENT (_MSC_VER >= 1900 || __cplusplus >= 201300L || __cpp_lib_robust_nonmodifying_seq_ops == 201304) 2025-05-07T19:46:30.5282687Z #define __mode_t_defined 2025-05-07T19:46:30.5282802Z #define _GCC_SIZE_T 2025-05-07T19:46:30.5282906Z #define __INO64_T_TYPE __UQUAD_TYPE 2025-05-07T19:46:30.5283167Z #define __cpp_runtime_arrays 198712L 
2025-05-07T19:46:30.5283303Z #define __UINT64_TYPE__ long unsigned int 2025-05-07T19:46:30.5283397Z #define __USE_XOPEN2K8XSI 1 2025-05-07T19:46:30.5283496Z #define __UINT32_C(c) c ## U 2025-05-07T19:46:30.5283605Z #define __cpp_alias_templates 200704L 2025-05-07T19:46:30.5283745Z #define cudaHostAllocMapped 0x02 2025-05-07T19:46:30.5283862Z #define __DEVICE_LAUNCH_PARAMETERS_H__ 2025-05-07T19:46:30.5283953Z #define _STL_ITERATOR_H 1 2025-05-07T19:46:30.5284057Z #define __size_t__ 2025-05-07T19:46:30.5284199Z #define cudaStreamAttrID cudaLaunchAttributeID 2025-05-07T19:46:30.5284298Z #define _GLIBCXX_HAVE_ATANF 1 2025-05-07T19:46:30.5284406Z #define cudaEventRecordExternal 0x01 2025-05-07T19:46:30.5284589Z #define __isspace_l(c,l) __isctype_l((c), _ISspace, (l)) 2025-05-07T19:46:30.5284688Z #define _IO_BUFSIZ _G_BUFSIZ 2025-05-07T19:46:30.5284873Z #define __FLT_DENORM_MIN__ 1.40129846432481707092372958328991613e-45F 2025-05-07T19:46:30.5284984Z #define _ENDIAN_H 1 2025-05-07T19:46:30.5285104Z #define __builtin_align__(a) __align__(a) 2025-05-07T19:46:30.5285201Z #define _GLIBCXX20_CONSTEXPR 2025-05-07T19:46:30.5285306Z #define __NV_NO_HOST_COMPILER_CHECK 1 2025-05-07T19:46:30.5285408Z #define __try try 2025-05-07T19:46:30.5285506Z #define _GLIBCXX_HAVE_FINITE 1 2025-05-07T19:46:30.5285600Z #define __FLT128_IS_IEC_60559__ 2 2025-05-07T19:46:30.5285720Z #define __INT8_MAX__ 0x7f 2025-05-07T19:46:30.5286006Z #define cudaStreamGetCaptureInfo __CUDART_API_PTSZ(cudaStreamGetCaptureInfo_v2) 2025-05-07T19:46:30.5286101Z #define __LONG_WIDTH__ 64 2025-05-07T19:46:30.5286204Z #define __PIC__ 2 2025-05-07T19:46:30.5286326Z #define BC_STRING_MAX _POSIX2_BC_STRING_MAX 2025-05-07T19:46:30.5286524Z #define __UINT_FAST32_TYPE__ long unsigned int 2025-05-07T19:46:30.5286659Z #define FD_ISSET(fd,fdsetp) __FD_ISSET (fd, fdsetp) 2025-05-07T19:46:30.5286782Z #define _GLIBCXX_HAVE_FLOAT_H 1 2025-05-07T19:46:30.5286878Z #define _GLIBCXX_HAVE_ATANL 1 2025-05-07T19:46:30.5287083Z #define 
__FLT32X_NORM_MAX__ 1.79769313486231570814527423731704357e+308F32x 2025-05-07T19:46:30.5287206Z #define __DEVICE_FUNCTIONS_HPP__ 2025-05-07T19:46:30.5287309Z #define __CHAR32_TYPE__ unsigned int 2025-05-07T19:46:30.5287407Z #define _IO_uid_t __uid_t 2025-05-07T19:46:30.5287508Z #define _GLIBCXX_HAVE_READLINK 1 2025-05-07T19:46:30.5287656Z #define __cudaCDP2EventRecordWithFlags_ptsz 2025-05-07T19:46:30.5287765Z #define _CONCEPT_CHECK_H 1 2025-05-07T19:46:30.5287920Z #define __FLT_MAX__ 3.40282346638528859811704183484516925e+38F 2025-05-07T19:46:30.5288041Z #define _GLIBCXX_HAVE_NETINET_IN_H 1 2025-05-07T19:46:30.5288172Z #define _GLIBCXX_TR1_SPECIAL_FUNCTION_UTIL_H 1 2025-05-07T19:46:30.5288262Z #define LONG_BIT 64 2025-05-07T19:46:30.5288386Z #define __SIZEOF_PTHREAD_BARRIERATTR_T 4 2025-05-07T19:46:30.5288503Z #define _GLIBCXX_USE_ALLOCATOR_NEW 1 2025-05-07T19:46:30.5288640Z #define __cpp_lib_math_special_functions 201603L 2025-05-07T19:46:30.5288743Z #define __fsfilcnt_t_defined 2025-05-07T19:46:30.5288862Z #define __blkcnt_t_defined 2025-05-07T19:46:30.5289147Z #define cudaKernelNodeAttributeMemSyncDomain cudaLaunchAttributeMemSyncDomain 2025-05-07T19:46:30.5289250Z #define __USE_LARGEFILE 1 2025-05-07T19:46:30.5289373Z #define __cpp_constexpr 201603L 2025-05-07T19:46:30.5289470Z #define CUDART_VERSION 12060 2025-05-07T19:46:30.5289568Z #define NL_TEXTMAX INT_MAX 2025-05-07T19:46:30.5289671Z #define cudaDeviceMapHost 0x08 2025-05-07T19:46:30.5289787Z #define _GLIBCXX_CMATH 1 2025-05-07T19:46:30.5290000Z #define __attribute_format_arg__(x) __attribute__ ((__format_arg__ (x))) 2025-05-07T19:46:30.5290106Z #define __lldiv_t_defined 1 2025-05-07T19:46:30.5290226Z #define __SSE2__ 1 2025-05-07T19:46:30.5290324Z #define _IOLBF 1 2025-05-07T19:46:30.5290445Z #define _GLIBCXX_HAVE_SYS_TYPES_H 1 2025-05-07T19:46:30.5290551Z #define _GLIBCXX_HAVE_FLOORF 1 2025-05-07T19:46:30.5290700Z #define __cpp_deduction_guides 201703L 2025-05-07T19:46:30.5290809Z #define 
_GLIBCXX_HAVE_EXPF 1 2025-05-07T19:46:30.5291018Z #define __annotate__(a) __attribute__((a)) 2025-05-07T19:46:30.5291146Z #define __INT32_TYPE__ int 2025-05-07T19:46:30.5291251Z #define __SIZEOF_DOUBLE__ 8 2025-05-07T19:46:30.5291376Z #define cudaDeviceSyncMemops 0x80 2025-05-07T19:46:30.5291491Z #define __cpp_exceptions 199711L 2025-05-07T19:46:30.5291626Z #define __FLT_MIN_10_EXP__ (-37) 2025-05-07T19:46:30.5291751Z #define cudaDeviceScheduleYield 0x02 2025-05-07T19:46:30.5291855Z #define _SYS_SYSMACROS_H 1 2025-05-07T19:46:30.5292021Z #define _GLIBCXX_TR1_LEGENDRE_FUNCTION_TCC 1 2025-05-07T19:46:30.5292207Z #define __FLT64_MIN__ 2.22507385850720138309023271733240406e-308F64 2025-05-07T19:46:30.5292322Z #define __INT_LEAST32_WIDTH__ 32 2025-05-07T19:46:30.5292435Z #define __SWORD_TYPE long int 2025-05-07T19:46:30.5292579Z #define __INTMAX_TYPE__ long int 2025-05-07T19:46:30.5292680Z #define _GLIBCXX11_USE_C99_MATH 1 2025-05-07T19:46:30.5292792Z #define __PTHREAD_SPINS 0, 0 2025-05-07T19:46:30.5292931Z #define _BITS_POSIX1_LIM_H 1 2025-05-07T19:46:30.5293247Z #define cudaStreamAttributeMemSyncDomainMap cudaLaunchAttributeMemSyncDomainMap 2025-05-07T19:46:30.5293360Z #define __DEC128_MAX_EXP__ 6145 2025-05-07T19:46:30.5293527Z #define math_errhandling (MATH_ERRNO | MATH_ERREXCEPT) 2025-05-07T19:46:30.5293656Z #define _T_SIZE 2025-05-07T19:46:30.5293780Z #define cudaHostAllocDefault 0x00 2025-05-07T19:46:30.5293927Z #define _PSTL_PRAGMA_SIMD_EXCLUSIVE_SCAN(PRM) 2025-05-07T19:46:30.5294103Z #define __va_arg_pack() __builtin_va_arg_pack () 2025-05-07T19:46:30.5294218Z #define _POSIX_TIMER_MAX 32 2025-05-07T19:46:30.5294436Z #define _GLIBCXX_HAVE_TLS 1 2025-05-07T19:46:30.5294574Z #define _GLIBCXX_NOTHROW _GLIBCXX_USE_NOEXCEPT 2025-05-07T19:46:30.5294819Z #define _GLIBCXX_HAVE_ACOSL 1 2025-05-07T19:46:30.5294999Z #define __FLT32X_HAS_QUIET_NAN__ 1 2025-05-07T19:46:30.5295103Z #define __ATOMIC_CONSUME 1 2025-05-07T19:46:30.5295319Z #define 
__CUDA_ARCH_HAS_FEATURE__(_FEAT) __CUDA_ARCH_FEAT_ ##_FEAT 2025-05-07T19:46:30.5295420Z #define __GNUC_MINOR__ 4 2025-05-07T19:46:30.5295535Z #define __GLIBCXX_TYPE_INT_N_0 __int128 2025-05-07T19:46:30.5295668Z #define __INT_FAST16_WIDTH__ 64 2025-05-07T19:46:30.5295795Z #define __UINTMAX_MAX__ 0xffffffffffffffffUL 2025-05-07T19:46:30.5295891Z #define __PIE__ 2 2025-05-07T19:46:30.5296002Z #define LITTLE_ENDIAN __LITTLE_ENDIAN 2025-05-07T19:46:30.5296144Z #define _GLIBCXX_HAVE_INT64_T_LONG 1 2025-05-07T19:46:30.5296351Z #define __FLT32X_DENORM_MIN__ 4.94065645841246544176568792868221372e-324F32x 2025-05-07T19:46:30.5296581Z #define __intN_t(N,MODE) typedef int int ##N ##_t __attribute__ ((__mode__ (MODE))) 2025-05-07T19:46:30.5296713Z #define __nlink_t_defined 2025-05-07T19:46:30.5296850Z #define _GLIBCXX17_DEPRECATED [[__deprecated__]] 2025-05-07T19:46:30.5296978Z #define _PSTL_STRING(x) _PSTL_STRING_AUX(x) 2025-05-07T19:46:30.5297081Z #define _XOPEN_LIM_H 1 2025-05-07T19:46:30.5297391Z #define __u_intN_t(N,MODE) typedef unsigned int u_int ##N ##_t __attribute__ ((__mode__ (MODE))) 2025-05-07T19:46:30.5297525Z #define __cpp_template_template_args 201611L 2025-05-07T19:46:30.5297639Z #define _GTHREAD_USE_MUTEX_TIMEDLOCK 1 2025-05-07T19:46:30.5297785Z #define BC_DIM_MAX _POSIX2_BC_DIM_MAX 2025-05-07T19:46:30.5297892Z #define __DBL_MAX_10_EXP__ 308 2025-05-07T19:46:30.5297987Z #define __FILE_defined 1 2025-05-07T19:46:30.5298172Z #define __LDBL_DENORM_MIN__ 3.64519953188247460252840593361941982e-4951L 2025-05-07T19:46:30.5298295Z #define _GLIBCXX_HAVE_SINCOS 1 2025-05-07T19:46:30.5298402Z #define __USE_XOPEN_EXTENDED 1 2025-05-07T19:46:30.5298516Z #define __cpp_lib_tuple_element_t 201402L 2025-05-07T19:46:30.5298669Z #define isascii_l(c,l) __isascii_l ((c), (l)) 2025-05-07T19:46:30.5298781Z #define cudaInvalidDeviceId ((int)-2) 2025-05-07T19:46:30.5298892Z #define _GLIBCXX_HAVE_SYS_RESOURCE_H 1 2025-05-07T19:46:30.5299008Z #define __INT16_C(c) c 
2025-05-07T19:46:30.5299102Z #define __U32_TYPE unsigned int 2025-05-07T19:46:30.5299204Z #define _GLIBCXX_HAVE_SYS_IOCTL_H 1 2025-05-07T19:46:30.5299333Z #define FD_CLR(fd,fdsetp) __FD_CLR (fd, fdsetp) 2025-05-07T19:46:30.5299493Z #define __STDC__ 1 2025-05-07T19:46:30.5299596Z #define _GLIBCXX_HAVE_VWSCANF 1 2025-05-07T19:46:30.5299773Z #define _GLIBCXX_HAVE_EXECINFO_H 1 2025-05-07T19:46:30.5299906Z #define _GLIBCXX_USE_REALPATH 1 2025-05-07T19:46:30.5300230Z #define __attribute_malloc__ __attribute__ ((__malloc__)) 2025-05-07T19:46:30.5300336Z #define __FLT32X_DIG__ 15 2025-05-07T19:46:30.5300448Z #define _GLIBCXX_USE_C99_CTYPE_TR1 1 2025-05-07T19:46:30.5300591Z #define __PTRDIFF_TYPE__ long int 2025-05-07T19:46:30.5300788Z #define cudaArrayDeferredMapping 0x80 2025-05-07T19:46:30.5300912Z #define _GLIBCXX_END_NAMESPACE_CONTAINER 2025-05-07T19:46:30.5301055Z #define USHRT_MAX (SHRT_MAX * 2 + 1) 2025-05-07T19:46:30.5301177Z #define __cpp_lib_is_swappable 201603 2025-05-07T19:46:30.5301273Z #define stdin stdin 2025-05-07T19:46:30.5301376Z #define __ino64_t_defined 2025-05-07T19:46:30.5301506Z #define STA_CLK 0x8000 2025-05-07T19:46:30.5301611Z #define __clockid_t_defined 1 2025-05-07T19:46:30.5301779Z #define _GLIBCXX_NOEXCEPT_IF(...) noexcept(__VA_ARGS__) 2025-05-07T19:46:30.5301992Z #define __attribute_noinline__ __attribute__ ((__noinline__)) 2025-05-07T19:46:30.5302109Z #define __cudaCDP2MemsetAsync 2025-05-07T19:46:30.5302226Z #define _PSTL_PRAGMA_SIMD_SCAN(PRM) 2025-05-07T19:46:30.5302345Z #define _GLIBCXX_BEGIN_NAMESPACE_LDBL 2025-05-07T19:46:30.5302494Z #define _GLIBCXX_TR1_POLY_HERMITE_TCC 1 2025-05-07T19:46:30.5302718Z #define __FD_SET(d,set) ((void) (__FDS_BITS (set)[__FD_ELT (d)] |= __FD_MASK (d))) 2025-05-07T19:46:30.5302827Z #define __ATOMIC_SEQ_CST 5 2025-05-07T19:46:30.5303461Z #define __tobody(c,f,a,args) (__extension__ ({ int __res; if (sizeof (c) > 1) { if (__builtin_constant_p (c)) { int __c = (c); __res = __c < -128 || __c > 255 ? 
__c : (a)[__c]; } else __res = f args; } else __res = (a)[(int) (c)]; __res; })) 2025-05-07T19:46:30.5303618Z #define DOMAIN 1 2025-05-07T19:46:30.5303722Z #define M_LN2 0.69314718055994530942 2025-05-07T19:46:30.5303852Z #define __NVCC__ 1 2025-05-07T19:46:30.5303975Z #define __cudaCDP2Memset2DAsync 2025-05-07T19:46:30.5304105Z #define __CLOCK_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:46:30.5304224Z #define _PSTL_PRAGMA_SIMD_EARLYEXIT 2025-05-07T19:46:30.5304371Z #define __throw_exception_again throw 2025-05-07T19:46:30.5304477Z #define M_SQRT2 1.41421356237309504880 2025-05-07T19:46:30.5304580Z #define __EXCEPTION_H 1 2025-05-07T19:46:30.5304716Z #define __FLT32X_MIN_10_EXP__ (-307) 2025-05-07T19:46:30.5304835Z #define HUGE_VAL (__builtin_huge_val()) 2025-05-07T19:46:30.5305167Z #define cudaStreamAttributeAccessPolicyWindow cudaLaunchAttributeAccessPolicyWindow 2025-05-07T19:46:30.5305324Z #define __UINTPTR_TYPE__ long unsigned int 2025-05-07T19:46:30.5305436Z #define _GLIBCXX_INLINE_VERSION 0 2025-05-07T19:46:30.5305550Z #define _GLIBCXX_USE_INT128 1 2025-05-07T19:46:30.5305666Z #define __cpp_lib_bool_constant 201505 2025-05-07T19:46:30.5305795Z #define PTHREAD_KEYS_MAX 1024 2025-05-07T19:46:30.5305947Z #define __DEC64_SUBNORMAL_MIN__ 0.000000000000001E-383DD 2025-05-07T19:46:30.5306069Z #define __FSFILCNT64_T_TYPE __UQUAD_TYPE 2025-05-07T19:46:30.5306224Z #define _GLIBCXX_DOUBLE_IS_IEEE_BINARY64 1 2025-05-07T19:46:30.5306332Z #define __DEC128_MANT_DIG__ 34 2025-05-07T19:46:30.5306451Z #define __cpp_lib_tuples_by_type 201304 2025-05-07T19:46:30.5306562Z #define __LDBL_MIN_10_EXP__ (-4931) 2025-05-07T19:46:30.5306707Z #define __cpp_generic_lambdas 201304L 2025-05-07T19:46:30.5306858Z #define _GLIBCXX_THROW_OR_ABORT(_EXC) (throw (_EXC)) 2025-05-07T19:46:30.5306970Z #define __useconds_t_defined 2025-05-07T19:46:30.5307111Z #define _GLIBCXX_USE_SCHED_YIELD 1 2025-05-07T19:46:30.5307314Z #define __attribute_deprecated__ __attribute__ ((__deprecated__)) 
2025-05-07T19:46:30.5307484Z #define __cpp_lib_type_trait_variable_templates 201510L 2025-05-07T19:46:30.5307586Z #define __SSE_MATH__ 1 2025-05-07T19:46:30.5307716Z #define _IO_wint_t wint_t 2025-05-07T19:46:30.5307823Z #define __SIZEOF_LONG_LONG__ 8 2025-05-07T19:46:30.5307928Z #define _GLIBCXX_VERBOSE 1 2025-05-07T19:46:30.5308111Z #define _GLIBCXX_HAVE_ASINF 1 2025-05-07T19:46:30.5308245Z #define __cpp_user_defined_literals 200809L 2025-05-07T19:46:30.5308355Z #define _GLIBCXX_HAVE_ISINFL 1 2025-05-07T19:46:30.5308462Z #define _GLIBCXX_HAVE_ASINL 1 2025-05-07T19:46:30.5308585Z #define __USE_ATFILE 1 2025-05-07T19:46:30.5308690Z #define _POSIX_OPEN_MAX 20 2025-05-07T19:46:30.5308800Z #define _POSIX_LOGIN_NAME_MAX 9 2025-05-07T19:46:30.5308930Z #define _GCC_PTRDIFF_T 2025-05-07T19:46:30.5309174Z #define cudaKernelNodeAttributePriority cudaLaunchAttributePriority 2025-05-07T19:46:30.5309281Z #define __FLT128_DECIMAL_DIG__ 36 2025-05-07T19:46:30.5309393Z #define _POSIX_THREAD_KEYS_MAX 128 2025-05-07T19:46:30.5309534Z #define __GCC_ATOMIC_LLONG_LOCK_FREE 2 2025-05-07T19:46:30.5309662Z #define __cpp_lib_array_constexpr 201803L 2025-05-07T19:46:30.5309758Z #define _STDLIB_H 1 2025-05-07T19:46:30.5309936Z #define __exctype(name) extern int name (int) __THROW 2025-05-07T19:46:30.5310045Z #define __FLT32_HAS_QUIET_NAN__ 1 2025-05-07T19:46:30.5310153Z #define __FLT_DECIMAL_DIG__ 9 2025-05-07T19:46:30.5310305Z #define __UINT_FAST16_MAX__ 0xffffffffffffffffUL 2025-05-07T19:46:30.5310426Z #define __SURFACE_INDIRECT_FUNCTIONS_H__ 2025-05-07T19:46:30.5310533Z #define __SM_61_INTRINSICS_H__ 2025-05-07T19:46:30.5310734Z #define _GLIBCXX_PACKAGE_STRING "package-unused version-unused" 2025-05-07T19:46:30.5310931Z #define __isxdigit_l(c,l) __isctype_l((c), _ISxdigit, (l)) 2025-05-07T19:46:30.5311051Z #define __glibcxx_requires_nonempty() 2025-05-07T19:46:30.5311182Z #define w_stopsig __wait_stopped.__w_stopsig 2025-05-07T19:46:30.5311316Z #define __ldiv_t_defined 1 
2025-05-07T19:46:30.5311516Z #define __glibcxx_requires_irreflexive_pred(_First,_Last,_Pred) 2025-05-07T19:46:30.5311678Z #define ___int_ptrdiff_t_h 2025-05-07T19:46:30.5311866Z #define __LDBL_NORM_MAX__ 1.18973149535723176502126385303097021e+4932L 2025-05-07T19:46:30.5312006Z #define __cudaCDP2EventDestroy 2025-05-07T19:46:30.5312114Z #define __HOST_DEFINES_H__ 2025-05-07T19:46:30.5312233Z #define __GCC_ATOMIC_SHORT_LOCK_FREE 2 2025-05-07T19:46:30.5312479Z #define __SM_20_ATOMIC_FUNCTIONS_H__ 2025-05-07T19:46:30.5312589Z #define _GLIBCXX_USE_NANOSLEEP 1 2025-05-07T19:46:30.5312678Z #define CUDART_CB 2025-05-07T19:46:30.5312788Z #define BC_BASE_MAX _POSIX2_BC_BASE_MAX 2025-05-07T19:46:30.5312941Z #define _GLIBCXX_USE_C99_INTTYPES_WCHAR_T_TR1 1 2025-05-07T19:46:30.5313037Z #define MB_LEN_MAX 16 2025-05-07T19:46:30.5313273Z #define __glibcxx_requires_partitioned_lower_pred(_First,_Last,_Value,_Pred) 2025-05-07T19:46:30.5313408Z #define _GLIBCXX11_USE_C99_WCHAR 1 2025-05-07T19:46:30.5313531Z #define _IO_peekc(_fp) _IO_peekc_unlocked (_fp) 2025-05-07T19:46:30.5313651Z #define _GLIBCXX_HAVE_AS_SYMVER_DIRECTIVE 1 2025-05-07T19:46:30.5313783Z #define _GLIBCXX_HAVE_UNISTD_H 1 2025-05-07T19:46:30.5313936Z #define __glibc_likely(cond) __builtin_expect((cond), 1) 2025-05-07T19:46:30.5314049Z #define __UINT_FAST8_TYPE__ unsigned char 2025-05-07T19:46:30.5314142Z #define _GNU_SOURCE 1 2025-05-07T19:46:30.5314265Z #define __stub_putmsg 2025-05-07T19:46:30.5314355Z #define __CUDACC__ 1 2025-05-07T19:46:30.5314453Z #define __N(msgid) (msgid) 2025-05-07T19:46:30.5314570Z #define __P(args) args 2025-05-07T19:46:30.5314827Z #define cudaKernelNodeAttributeCooperative cudaLaunchAttributeCooperative 2025-05-07T19:46:30.5314936Z #define __cpp_init_captures 201304L 2025-05-07T19:46:30.5315046Z #define _GLIBCXX17_CONSTEXPR constexpr 2025-05-07T19:46:30.5315171Z #define __ATOMIC_ACQ_REL 4 2025-05-07T19:46:30.5315274Z #define __cpp_lib_as_const 201510 2025-05-07T19:46:30.5315365Z #define 
__WCHAR_T 2025-05-07T19:46:30.5315488Z #define __ATOMIC_RELEASE 3 2025-05-07T19:46:30.5315593Z #define __fsblkcnt_t_defined 2025-05-07T19:46:30.5315715Z #define __cudaCDP2EventCreateWithFlags 2025-05-07T19:46:30.5315827Z #define __DEVICE_DOUBLE_FUNCTIONS_H__ 2025-05-07T19:46:30.5315835Z 2025-05-07T19:46:30.5509791Z 2025-05-07T19:46:30.5510306Z + conda run -n build_binary nvcc --version 2025-05-07T19:46:30.5510320Z 2025-05-07T19:46:32.3442710Z nvcc: NVIDIA (R) Cuda compiler driver 2025-05-07T19:46:32.3443364Z Copyright (c) 2005-2024 NVIDIA Corporation 2025-05-07T19:46:32.3443717Z Built on Tue_Oct_29_23:50:19_PDT_2024 2025-05-07T19:46:32.3444091Z Cuda compilation tools, release 12.6, V12.6.85 2025-05-07T19:46:32.3444453Z Build cuda_12.6.r12.6/compiler.35059454_0 2025-05-07T19:46:32.3444711Z 2025-05-07T19:46:32.4253777Z 2025-05-07T19:46:32.4263101Z which: no nvidia-smi in (CONDA=/github/home/miniconda:/github/home/miniconda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin) 2025-05-07T19:46:32.4263900Z [CHECK] nvidia-smi not found 2025-05-07T19:46:32.4264221Z [INSTALL] Successfully installed CUDA 12.6.3 2025-05-07T19:46:32.4357179Z ##[group]Run . $PRELUDE; install_pytorch_pip $BUILD_ENV nightly cuda/12.6.3 2025-05-07T19:46:32.4357803Z . 
$PRELUDE; install_pytorch_pip $BUILD_ENV nightly cuda/12.6.3 2025-05-07T19:46:32.4358429Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:46:32.4358776Z env: 2025-05-07T19:46:32.4359029Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:46:32.4359333Z BUILD_ENV: build_binary 2025-05-07T19:46:32.4359606Z BUILD_TARGET: default 2025-05-07T19:46:32.4359849Z BUILD_VARIANT: cuda 2025-05-07T19:46:32.4360119Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:46:32.4360378Z ##[endgroup] 2025-05-07T19:46:32.8880569Z ################################################################################ 2025-05-07T19:46:32.8881025Z # Install PyTorch (PIP) 2025-05-07T19:46:32.8881298Z # 2025-05-07T19:46:32.8903146Z # [2025-05-07T19:46:32.889Z] + install_pytorch_pip build_binary nightly cuda/12.6.3 2025-05-07T19:46:32.8904719Z ################################################################################ 2025-05-07T19:46:32.8905884Z 2025-05-07T19:46:32.8937944Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y numpy 2025-05-07T19:46:33.7944782Z Channels: 2025-05-07T19:46:33.7945171Z - conda-forge 2025-05-07T19:46:33.7945444Z Platform: linux-64 2025-05-07T19:46:36.8919938Z Collecting package metadata (repodata.json): - \ | / done 2025-05-07T19:46:38.5018496Z Solving environment: \ | / - done 2025-05-07T19:46:38.7921801Z 2025-05-07T19:46:38.7922122Z ## Package Plan ## 2025-05-07T19:46:38.7922388Z 2025-05-07T19:46:38.7922624Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:46:38.7922961Z 2025-05-07T19:46:38.7923106Z added / updated specs: 2025-05-07T19:46:38.7923390Z - numpy 2025-05-07T19:46:38.7923554Z 2025-05-07T19:46:38.7923558Z 2025-05-07T19:46:38.7923692Z The following packages will be downloaded: 2025-05-07T19:46:38.7923930Z 2025-05-07T19:46:38.7924124Z package | build 2025-05-07T19:46:38.7924508Z ---------------------------|----------------- 2025-05-07T19:46:38.7924956Z libblas-3.9.0 
|31_h59b9bed_openblas 16 KB conda-forge 2025-05-07T19:46:38.7925455Z libcblas-3.9.0 |31_he106b2a_openblas 16 KB conda-forge 2025-05-07T19:46:38.7925990Z liblapack-3.9.0 |31_h7ac8fdf_openblas 16 KB conda-forge 2025-05-07T19:46:38.7926470Z numpy-2.0.2 | py39h9cb892a_1 7.6 MB conda-forge 2025-05-07T19:46:38.7926926Z ------------------------------------------------------------ 2025-05-07T19:46:38.7927327Z Total: 7.6 MB 2025-05-07T19:46:38.7927560Z 2025-05-07T19:46:38.7927704Z The following NEW packages will be INSTALLED: 2025-05-07T19:46:38.7927948Z 2025-05-07T19:46:38.7928226Z libblas conda-forge/linux-64::libblas-3.9.0-31_h59b9bed_openblas 2025-05-07T19:46:38.7928790Z libcblas conda-forge/linux-64::libcblas-3.9.0-31_he106b2a_openblas 2025-05-07T19:46:38.7929400Z liblapack conda-forge/linux-64::liblapack-3.9.0-31_h7ac8fdf_openblas 2025-05-07T19:46:38.7929964Z numpy conda-forge/linux-64::numpy-2.0.2-py39h9cb892a_1 2025-05-07T19:46:38.7930266Z 2025-05-07T19:46:38.7930270Z 2025-05-07T19:46:38.7930274Z 2025-05-07T19:46:38.7930435Z Downloading and Extracting Packages: ...working... 
2025-05-07T19:46:39.1844625Z libcblas-3.9.0 | 16 KB | ########## | 100% 2025-05-07T19:46:39.2307741Z libblas-3.9.0 | 16 KB | ########## | 100% 2025-05-07T19:46:39.2746719Z liblapack-3.9.0 | 16 KB | ########## | 100% 2025-05-07T19:46:39.6296301Z numpy-2.0.2 | 7.6 MB | ########## | 100% 
2025-05-07T19:46:39.6297799Z  2025-05-07T19:46:39.6298036Z 2025-05-07T19:46:39.6298041Z 2025-05-07T19:46:39.6298110Z 2025-05-07T19:46:39.6298305Z  done 2025-05-07T19:46:39.7307265Z Preparing transaction: | done 2025-05-07T19:46:39.9317562Z Verifying transaction: - \ done 2025-05-07T19:46:40.0331594Z Executing transaction: / done 2025-05-07T19:46:40.1396555Z ################################################################################ 2025-05-07T19:46:40.1397102Z # Install Package From PyTorch PIP: torch 2025-05-07T19:46:40.1397440Z # 2025-05-07T19:46:40.1422382Z # [2025-05-07T19:46:40.141Z] + install_from_pytorch_pip build_binary torch nightly cuda/12.6.3 2025-05-07T19:46:40.1422968Z ################################################################################ 2025-05-07T19:46:40.1423215Z 2025-05-07T19:46:40.1442094Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:46:40.2442019Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:46:40.2442483Z ################################################################################ 2025-05-07T19:46:40.2442931Z # Prepare PIP Arguments (PyTorch PIP) 2025-05-07T19:46:40.2443278Z # 2025-05-07T19:46:40.2462587Z # [2025-05-07T19:46:40.245Z] + __prepare_pip_arguments torch nightly cuda/12.6.3 2025-05-07T19:46:40.2463126Z ################################################################################ 2025-05-07T19:46:40.2463371Z 2025-05-07T19:46:40.2486054Z [INSTALL] Extracted package (channel, version): (nightly, LATEST) 2025-05-07T19:46:40.2512400Z [INSTALL] Extracted package variant: cu126 2025-05-07T19:46:40.2526721Z [INSTALL] Using a non-RELEASE channel: nightly ... 
2025-05-07T19:46:40.2527415Z [INSTALL] Extracted the full PIP channel: https://download.pytorch.org/whl/nightly/cu126/ 2025-05-07T19:46:40.2539981Z [INSTALL] Extracted the full PIP package: --pre torch 2025-05-07T19:46:40.2547549Z [INSTALL] Attempting to install [torch, LATEST] from PyTorch PIP using channel https://download.pytorch.org/whl/nightly/cu126/ ... 2025-05-07T19:46:40.2572800Z [EXEC] [ATTEMPT 0/3] + conda run -n build_binary pip install --pre torch --index-url https://download.pytorch.org/whl/nightly/cu126/ 2025-05-07T19:48:13.9031287Z WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager, possibly rendering your system unusable. It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv. Use the --root-user-action option if you know what you are doing and want to suppress this warning. 2025-05-07T19:48:13.9032894Z 2025-05-07T19:48:13.9033232Z Looking in indexes: https://download.pytorch.org/whl/nightly/cu126/ 2025-05-07T19:48:13.9033679Z Collecting torch 2025-05-07T19:48:13.9034444Z Downloading https://download.pytorch.org/whl/nightly/cu126/torch-2.8.0.dev20250507%2Bcu126-cp39-cp39-manylinux_2_28_x86_64.whl.metadata (30 kB) 2025-05-07T19:48:13.9035281Z Collecting filelock (from torch) 2025-05-07T19:48:13.9035891Z Downloading https://download.pytorch.org/whl/nightly/filelock-3.16.1-py3-none-any.whl (16 kB) 2025-05-07T19:48:13.9037357Z Requirement already satisfied: typing-extensions>=4.10.0 in /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages (from torch) (4.13.2) 2025-05-07T19:48:13.9038171Z Collecting sympy>=1.13.3 (from torch) 2025-05-07T19:48:13.9038764Z Downloading https://download.pytorch.org/whl/nightly/sympy-1.13.3-py3-none-any.whl (6.2 MB) 2025-05-07T19:48:13.9039773Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 6.2/6.2 MB 24.7 MB/s eta 0:00:00 2025-05-07T19:48:13.9040168Z Collecting networkx (from torch) 
2025-05-07T19:48:13.9040761Z Downloading https://download.pytorch.org/whl/nightly/networkx-3.2.1-py3-none-any.whl (1.6 MB) 2025-05-07T19:48:13.9041490Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 1.6/1.6 MB 10.4 MB/s eta 0:00:00 2025-05-07T19:48:13.9042289Z Requirement already satisfied: jinja2 in /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages (from torch) (3.1.6) 2025-05-07T19:48:13.9043030Z Collecting fsspec (from torch) 2025-05-07T19:48:13.9043621Z Downloading https://download.pytorch.org/whl/nightly/fsspec-2024.10.0-py3-none-any.whl (179 kB) 2025-05-07T19:48:13.9044289Z Collecting nvidia-cuda-nvrtc-cu12==12.6.77 (from torch) 2025-05-07T19:48:13.9045094Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cuda_nvrtc_cu12-12.6.77-py3-none-manylinux2014_x86_64.whl (23.7 MB) 2025-05-07T19:48:13.9046010Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 23.7/23.7 MB 62.9 MB/s eta 0:00:00 2025-05-07T19:48:13.9046465Z Collecting nvidia-cuda-runtime-cu12==12.6.77 (from torch) 2025-05-07T19:48:13.9047423Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cuda_runtime_cu12-12.6.77-py3-none-manylinux2014_x86_64.whl (897 kB) 2025-05-07T19:48:13.9048318Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 897.7/897.7 kB 7.4 MB/s eta 0:00:00 2025-05-07T19:48:13.9048746Z Collecting nvidia-cuda-cupti-cu12==12.6.80 (from torch) 2025-05-07T19:48:13.9049554Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cuda_cupti_cu12-12.6.80-py3-none-manylinux2014_x86_64.whl (8.9 MB) 2025-05-07T19:48:13.9050416Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 8.9/8.9 MB 41.2 MB/s eta 0:00:00 2025-05-07T19:48:13.9050857Z Collecting nvidia-cudnn-cu12==9.5.1.17 (from torch) 2025-05-07T19:48:13.9051698Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cudnn_cu12-9.5.1.17-py3-none-manylinux_2_28_x86_64.whl (571.0 MB) 2025-05-07T19:48:13.9052539Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 571.0/571.0 MB 49.1 MB/s eta 0:00:00 
2025-05-07T19:48:13.9052962Z Collecting nvidia-cublas-cu12==12.6.4.1 (from torch) 2025-05-07T19:48:13.9053773Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cublas_cu12-12.6.4.1-py3-none-manylinux2014_x86_64.manylinux_2_17_x86_64.whl (393.1 MB) 2025-05-07T19:48:13.9054697Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 393.1/393.1 MB 47.5 MB/s eta 0:00:00 2025-05-07T19:48:13.9055084Z Collecting nvidia-cufft-cu12==11.3.0.4 (from torch) 2025-05-07T19:48:13.9056025Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cufft_cu12-11.3.0.4-py3-none-manylinux2014_x86_64.whl (200.2 MB) 2025-05-07T19:48:13.9056862Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 200.2/200.2 MB 67.2 MB/s eta 0:00:00 2025-05-07T19:48:13.9057260Z Collecting nvidia-curand-cu12==10.3.7.77 (from torch) 2025-05-07T19:48:13.9058004Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_curand_cu12-10.3.7.77-py3-none-manylinux2014_x86_64.whl (56.3 MB) 2025-05-07T19:48:13.9058802Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 56.3/56.3 MB 71.8 MB/s eta 0:00:00 2025-05-07T19:48:13.9059236Z Collecting nvidia-cusolver-cu12==11.7.1.2 (from torch) 2025-05-07T19:48:13.9060254Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cusolver_cu12-11.7.1.2-py3-none-manylinux2014_x86_64.whl (158.2 MB) 2025-05-07T19:48:13.9061146Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 158.2/158.2 MB 70.6 MB/s eta 0:00:00 2025-05-07T19:48:13.9061620Z Collecting nvidia-cusparse-cu12==12.5.4.2 (from torch) 2025-05-07T19:48:13.9062508Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cusparse_cu12-12.5.4.2-py3-none-manylinux2014_x86_64.whl (216.6 MB) 2025-05-07T19:48:13.9063426Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 216.6/216.6 MB 70.3 MB/s eta 0:00:00 2025-05-07T19:48:13.9063863Z Collecting nvidia-cusparselt-cu12==0.6.3 (from torch) 2025-05-07T19:48:13.9064690Z Downloading 
https://download.pytorch.org/whl/nightly/cu126/nvidia_cusparselt_cu12-0.6.3-py3-none-manylinux2014_x86_64.whl (156.8 MB) 2025-05-07T19:48:13.9065606Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 156.8/156.8 MB 73.4 MB/s eta 0:00:00 2025-05-07T19:48:13.9066018Z Collecting nvidia-nccl-cu12==2.26.2 (from torch) 2025-05-07T19:48:13.9066968Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_nccl_cu12-2.26.2-py3-none-manylinux2014_x86_64.manylinux_2_17_x86_64.whl.metadata (2.0 kB) 2025-05-07T19:48:13.9067775Z Collecting nvidia-nvtx-cu12==12.6.77 (from torch) 2025-05-07T19:48:13.9068507Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_nvtx_cu12-12.6.77-py3-none-manylinux2014_x86_64.whl (89 kB) 2025-05-07T19:48:13.9069237Z Collecting nvidia-nvjitlink-cu12==12.6.85 (from torch) 2025-05-07T19:48:13.9070054Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_nvjitlink_cu12-12.6.85-py3-none-manylinux2010_x86_64.manylinux_2_12_x86_64.whl (19.7 MB) 2025-05-07T19:48:13.9070990Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 19.7/19.7 MB 46.1 MB/s eta 0:00:00 2025-05-07T19:48:13.9082692Z Collecting nvidia-cufile-cu12==1.11.1.6 (from torch) 2025-05-07T19:48:13.9083710Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cufile_cu12-1.11.1.6-py3-none-manylinux2014_x86_64.manylinux_2_17_x86_64.whl.metadata (1.5 kB) 2025-05-07T19:48:13.9084693Z Collecting pytorch-triton==3.3.0+git96316ce5 (from torch) 2025-05-07T19:48:13.9085660Z Downloading https://download.pytorch.org/whl/nightly/pytorch_triton-3.3.0%2Bgit96316ce5-cp39-cp39-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl.metadata (1.6 kB) 2025-05-07T19:48:13.9087163Z Requirement already satisfied: setuptools>=40.8.0 in /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages (from pytorch-triton==3.3.0+git96316ce5->torch) (78.1.1) 2025-05-07T19:48:13.9088176Z Collecting mpmath<1.4,>=1.1.0 (from sympy>=1.13.3->torch) 2025-05-07T19:48:13.9088884Z Downloading 
https://download.pytorch.org/whl/nightly/mpmath-1.3.0-py3-none-any.whl (536 kB) 2025-05-07T19:48:13.9089667Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 536.2/536.2 kB 2.1 MB/s eta 0:00:00 2025-05-07T19:48:13.9090487Z Requirement already satisfied: MarkupSafe>=2.0 in /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages (from jinja2->torch) (3.0.2) 2025-05-07T19:48:13.9091622Z Downloading https://download.pytorch.org/whl/nightly/cu126/torch-2.8.0.dev20250507%2Bcu126-cp39-cp39-manylinux_2_28_x86_64.whl (825.5 MB) 2025-05-07T19:48:13.9092663Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 825.5/825.5 MB 26.5 MB/s eta 0:00:00 2025-05-07T19:48:13.9093590Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cufile_cu12-1.11.1.6-py3-none-manylinux2014_x86_64.manylinux_2_17_x86_64.whl (1.1 MB) 2025-05-07T19:48:13.9094512Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 1.1/1.1 MB 7.0 MB/s eta 0:00:00 2025-05-07T19:48:13.9095340Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_nccl_cu12-2.26.2-py3-none-manylinux2014_x86_64.manylinux_2_17_x86_64.whl (201.3 MB) 2025-05-07T19:48:13.9096228Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 201.3/201.3 MB 69.2 MB/s eta 0:00:00 2025-05-07T19:48:13.9097086Z Downloading https://download.pytorch.org/whl/nightly/pytorch_triton-3.3.0%2Bgit96316ce5-cp39-cp39-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl (153.4 MB) 2025-05-07T19:48:13.9097993Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 153.4/153.4 MB 66.9 MB/s eta 0:00:00 2025-05-07T19:48:13.9099858Z Installing collected packages: nvidia-cusparselt-cu12, mpmath, sympy, pytorch-triton, nvidia-nvtx-cu12, nvidia-nvjitlink-cu12, nvidia-nccl-cu12, nvidia-curand-cu12, nvidia-cufile-cu12, nvidia-cuda-runtime-cu12, nvidia-cuda-nvrtc-cu12, nvidia-cuda-cupti-cu12, nvidia-cublas-cu12, networkx, fsspec, filelock, nvidia-cusparse-cu12, nvidia-cufft-cu12, nvidia-cudnn-cu12, nvidia-cusolver-cu12, torch 2025-05-07T19:48:13.9101804Z 2025-05-07T19:48:13.9103954Z Successfully 
installed filelock-3.16.1 fsspec-2024.10.0 mpmath-1.3.0 networkx-3.2.1 nvidia-cublas-cu12-12.6.4.1 nvidia-cuda-cupti-cu12-12.6.80 nvidia-cuda-nvrtc-cu12-12.6.77 nvidia-cuda-runtime-cu12-12.6.77 nvidia-cudnn-cu12-9.5.1.17 nvidia-cufft-cu12-11.3.0.4 nvidia-cufile-cu12-1.11.1.6 nvidia-curand-cu12-10.3.7.77 nvidia-cusolver-cu12-11.7.1.2 nvidia-cusparse-cu12-12.5.4.2 nvidia-cusparselt-cu12-0.6.3 nvidia-nccl-cu12-2.26.2 nvidia-nvjitlink-cu12-12.6.85 nvidia-nvtx-cu12-12.6.77 pytorch-triton-3.3.0+git96316ce5 sympy-1.13.3 torch-2.8.0.dev20250507+cu126 2025-05-07T19:48:13.9106318Z 2025-05-07T19:48:16.0710557Z torch 2.8.0.dev20250507+cu126 2025-05-07T19:48:16.0716029Z [CHECK] The installed package [torch, nightly/LATEST] is the correct variant (cu126) 2025-05-07T19:48:19.2692990Z [CHECK] Python (sub-)package 'torch.distributed' found ... 2025-05-07T19:48:22.5481103Z [CHECK] NOTE: The installed version is: 2.8.0.dev20250507+cu126 2025-05-07T19:48:22.5481681Z [CHECK] NOTE: Checking _GLIBCXX_USE_CXX11_ABI ... 2025-05-07T19:48:25.7102194Z True 2025-05-07T19:48:25.7102555Z True 2025-05-07T19:48:25.7102682Z 2025-05-07T19:48:25.7684891Z [INSTALL] Successfully installed PyTorch through PyTorch PIP 2025-05-07T19:48:25.7751348Z ##[group]Run if . $PRELUDE && which conda; then collect_pytorch_env_info $BUILD_ENV; fi 2025-05-07T19:48:25.7752086Z if . 
$PRELUDE && which conda; then collect_pytorch_env_info $BUILD_ENV; fi 2025-05-07T19:48:25.7752908Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:48:25.7753257Z env: 2025-05-07T19:48:25.7753541Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:48:25.7753876Z BUILD_ENV: build_binary 2025-05-07T19:48:25.7754199Z BUILD_TARGET: default 2025-05-07T19:48:25.7754453Z BUILD_VARIANT: cuda 2025-05-07T19:48:25.7754739Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:48:25.7755013Z ##[endgroup] 2025-05-07T19:48:26.2257940Z /github/home/miniconda/bin/conda 2025-05-07T19:48:26.2258915Z ################################################################################ 2025-05-07T19:48:26.2260501Z # Collect PyTorch Environment Information (for Reporting Issues) 2025-05-07T19:48:26.2261670Z # 2025-05-07T19:48:26.2274997Z # [2025-05-07T19:48:26.226Z] + collect_pytorch_env_info build_binary 2025-05-07T19:48:26.2276273Z ################################################################################ 2025-05-07T19:48:26.2276992Z 2025-05-07T19:48:26.2289796Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:48:26.3210656Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:48:26.3222017Z [INFO] Downloading the PyTorch environment info collection script ... 2025-05-07T19:48:26.3223078Z + wget -q https://raw.githubusercontent.com/pytorch/pytorch/main/torch/utils/collect_env.py 2025-05-07T19:48:26.3223528Z 2025-05-07T19:48:26.4064285Z 2025-05-07T19:48:26.4064989Z [INFO] Collecting PyTorch environment info (will be needed for reporting issues to PyTorch) ... 2025-05-07T19:48:26.4096989Z [EXEC] [ATTEMPT 0/3] + conda run -n build_binary python collect_env.py 2025-05-07T19:48:31.9949364Z Collecting environment information... 
2025-05-07T19:48:31.9949854Z PyTorch version: 2.8.0.dev20250507+cu126 2025-05-07T19:48:31.9950272Z Is debug build: False 2025-05-07T19:48:31.9950589Z CUDA used to build PyTorch: 12.6 2025-05-07T19:48:31.9950912Z ROCM used to build PyTorch: N/A 2025-05-07T19:48:31.9951125Z 2025-05-07T19:48:31.9951283Z OS: Amazon Linux 2023.7.20250428 (x86_64) 2025-05-07T19:48:31.9951661Z GCC version: (conda-forge gcc 11.4.0-13) 11.4.0 2025-05-07T19:48:31.9952076Z Clang version: Could not collect 2025-05-07T19:48:31.9952395Z CMake version: version 4.0.2 2025-05-07T19:48:31.9952730Z Libc version: glibc-2.34 2025-05-07T19:48:31.9952932Z 2025-05-07T19:48:31.9953291Z Python version: 3.9.22 | packaged by conda-forge | (main, Apr 14 2025, 23:35:59) [GCC 13.3.0] (64-bit runtime) 2025-05-07T19:48:31.9954020Z Python platform: Linux-6.1.130-139.222.amzn2023.x86_64-x86_64-with-glibc2.34 2025-05-07T19:48:31.9954518Z Is CUDA available: False 2025-05-07T19:48:31.9954803Z CUDA runtime version: 12.6.85 2025-05-07T19:48:31.9955139Z CUDA_MODULE_LOADING set to: N/A 2025-05-07T19:48:31.9955493Z GPU models and configuration: Could not collect 2025-05-07T19:48:31.9955905Z Nvidia driver version: Could not collect 2025-05-07T19:48:31.9956247Z cuDNN version: Could not collect 2025-05-07T19:48:31.9956581Z HIP runtime version: N/A 2025-05-07T19:48:31.9956864Z MIOpen runtime version: N/A 2025-05-07T19:48:31.9957186Z Is XNNPACK available: True 2025-05-07T19:48:31.9957368Z 2025-05-07T19:48:31.9957485Z CPU: 2025-05-07T19:48:31.9957729Z Architecture: x86_64 2025-05-07T19:48:31.9958124Z CPU op-mode(s): 32-bit, 64-bit 2025-05-07T19:48:31.9958566Z Address sizes: 46 bits physical, 48 bits virtual 2025-05-07T19:48:31.9959027Z Byte Order: Little Endian 2025-05-07T19:48:31.9959377Z CPU(s): 96 2025-05-07T19:48:31.9959735Z On-line CPU(s) list: 0-95 2025-05-07T19:48:31.9960088Z Vendor ID: GenuineIntel 2025-05-07T19:48:31.9960876Z Model name: Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:48:31.9961373Z CPU 
family: 6 2025-05-07T19:48:31.9961694Z Model: 85 2025-05-07T19:48:31.9962051Z Thread(s) per core: 2 2025-05-07T19:48:31.9962378Z Core(s) per socket: 24 2025-05-07T19:48:31.9962730Z Socket(s): 2 2025-05-07T19:48:31.9963042Z Stepping: 7 2025-05-07T19:48:31.9963400Z BogoMIPS: 5999.99 2025-05-07T19:48:31.9965999Z Flags: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:48:31.9968348Z Hypervisor vendor: KVM 2025-05-07T19:48:31.9968739Z Virtualization type: full 2025-05-07T19:48:31.9969091Z L1d cache: 1.5 MiB (48 instances) 2025-05-07T19:48:31.9969496Z L1i cache: 1.5 MiB (48 instances) 2025-05-07T19:48:31.9970010Z L2 cache: 48 MiB (48 instances) 2025-05-07T19:48:31.9970410Z L3 cache: 71.5 MiB (2 instances) 2025-05-07T19:48:31.9970743Z NUMA node(s): 2 2025-05-07T19:48:31.9971083Z NUMA node0 CPU(s): 0-23,48-71 2025-05-07T19:48:31.9971428Z NUMA node1 CPU(s): 24-47,72-95 2025-05-07T19:48:31.9971924Z Vulnerability Gather data sampling: Unknown: Dependent on hypervisor status 2025-05-07T19:48:31.9972516Z Vulnerability Itlb multihit: KVM: Mitigation: VMX unsupported 2025-05-07T19:48:31.9973019Z Vulnerability L1tf: Mitigation; PTE Inversion 2025-05-07T19:48:31.9973645Z Vulnerability Mds: Vulnerable: Clear CPU buffers attempted, no microcode; SMT Host state unknown 2025-05-07T19:48:31.9974228Z Vulnerability Meltdown: Mitigation; PTI 2025-05-07T19:48:31.9974866Z Vulnerability Mmio stale data: Vulnerable: Clear CPU buffers 
attempted, no microcode; SMT Host state unknown 2025-05-07T19:48:31.9975509Z Vulnerability Reg file data sampling: Not affected 2025-05-07T19:48:31.9975888Z Vulnerability Retbleed: Vulnerable 2025-05-07T19:48:31.9976306Z Vulnerability Spec rstack overflow: Not affected 2025-05-07T19:48:31.9976693Z Vulnerability Spec store bypass: Vulnerable 2025-05-07T19:48:31.9977285Z Vulnerability Spectre v1: Mitigation; usercopy/swapgs barriers and __user pointer sanitization 2025-05-07T19:48:31.9978119Z Vulnerability Spectre v2: Mitigation; Retpolines; STIBP disabled; RSB filling; PBRSB-eIBRS Not affected; BHI Retpoline 2025-05-07T19:48:31.9978795Z Vulnerability Srbds: Not affected 2025-05-07T19:48:31.9979201Z Vulnerability Tsx async abort: Not affected 2025-05-07T19:48:31.9979445Z 2025-05-07T19:48:31.9979560Z Versions of relevant libraries: 2025-05-07T19:48:31.9979955Z [pip3] numpy==2.0.2 2025-05-07T19:48:31.9980393Z [pip3] nvidia-cublas-cu12==12.6.4.1 2025-05-07T19:48:31.9980780Z [pip3] nvidia-cuda-cupti-cu12==12.6.80 2025-05-07T19:48:31.9981132Z [pip3] nvidia-cuda-nvrtc-cu12==12.6.77 2025-05-07T19:48:31.9981518Z [pip3] nvidia-cuda-runtime-cu12==12.6.77 2025-05-07T19:48:31.9981868Z [pip3] nvidia-cudnn-cu12==9.5.1.17 2025-05-07T19:48:31.9982398Z [pip3] nvidia-cufft-cu12==11.3.0.4 2025-05-07T19:48:31.9982760Z [pip3] nvidia-curand-cu12==10.3.7.77 2025-05-07T19:48:31.9983099Z [pip3] nvidia-cusolver-cu12==11.7.1.2 2025-05-07T19:48:31.9983478Z [pip3] nvidia-cusparse-cu12==12.5.4.2 2025-05-07T19:48:31.9983954Z [pip3] nvidia-cusparselt-cu12==0.6.3 2025-05-07T19:48:31.9984324Z [pip3] nvidia-nccl-cu12==2.26.2 2025-05-07T19:48:31.9984646Z [pip3] nvidia-nvjitlink-cu12==12.6.85 2025-05-07T19:48:31.9985013Z [pip3] nvidia-nvtx-cu12==12.6.77 2025-05-07T19:48:31.9985331Z [pip3] pytorch-triton==3.3.0+git96316ce5 2025-05-07T19:48:31.9985691Z [pip3] torch==2.8.0.dev20250507+cu126 2025-05-07T19:48:31.9986105Z [conda] cuda-cudart 12.6.77 h5888daf_0 conda-forge 2025-05-07T19:48:31.9986684Z [conda] 
cuda-cudart-dev 12.6.77 h5888daf_0 conda-forge 2025-05-07T19:48:31.9987289Z [conda] cuda-cudart-dev_linux-64 12.6.77 h3f2d84a_0 conda-forge 2025-05-07T19:48:31.9987871Z [conda] cuda-cudart-static 12.6.77 h5888daf_0 conda-forge 2025-05-07T19:48:31.9988491Z [conda] cuda-cudart-static_linux-64 12.6.77 h3f2d84a_0 conda-forge 2025-05-07T19:48:31.9989089Z [conda] cuda-cudart_linux-64 12.6.77 h3f2d84a_0 conda-forge 2025-05-07T19:48:31.9989650Z [conda] cuda-cupti 12.6.80 hbd13f7d_0 conda-forge 2025-05-07T19:48:31.9990200Z [conda] cuda-cupti-dev 12.6.80 h5888daf_0 conda-forge 2025-05-07T19:48:31.9990737Z [conda] cuda-libraries 12.6.3 ha770c72_0 conda-forge 2025-05-07T19:48:31.9991321Z [conda] cuda-libraries-dev 12.6.3 ha770c72_0 conda-forge 2025-05-07T19:48:31.9991994Z [conda] cuda-nvrtc 12.6.85 hbd13f7d_0 conda-forge 2025-05-07T19:48:31.9992520Z [conda] cuda-nvrtc-dev 12.6.85 h5888daf_0 conda-forge 2025-05-07T19:48:31.9993076Z [conda] cuda-nvtx 12.6.77 hbd13f7d_0 conda-forge 2025-05-07T19:48:31.9993601Z [conda] cuda-opencl 12.6.77 hbd13f7d_0 conda-forge 2025-05-07T19:48:31.9994260Z [conda] cuda-opencl-dev 12.6.77 h5888daf_0 conda-forge 2025-05-07T19:48:31.9994762Z [conda] cuda-runtime 12.6.3 ha804496_0 conda-forge 2025-05-07T19:48:31.9995280Z [conda] libcublas 12.6.4.1 h5888daf_1 conda-forge 2025-05-07T19:48:31.9995794Z [conda] libcublas-dev 12.6.4.1 h5888daf_1 conda-forge 2025-05-07T19:48:31.9996278Z [conda] libcufft 11.3.0.4 hbd13f7d_0 conda-forge 2025-05-07T19:48:31.9996789Z [conda] libcufft-dev 11.3.0.4 h5888daf_0 conda-forge 2025-05-07T19:48:31.9997276Z [conda] libcurand 10.3.7.77 hbd13f7d_0 conda-forge 2025-05-07T19:48:31.9997792Z [conda] libcurand-dev 10.3.7.77 h5888daf_0 conda-forge 2025-05-07T19:48:31.9998293Z [conda] libcusolver 11.7.1.2 h5888daf_1 conda-forge 2025-05-07T19:48:31.9998827Z [conda] libcusolver-dev 11.7.1.2 h5888daf_1 conda-forge 2025-05-07T19:48:31.9999364Z [conda] libcusparse 12.5.4.2 hbd13f7d_0 conda-forge 2025-05-07T19:48:31.9999866Z [conda] 
libcusparse-dev 12.5.4.2 h5888daf_0 conda-forge 2025-05-07T19:48:32.0000406Z [conda] libnvjitlink 12.6.85 hbd13f7d_0 conda-forge 2025-05-07T19:48:32.0000917Z [conda] libnvjitlink-dev 12.6.85 h5888daf_0 conda-forge 2025-05-07T19:48:32.0001430Z [conda] numpy 2.0.2 py39h9cb892a_1 conda-forge 2025-05-07T19:48:32.0001944Z [conda] nvidia-cublas-cu12 12.6.4.1 pypi_0 pypi 2025-05-07T19:48:32.0002460Z [conda] nvidia-cuda-cupti-cu12 12.6.80 pypi_0 pypi 2025-05-07T19:48:32.0003006Z [conda] nvidia-cuda-nvrtc-cu12 12.6.77 pypi_0 pypi 2025-05-07T19:48:32.0003523Z [conda] nvidia-cuda-runtime-cu12 12.6.77 pypi_0 pypi 2025-05-07T19:48:32.0004063Z [conda] nvidia-cudnn-cu12 9.5.1.17 pypi_0 pypi 2025-05-07T19:48:32.0004612Z [conda] nvidia-cufft-cu12 11.3.0.4 pypi_0 pypi 2025-05-07T19:48:32.0005138Z [conda] nvidia-curand-cu12 10.3.7.77 pypi_0 pypi 2025-05-07T19:48:32.0005668Z [conda] nvidia-cusolver-cu12 11.7.1.2 pypi_0 pypi 2025-05-07T19:48:32.0006176Z [conda] nvidia-cusparse-cu12 12.5.4.2 pypi_0 pypi 2025-05-07T19:48:32.0006721Z [conda] nvidia-cusparselt-cu12 0.6.3 pypi_0 pypi 2025-05-07T19:48:32.0007231Z [conda] nvidia-nccl-cu12 2.26.2 pypi_0 pypi 2025-05-07T19:48:32.0007761Z [conda] nvidia-nvjitlink-cu12 12.6.85 pypi_0 pypi 2025-05-07T19:48:32.0008264Z [conda] nvidia-nvtx-cu12 12.6.77 pypi_0 pypi 2025-05-07T19:48:32.0008791Z [conda] pytorch-triton 3.3.0+git96316ce5 pypi_0 pypi 2025-05-07T19:48:32.0009302Z [conda] torch 2.8.0.dev20250507+cu126 pypi_0 pypi 2025-05-07T19:48:32.0009588Z 2025-05-07T19:48:32.1039087Z ##[group]Run . $PRELUDE; install_cudnn $BUILD_ENV "$(pwd)/build_only/cudnn" 12.6.3 2025-05-07T19:48:32.1039795Z . 
$PRELUDE; install_cudnn $BUILD_ENV "$(pwd)/build_only/cudnn" 12.6.3 2025-05-07T19:48:32.1040598Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:48:32.1040990Z env: 2025-05-07T19:48:32.1041246Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:48:32.1041612Z BUILD_ENV: build_binary 2025-05-07T19:48:32.1042061Z BUILD_TARGET: default 2025-05-07T19:48:32.1042357Z BUILD_VARIANT: cuda 2025-05-07T19:48:32.1042649Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:48:32.1042920Z ##[endgroup] 2025-05-07T19:48:32.5622022Z ################################################################################ 2025-05-07T19:48:32.5622417Z # Install cuDNN 2025-05-07T19:48:32.5622675Z # 2025-05-07T19:48:32.5639375Z # [2025-05-07T19:48:32.563Z] + install_cudnn build_binary /__w/FBGEMM/FBGEMM/build_only/cudnn 12.6.3 2025-05-07T19:48:32.5640095Z ################################################################################ 2025-05-07T19:48:32.5640343Z 2025-05-07T19:48:32.5662489Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:48:32.6520249Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:48:32.6521503Z [INSTALL] cuda_concat_version is determined to be: 126 2025-05-07T19:48:32.6524748Z + rm -rf /__w/FBGEMM/FBGEMM/build_only/cudnn 2025-05-07T19:48:32.6525549Z 2025-05-07T19:48:32.6541773Z 2025-05-07T19:48:32.6542448Z + mkdir -p /__w/FBGEMM/FBGEMM/build_only/cudnn 2025-05-07T19:48:32.6543205Z 2025-05-07T19:48:32.6555402Z 2025-05-07T19:48:32.6574978Z [INSTALL] Downloading cuDNN to /tmp/tmp.eLEk0TiEKG ... 2025-05-07T19:48:32.6596592Z [EXEC] [ATTEMPT 0/3] + wget -q https://developer.download.nvidia.com/compute/cudnn/redist/cudnn/linux-x86_64/cudnn-linux-x86_64-9.5.1.17_cuda12-archive.tar.xz -O cudnn.tar.xz 2025-05-07T19:48:38.3486842Z [INSTALL] Unpacking cuDNN ... 
2025-05-07T19:48:38.3487397Z + tar -xvf cudnn.tar.xz 2025-05-07T19:48:38.3487585Z 2025-05-07T19:48:38.3520273Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/ 2025-05-07T19:48:38.3521445Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/ 2025-05-07T19:48:38.3522776Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_adv_static_v9.a 2025-05-07T19:48:43.0182256Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_cnn_static_v9.a 2025-05-07T19:48:43.0823250Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_precompiled_static_v9.a 2025-05-07T19:48:50.6313068Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_runtime_compiled_static_v9.a 2025-05-07T19:48:50.8742819Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_graph_static_v9.a 2025-05-07T19:48:50.9120365Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_heuristic_static_v9.a 2025-05-07T19:48:51.4466396Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_ops_static_v9.a 2025-05-07T19:48:53.5327955Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_adv_static.a 2025-05-07T19:48:53.5329646Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_cnn_static.a 2025-05-07T19:48:53.5331332Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_precompiled_static.a 2025-05-07T19:48:53.5333300Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_runtime_compiled_static.a 2025-05-07T19:48:53.5335173Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_graph_static.a 2025-05-07T19:48:53.5335750Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_heuristic_static.a 2025-05-07T19:48:53.5336331Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_ops_static.a 2025-05-07T19:48:53.5336834Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn.so 2025-05-07T19:48:53.5337332Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn.so.9 2025-05-07T19:48:53.5337849Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn.so.9.5.1 
2025-05-07T19:48:53.5338357Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_adv.so 2025-05-07T19:48:53.5339220Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_adv.so.9 2025-05-07T19:48:53.5339903Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_adv.so.9.5.1 2025-05-07T19:48:58.0534085Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_cnn.so 2025-05-07T19:48:58.0535663Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_cnn.so.9.5.1 2025-05-07T19:48:58.1152299Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_cnn.so.9 2025-05-07T19:48:58.1154548Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_precompiled.so.9.5.1 2025-05-07T19:49:05.3550826Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_precompiled.so.9 2025-05-07T19:49:05.3552698Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_precompiled.so 2025-05-07T19:49:05.3554471Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_runtime_compiled.so 2025-05-07T19:49:05.3556486Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_runtime_compiled.so.9.5.1 2025-05-07T19:49:05.5507604Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_runtime_compiled.so.9 2025-05-07T19:49:05.5508245Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_graph.so.9 2025-05-07T19:49:05.5508796Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_graph.so 2025-05-07T19:49:05.5509325Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_graph.so.9.5.1 2025-05-07T19:49:05.5867760Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_heuristic.so.9.5.1 2025-05-07T19:49:06.1294848Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_heuristic.so.9 2025-05-07T19:49:06.1295468Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_heuristic.so 2025-05-07T19:49:06.1296013Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_ops.so.9 2025-05-07T19:49:06.1296546Z 
cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_ops.so 2025-05-07T19:49:06.1297092Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_ops.so.9.5.1 2025-05-07T19:49:08.2597095Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/ 2025-05-07T19:49:08.2597717Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_v9.h 2025-05-07T19:49:08.2598246Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_adv_v9.h 2025-05-07T19:49:08.2598811Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_backend_v9.h 2025-05-07T19:49:08.2599340Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_cnn_v9.h 2025-05-07T19:49:08.2599899Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_graph_v9.h 2025-05-07T19:49:08.2600445Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_ops_v9.h 2025-05-07T19:49:08.2601695Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_version_v9.h 2025-05-07T19:49:08.2602291Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn.h 2025-05-07T19:49:08.2603024Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_adv.h 2025-05-07T19:49:08.2605676Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_backend.h 2025-05-07T19:49:08.2606232Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_cnn.h 2025-05-07T19:49:08.2606741Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_graph.h 2025-05-07T19:49:08.2607269Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_ops.h 2025-05-07T19:49:08.2607774Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_version.h 2025-05-07T19:49:08.2608262Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/LICENSE 2025-05-07T19:49:08.2618194Z 2025-05-07T19:49:08.2619155Z [INSTALL] Moving cuDNN files to /__w/FBGEMM/FBGEMM/build_only/cudnn ... 
2025-05-07T19:49:08.2620886Z + rm -rf /__w/FBGEMM/FBGEMM/build_only/cudnn/include 2025-05-07T19:49:08.2621657Z 2025-05-07T19:49:08.2637220Z 2025-05-07T19:49:08.2637640Z + rm -rf /__w/FBGEMM/FBGEMM/build_only/cudnn/lib 2025-05-07T19:49:08.2637928Z 2025-05-07T19:49:08.2656482Z 2025-05-07T19:49:08.2657856Z + mv cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include /__w/FBGEMM/FBGEMM/build_only/cudnn 2025-05-07T19:49:08.2658291Z 2025-05-07T19:49:08.2684009Z 2025-05-07T19:49:08.2686582Z + mv cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib /__w/FBGEMM/FBGEMM/build_only/cudnn 2025-05-07T19:49:08.2687058Z 2025-05-07T19:49:09.2205352Z 2025-05-07T19:49:09.2205849Z /__w/FBGEMM/FBGEMM 2025-05-07T19:49:09.2206341Z + rm -rf /tmp/tmp.eLEk0TiEKG 2025-05-07T19:49:09.2206549Z 2025-05-07T19:49:09.2707205Z 2025-05-07T19:49:09.2713028Z [INSTALL] Set environment variables CUDNN_INCLUDE_DIR and CUDNN_LIBRARY ... 2025-05-07T19:49:09.2714461Z + conda env config vars set -n build_binary CUDNN_INCLUDE_DIR=/__w/FBGEMM/FBGEMM/build_only/cudnn/include CUDNN_LIBRARY=/__w/FBGEMM/FBGEMM/build_only/cudnn/lib 2025-05-07T19:49:09.2715175Z 2025-05-07T19:49:09.6808516Z 2025-05-07T19:49:09.6809296Z [INSTALL] Successfully installed cuDNN (for CUDA 12.6.3) 2025-05-07T19:49:09.6881671Z ##[group]Run . $PRELUDE; cd fbgemm_gpu; prepare_fbgemm_gpu_build $BUILD_ENV 2025-05-07T19:49:09.6882673Z . 
$PRELUDE; cd fbgemm_gpu; prepare_fbgemm_gpu_build $BUILD_ENV 2025-05-07T19:49:09.6883381Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:49:09.6883742Z env: 2025-05-07T19:49:09.6883979Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:49:09.6884319Z BUILD_ENV: build_binary 2025-05-07T19:49:09.6884597Z BUILD_TARGET: default 2025-05-07T19:49:09.6884850Z BUILD_VARIANT: cuda 2025-05-07T19:49:09.6885113Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:49:09.6885376Z ##[endgroup] 2025-05-07T19:49:10.0754791Z ################################################################################ 2025-05-07T19:49:10.0755701Z # Prepare FBGEMM-GPU Build 2025-05-07T19:49:10.0756046Z # 2025-05-07T19:49:10.0769237Z # [2025-05-07T19:49:10.076Z] + prepare_fbgemm_gpu_build build_binary 2025-05-07T19:49:10.0770915Z ################################################################################ 2025-05-07T19:49:10.0771715Z 2025-05-07T19:49:10.0793581Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:49:10.1685555Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:49:10.1703821Z [BUILD] Running git submodules update ... 
2025-05-07T19:49:10.1730088Z [EXEC] [ATTEMPT 0/3] + git submodule sync 2025-05-07T19:49:10.2033067Z Synchronizing submodule url for '../external/asmjit' 2025-05-07T19:49:10.2034566Z Synchronizing submodule url for '../external/composable_kernel' 2025-05-07T19:49:10.2035950Z Synchronizing submodule url for '../external/cpuinfo' 2025-05-07T19:49:10.2037207Z Synchronizing submodule url for '../external/cutlass' 2025-05-07T19:49:10.2037908Z Synchronizing submodule url for '../external/googletest' 2025-05-07T19:49:10.2038394Z Synchronizing submodule url for '../external/hipify_torch' 2025-05-07T19:49:10.2038842Z Synchronizing submodule url for '../external/json' 2025-05-07T19:49:10.2066141Z [EXEC] [ATTEMPT 0/3] + git submodule update --init --recursive 2025-05-07T19:49:10.2521982Z [BUILD] Installing other build dependencies ... 2025-05-07T19:49:10.2542455Z [EXEC] [ATTEMPT 0/3] + conda run --no-capture-output -n build_binary python -m pip install -r requirements.txt 2025-05-07T19:49:12.3642619Z Collecting backports.tarfile (from -r requirements.txt (line 13)) 2025-05-07T19:49:12.3781085Z Downloading backports.tarfile-1.2.0-py3-none-any.whl.metadata (2.0 kB) 2025-05-07T19:49:12.3878447Z Requirement already satisfied: build in /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages (from -r requirements.txt (line 14)) (1.2.2.post1) 2025-05-07T19:49:12.5300147Z Collecting cmake (from -r requirements.txt (line 15)) 2025-05-07T19:49:12.5339056Z Downloading cmake-4.0.0-py3-none-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (6.3 kB) 2025-05-07T19:49:12.5419336Z Requirement already satisfied: click in /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages (from -r requirements.txt (line 16)) (8.1.8) 2025-05-07T19:49:12.5423622Z Requirement already satisfied: hypothesis in /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages (from -r requirements.txt (line 17)) (6.131.14) 2025-05-07T19:49:12.5426947Z Requirement already 
satisfied: jinja2 in /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages (from -r requirements.txt (line 18)) (3.1.6) 2025-05-07T19:49:12.5428240Z Requirement already satisfied: mpmath==1.3.0 in /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages (from -r requirements.txt (line 19)) (1.3.0) 2025-05-07T19:49:12.5752093Z Collecting ninja (from -r requirements.txt (line 20)) 2025-05-07T19:49:12.5787782Z Downloading ninja-1.11.1.4-py3-none-manylinux_2_12_x86_64.manylinux2010_x86_64.whl.metadata (5.0 kB) 2025-05-07T19:49:12.5865117Z Requirement already satisfied: numpy>=2.0.2 in /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages (from -r requirements.txt (line 21)) (2.0.2) 2025-05-07T19:49:12.6030905Z Collecting pyre-extensions (from -r requirements.txt (line 22)) 2025-05-07T19:49:12.6063608Z Downloading pyre_extensions-0.0.32-py3-none-any.whl.metadata (4.0 kB) 2025-05-07T19:49:12.6131862Z Requirement already satisfied: pyyaml in /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages (from -r requirements.txt (line 23)) (6.0.2) 2025-05-07T19:49:12.6135920Z Requirement already satisfied: scikit-build in /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages (from -r requirements.txt (line 24)) (0.18.1) 2025-05-07T19:49:12.6138063Z Requirement already satisfied: setuptools in /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages (from -r requirements.txt (line 25)) (78.1.1) 2025-05-07T19:49:12.6383828Z Collecting setuptools_git_versioning (from -r requirements.txt (line 26)) 2025-05-07T19:49:12.6436914Z Downloading setuptools_git_versioning-2.1.0-py3-none-any.whl.metadata (6.1 kB) 2025-05-07T19:49:12.6626487Z Collecting tabulate (from -r requirements.txt (line 27)) 2025-05-07T19:49:12.6679760Z Downloading tabulate-0.9.0-py3-none-any.whl.metadata (34 kB) 2025-05-07T19:49:12.6928049Z Collecting patchelf (from -r requirements.txt (line 28)) 2025-05-07T19:49:12.6966435Z Downloading 
patchelf-0.17.2.2-py3-none-manylinux1_x86_64.manylinux_2_5_x86_64.musllinux_1_1_x86_64.whl.metadata (3.5 kB) 2025-05-07T19:49:12.7125685Z Requirement already satisfied: packaging>=19.1 in /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages (from build->-r requirements.txt (line 14)) (25.0) 2025-05-07T19:49:12.7129987Z Requirement already satisfied: pyproject_hooks in /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages (from build->-r requirements.txt (line 14)) (1.2.0) 2025-05-07T19:49:12.7134406Z Requirement already satisfied: importlib-metadata>=4.6 in /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages (from build->-r requirements.txt (line 14)) (8.7.0) 2025-05-07T19:49:12.7139099Z Requirement already satisfied: tomli>=1.1.0 in /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages (from build->-r requirements.txt (line 14)) (2.2.1) 2025-05-07T19:49:12.7263423Z Requirement already satisfied: attrs>=22.2.0 in /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages (from hypothesis->-r requirements.txt (line 17)) (25.3.0) 2025-05-07T19:49:12.7267619Z Requirement already satisfied: exceptiongroup>=1.0.0 in /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages (from hypothesis->-r requirements.txt (line 17)) (1.2.2) 2025-05-07T19:49:12.7270986Z Requirement already satisfied: sortedcontainers<3.0.0,>=2.1.0 in /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages (from hypothesis->-r requirements.txt (line 17)) (2.4.0) 2025-05-07T19:49:12.7292495Z Requirement already satisfied: MarkupSafe>=2.0 in /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages (from jinja2->-r requirements.txt (line 18)) (3.0.2) 2025-05-07T19:49:12.7422570Z Collecting typing-inspect (from pyre-extensions->-r requirements.txt (line 22)) 2025-05-07T19:49:12.7460764Z Downloading typing_inspect-0.9.0-py3-none-any.whl.metadata (1.5 kB) 2025-05-07T19:49:12.7532581Z Requirement 
already satisfied: typing-extensions in /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages (from pyre-extensions->-r requirements.txt (line 22)) (4.13.2) 2025-05-07T19:49:12.7579277Z Requirement already satisfied: distro in /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages (from scikit-build->-r requirements.txt (line 24)) (1.9.0) 2025-05-07T19:49:12.7585162Z Requirement already satisfied: wheel>=0.32.0 in /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages (from scikit-build->-r requirements.txt (line 24)) (0.45.1) 2025-05-07T19:49:12.7802963Z Requirement already satisfied: zipp>=3.20 in /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages (from importlib-metadata>=4.6->build->-r requirements.txt (line 14)) (3.21.0) 2025-05-07T19:49:12.7989524Z Collecting mypy-extensions>=0.3.0 (from typing-inspect->pyre-extensions->-r requirements.txt (line 22)) 2025-05-07T19:49:12.8022526Z Downloading mypy_extensions-1.1.0-py3-none-any.whl.metadata (1.1 kB) 2025-05-07T19:49:12.8134586Z Downloading backports.tarfile-1.2.0-py3-none-any.whl (30 kB) 2025-05-07T19:49:12.8235326Z Downloading cmake-4.0.0-py3-none-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (27.9 MB) 2025-05-07T19:49:12.9464940Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 27.9/27.9 MB 233.2 MB/s eta 0:00:00 2025-05-07T19:49:12.9519396Z Downloading ninja-1.11.1.4-py3-none-manylinux_2_12_x86_64.manylinux2010_x86_64.whl (422 kB) 2025-05-07T19:49:12.9623189Z Downloading pyre_extensions-0.0.32-py3-none-any.whl (12 kB) 2025-05-07T19:49:12.9712802Z Downloading setuptools_git_versioning-2.1.0-py3-none-any.whl (10 kB) 2025-05-07T19:49:12.9783564Z Downloading tabulate-0.9.0-py3-none-any.whl (35 kB) 2025-05-07T19:49:12.9862879Z Downloading patchelf-0.17.2.2-py3-none-manylinux1_x86_64.manylinux_2_5_x86_64.musllinux_1_1_x86_64.whl (466 kB) 2025-05-07T19:49:12.9961788Z Downloading typing_inspect-0.9.0-py3-none-any.whl (8.8 kB) 2025-05-07T19:49:13.0022614Z 
Downloading mypy_extensions-1.1.0-py3-none-any.whl (5.0 kB) 2025-05-07T19:49:13.1848840Z Installing collected packages: tabulate, setuptools_git_versioning, patchelf, ninja, mypy-extensions, cmake, backports.tarfile, typing-inspect, pyre-extensions 2025-05-07T19:49:14.1495471Z 2025-05-07T19:49:14.1567423Z WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager, possibly rendering your system unusable. It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv. Use the --root-user-action option if you know what you are doing and want to suppress this warning. 2025-05-07T19:49:14.1569751Z Successfully installed backports.tarfile-1.2.0 cmake-4.0.0 mypy-extensions-1.1.0 ninja-1.11.1.4 patchelf-0.17.2.2 pyre-extensions-0.0.32 setuptools_git_versioning-2.1.0 tabulate-0.9.0 typing-inspect-0.9.0 2025-05-07T19:49:14.3103217Z ################################################################################ 2025-05-07T19:49:14.3103729Z # Install PyTorch (PyTorch PIP) 2025-05-07T19:49:14.3104025Z # 2025-05-07T19:49:14.3121920Z # [2025-05-07T19:49:14.311Z] + install_triton_pip build_binary 2025-05-07T19:49:14.3122820Z ################################################################################ 2025-05-07T19:49:14.3123140Z 2025-05-07T19:49:14.3123429Z [BUILD] Installing pytorch-triton nightly/3.2.0+git4b3bb1f8 from PIP ... 
2025-05-07T19:49:14.3123919Z ################################################################################ 2025-05-07T19:49:14.3124350Z # Install Package From PyTorch PIP: pytorch-triton 2025-05-07T19:49:14.3124762Z # 2025-05-07T19:49:14.3141513Z # [2025-05-07T19:49:14.313Z] + install_from_pytorch_pip build_binary pytorch-triton nightly/3.2.0+git4b3bb1f8 2025-05-07T19:49:14.3142175Z ################################################################################ 2025-05-07T19:49:14.3142452Z 2025-05-07T19:49:14.3158673Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:49:14.4002598Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:49:14.4003125Z ################################################################################ 2025-05-07T19:49:14.4003550Z # Prepare PIP Arguments (PyTorch PIP) 2025-05-07T19:49:14.4003870Z # 2025-05-07T19:49:14.4027563Z # [2025-05-07T19:49:14.402Z] + __prepare_pip_arguments pytorch-triton nightly/3.2.0+git4b3bb1f8 2025-05-07T19:49:14.4029180Z ################################################################################ 2025-05-07T19:49:14.4029874Z 2025-05-07T19:49:14.4083452Z [INSTALL] Extracted package (channel, version): (nightly, 3.2.0+git4b3bb1f8) 2025-05-07T19:49:14.4099010Z [INSTALL] Using a non-RELEASE channel: nightly ... 2025-05-07T19:49:14.4100961Z [INSTALL] Extracted the full PIP channel: https://download.pytorch.org/whl/nightly/ 2025-05-07T19:49:14.4104297Z [INSTALL] Extracted the full PIP package: --pre pytorch-triton==3.2.0+git4b3bb1f8 2025-05-07T19:49:14.4114121Z [INSTALL] Attempting to install [pytorch-triton, 3.2.0+git4b3bb1f8] from PyTorch PIP using channel https://download.pytorch.org/whl/nightly/ ... 
2025-05-07T19:49:14.4142470Z [EXEC] [ATTEMPT 0/3] + conda run -n build_binary pip install --pre pytorch-triton==3.2.0+git4b3bb1f8 --index-url https://download.pytorch.org/whl/nightly/ 2025-05-07T19:49:19.9023014Z ERROR: pip's dependency resolver does not currently take into account all the packages that are installed. This behaviour is the source of the following dependency conflicts. 2025-05-07T19:49:19.9024626Z torch 2.8.0.dev20250507+cu126 requires pytorch-triton==3.3.0+git96316ce5; platform_system == "Linux" and platform_machine == "x86_64", but you have pytorch-triton 3.2.0+git4b3bb1f8 which is incompatible. 2025-05-07T19:49:19.9026972Z WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager, possibly rendering your system unusable. It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv. Use the --root-user-action option if you know what you are doing and want to suppress this warning. 
2025-05-07T19:49:19.9028531Z 2025-05-07T19:49:19.9028788Z Looking in indexes: https://download.pytorch.org/whl/nightly/ 2025-05-07T19:49:19.9029280Z Collecting pytorch-triton==3.2.0+git4b3bb1f8 2025-05-07T19:49:19.9030175Z Downloading https://download.pytorch.org/whl/nightly/pytorch_triton-3.2.0%2Bgit4b3bb1f8-cp39-cp39-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl.metadata (1.3 kB) 2025-05-07T19:49:19.9031589Z Downloading https://download.pytorch.org/whl/nightly/pytorch_triton-3.2.0%2Bgit4b3bb1f8-cp39-cp39-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl (166.4 MB) 2025-05-07T19:49:19.9032880Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 166.4/166.4 MB 172.5 MB/s eta 0:00:00 2025-05-07T19:49:19.9033417Z Installing collected packages: pytorch-triton 2025-05-07T19:49:19.9033847Z Attempting uninstall: pytorch-triton 2025-05-07T19:49:19.9034279Z Found existing installation: pytorch-triton 3.3.0+git96316ce5 2025-05-07T19:49:19.9034778Z Uninstalling pytorch-triton-3.3.0+git96316ce5: 2025-05-07T19:49:19.9035248Z Successfully uninstalled pytorch-triton-3.3.0+git96316ce5 2025-05-07T19:49:19.9035829Z Successfully installed pytorch-triton-3.2.0+git4b3bb1f8 2025-05-07T19:49:19.9036101Z 2025-05-07T19:49:22.0221738Z [CHECK] Python (sub-)package 'triton' found ... 2025-05-07T19:49:22.0222310Z [CHECK] Printing out the pytorch-triton version ... 2025-05-07T19:49:24.0725853Z ################################################################################ 2025-05-07T19:49:24.0726587Z [CHECK] The installed VERSION of pytorch-triton is: 3.2.0 2025-05-07T19:49:24.0727006Z ################################################################################ 2025-05-07T19:49:24.0727257Z 2025-05-07T19:49:26.0159596Z [CHECK] Python (sub-)package 'numpy' found ... 2025-05-07T19:49:28.0705996Z [CHECK] Python (sub-)package 'skbuild' found ... 2025-05-07T19:49:28.0706690Z [BUILD] Successfully ran git submodules update 2025-05-07T19:49:28.0804946Z ##[group]Run . 
$PRELUDE; cd fbgemm_gpu; build_fbgemm_gpu_package $BUILD_ENV nightly default/cuda 2025-05-07T19:49:28.0805670Z . $PRELUDE; cd fbgemm_gpu; build_fbgemm_gpu_package $BUILD_ENV nightly default/cuda 2025-05-07T19:49:28.0806355Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:49:28.0806697Z env: 2025-05-07T19:49:28.0806975Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:49:28.0807301Z BUILD_ENV: build_binary 2025-05-07T19:49:28.0807601Z BUILD_TARGET: default 2025-05-07T19:49:28.0807888Z BUILD_VARIANT: cuda 2025-05-07T19:49:28.0808326Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:49:28.0808625Z ##[endgroup] 2025-05-07T19:49:28.5154694Z [BUILD] BUILD_TARGET_VARIANT: default/cuda 2025-05-07T19:49:28.5155194Z [BUILD] Extracted build target: default 2025-05-07T19:49:28.5155553Z [BUILD] Extracted build variant: cuda 2025-05-07T19:49:30.3132464Z /github/home/miniconda/envs/build_binary/bin/cc 2025-05-07T19:49:30.3132777Z 2025-05-07T19:49:30.3759943Z [CHECK] Binary cc found in PATH 2025-05-07T19:49:32.1766313Z /github/home/miniconda/envs/build_binary/bin/gcc 2025-05-07T19:49:32.1767203Z 2025-05-07T19:49:32.2541650Z [CHECK] Binary gcc found in PATH 2025-05-07T19:49:34.0484776Z /github/home/miniconda/envs/build_binary/bin/c++ 2025-05-07T19:49:34.0485112Z 2025-05-07T19:49:34.1075091Z [CHECK] Binary c++ found in PATH 2025-05-07T19:49:35.9169546Z /github/home/miniconda/envs/build_binary/bin/g++ 2025-05-07T19:49:35.9169934Z 2025-05-07T19:49:35.9920282Z [CHECK] Binary g++ found in PATH 2025-05-07T19:49:37.8592535Z [BUILD] Extracted and set Python tag: py39 2025-05-07T19:49:37.8593098Z [BUILD] Extracted and set Python platform name: manylinux_2_28_x86_64 2025-05-07T19:49:37.8822483Z core = 24 2025-05-07T19:49:37.9063899Z sockets = 2 2025-05-07T19:49:37.9064295Z [BUILD] Set multicore run option for setup.py: -j 48 2025-05-07T19:49:37.9064715Z [CHECK] LD_LIBRARY_PATH = 2025-05-07T19:49:37.9065016Z [BUILD] Running pre-build cleanups ... 
2025-05-07T19:49:37.9065387Z + rm -rf dist 2025-05-07T19:49:37.9065522Z 2025-05-07T19:49:37.9084092Z 2025-05-07T19:49:37.9085409Z + conda run --no-capture-output -n build_binary python setup.py clean 2025-05-07T19:49:37.9086426Z 2025-05-07T19:49:41.1593192Z INFO:root:running clean 2025-05-07T19:49:41.1595559Z [SETUP.PY] ARGV: ['setup.py', 'clean'] 2025-05-07T19:49:41.1596721Z [SETUP.PY] Parsed setup.py arguments: Namespace(verbose=False, debug=False, dryrun=False, build_target='default', build_variant='cuda', package_channel='nightly', nvml_lib_path=None, nccl_lib_path=None, use_fb_only=False, cxxprefix=None) 2025-05-07T19:49:41.1597938Z [SETUP.PY] Other arguments: ['clean'] 2025-05-07T19:49:41.1598451Z [SETUP.PY] CUDA CUB directory environment variable not set. Using default CUB location. 2025-05-07T19:49:41.1612598Z [SETUP.PY] Using CUDA = /github/home/miniconda/envs/build_binary 2025-05-07T19:49:41.1613341Z [SETUP.PY] Generating version file at: /__w/FBGEMM/FBGEMM/fbgemm_gpu/fbgemm_gpu/docs/version.py 2025-05-07T19:49:41.1613929Z [SETUP.PY] Setting the FBGEMM build target: default ... 2025-05-07T19:49:41.1614379Z [SETUP.PY] Setting the FBGEMM build variant: cuda ... 2025-05-07T19:49:41.1615828Z [SETUP.PY] Passing CMake arguments: ['-DCMAKE_PREFIX_PATH=/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch', '-D_GLIBCXX_USE_CXX11_ABI=1', '-DFBGEMM_BUILD_TARGET=default', '-DFBGEMM_BUILD_VARIANT=cuda', "-DCMAKE_C_FLAGS=''", "-DCMAKE_CXX_FLAGS=''"] 2025-05-07T19:49:41.5143725Z 2025-05-07T19:49:41.5145187Z [BUILD] Printing git status ... 2025-05-07T19:49:41.5146166Z + git status 2025-05-07T19:49:41.5146766Z 2025-05-07T19:49:42.1367108Z HEAD detached at pull/4066/merge 2025-05-07T19:49:42.1368024Z Untracked files: 2025-05-07T19:49:42.1368949Z (use "git add <file>..." 
to include in what will be committed) 2025-05-07T19:49:42.1370001Z ../build_only/ 2025-05-07T19:49:42.1370647Z ../collect_env.py 2025-05-07T19:49:42.1371334Z fbgemm_gpu/docs/version.py 2025-05-07T19:49:42.1371876Z 2025-05-07T19:49:42.1373157Z nothing added to commit but untracked files present (use "git add" to track) 2025-05-07T19:49:42.1374217Z 2025-05-07T19:49:42.1374483Z + git diff 2025-05-07T19:49:42.1374821Z 2025-05-07T19:49:42.1652463Z 2025-05-07T19:49:42.1653105Z ################################################################################ 2025-05-07T19:49:42.1654212Z # Configure FBGEMM-GPU Build 2025-05-07T19:49:42.1654981Z # 2025-05-07T19:49:42.1677453Z # [2025-05-07T19:49:42.166Z] + __configure_fbgemm_gpu_build 2025-05-07T19:49:42.1678003Z ################################################################################ 2025-05-07T19:49:42.1678769Z 2025-05-07T19:49:42.1683189Z [BUILD] Setting the build target: default ... 2025-05-07T19:49:42.1684598Z [BUILD] Configuring build as CUDA variant (this is the default behavior) ... 
2025-05-07T19:49:44.0085524Z /github/home/miniconda/envs/build_binary/bin/nvcc 2025-05-07T19:49:44.0085879Z 2025-05-07T19:49:44.0846685Z [CHECK] Binary nvcc found in PATH 2025-05-07T19:49:45.9225201Z /__w/FBGEMM/FBGEMM/build_only/cudnn/include 2025-05-07T19:49:45.9225970Z 2025-05-07T19:49:45.9822673Z [CHECK] Environment variable CUDNN_INCLUDE_DIR is defined in the Conda environment 2025-05-07T19:49:47.8036659Z /__w/FBGEMM/FBGEMM/build_only/cudnn/lib 2025-05-07T19:49:47.8036981Z 2025-05-07T19:49:47.8770480Z [CHECK] Environment variable CUDNN_LIBRARY is defined in the Conda environment 2025-05-07T19:49:49.7053512Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:49:49.7053959Z 2025-05-07T19:49:49.7816550Z [CHECK] Environment variable NVML_LIB_PATH is defined in the Conda environment 2025-05-07T19:49:51.6938412Z [BUILD] Using the default architectures for CUDA nvcc: NVIDIA (R) Cuda compiler driver 2025-05-07T19:49:51.6939043Z Copyright (c) 2005-2024 NVIDIA Corporation 2025-05-07T19:49:51.6939463Z Built on Tue_Oct_29_23:50:19_PDT_2024 2025-05-07T19:49:51.6940065Z Cuda compilation tools, release 12.6, V12.6.85 2025-05-07T19:49:51.6940542Z Build cuda_12.6.r12.6/compiler.35059454_0 ... 2025-05-07T19:49:51.6940980Z [BUILD] Setting the following CUDA targets: 7.0;8.0;9.0;9.0a 2025-05-07T19:49:51.6941428Z [BUILD] Looking up NVML filepath ... 2025-05-07T19:49:53.6013454Z [BUILD] Looking up NCCL filepath ... 2025-05-07T19:49:57.3867080Z [BUILD] Setting NVCC verbose mode ... 2025-05-07T19:49:57.3867653Z + conda env config vars set -n build_binary NVCC_VERBOSE=1 2025-05-07T19:49:57.3867963Z 2025-05-07T19:49:57.7986200Z 2025-05-07T19:49:57.7986715Z [BUILD] Setting CUDA build args ... 2025-05-07T19:49:59.6304709Z [BUILD] Looking up CUDA version ... 2025-05-07T19:50:03.3195358Z + conda run -n build_binary c++ --version | grep -i clang 2025-05-07T19:50:03.3196275Z 2025-05-07T19:50:05.1658776Z 2025-05-07T19:50:05.1659272Z [BUILD] Setting NVCC flags ... 
2025-05-07T19:50:05.1660427Z + conda env config vars set -n build_binary NVCC_PREPEND_FLAGS="-std=c++20 -Xcompiler -std=c++20 -ccbin /github/home/miniconda/envs/build_binary/bin/c++ -allow-unsupported-compiler" 2025-05-07T19:50:05.1661252Z 2025-05-07T19:50:05.5756704Z 2025-05-07T19:50:05.5757644Z + conda run -n build_binary printenv NVCC_PREPEND_FLAGS 2025-05-07T19:50:05.5758521Z 2025-05-07T19:50:07.3708571Z -std=c++20 -Xcompiler -std=c++20 -ccbin /github/home/miniconda/envs/build_binary/bin/c++ -allow-unsupported-compiler 2025-05-07T19:50:07.3710193Z 2025-05-07T19:50:07.4300825Z 2025-05-07T19:50:07.4301549Z [BUILD] Setting CUDA build args ... 2025-05-07T19:50:07.4302532Z + conda run -n build_binary c++ --version 2025-05-07T19:50:07.4303197Z 2025-05-07T19:50:09.2289006Z c++ (conda-forge gcc 11.4.0-13) 11.4.0 2025-05-07T19:50:09.2289552Z Copyright (C) 2021 Free Software Foundation, Inc. 2025-05-07T19:50:09.2290082Z This is free software; see the source for copying conditions. There is NO 2025-05-07T19:50:09.2290836Z warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. 2025-05-07T19:50:09.2291197Z 2025-05-07T19:50:09.2291201Z 2025-05-07T19:50:09.2911497Z 2025-05-07T19:50:09.2912830Z + conda run -n build_binary c++ --version | grep -i clang 2025-05-07T19:50:09.2913749Z 2025-05-07T19:50:11.1661326Z .github/scripts/fbgemm_gpu_build.bash: line 370: [: : integer expression expected 2025-05-07T19:50:11.1662542Z 2025-05-07T19:50:11.1662970Z [BUILD] Enabling debug features in the build ... 
2025-05-07T19:50:11.1668098Z [BUILD] FBGEMM_GPU build arguments have been set: --verbose --build-target=default --build-variant=cuda --nvml_lib_path=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so --nccl_lib_path=/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 -DTORCH_CUDA_ARCH_LIST='7.0;8.0;9.0;9.0a' -DCUDA_TOOLKIT_ROOT_DIR=/github/home/miniconda/envs/build_binary/targets/x86_64-linux -DCUDAToolkit_ROOT=/github/home/miniconda/envs/build_binary/targets/x86_64-linux -DCMAKE_CXX_STANDARD=20 --debug 2025-05-07T19:50:11.1670792Z ################################################################################ 2025-05-07T19:50:11.1671153Z # Build FBGEMM-GPU Package (Wheel) 2025-05-07T19:50:11.1671464Z # 2025-05-07T19:50:11.1680327Z # [2025-05-07T19:50:11.167Z] + build_fbgemm_gpu_package build_binary nightly default/cuda 2025-05-07T19:50:11.1681283Z ################################################################################ 2025-05-07T19:50:11.1681530Z 2025-05-07T19:50:11.1681742Z [BUILD] Building FBGEMM wheel (TARGET=default, VARIANT=cuda) ... 
2025-05-07T19:50:11.1688113Z + conda run --no-capture-output -n build_binary python -m build --wheel --no-isolation --config-setting=--build-option=--verbose --config-setting=--build-option=--build-target=default --config-setting=--build-option=--build-variant=cuda --config-setting=--build-option=--nvml_lib_path=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so --config-setting=--build-option=--nccl_lib_path=/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 --config-setting=--build-option=-DTORCH_CUDA_ARCH_LIST='7.0;8.0;9.0;9.0a' --config-setting=--build-option=-DCUDA_TOOLKIT_ROOT_DIR=/github/home/miniconda/envs/build_binary/targets/x86_64-linux --config-setting=--build-option=-DCUDAToolkit_ROOT=/github/home/miniconda/envs/build_binary/targets/x86_64-linux --config-setting=--build-option=-DCMAKE_CXX_STANDARD=20 --config-setting=--build-option=--debug --config-setting=--build-option=--package_channel=nightly --config-setting=--build-option=--python-tag=py39 --config-setting=--build-option=--plat-name=manylinux_2_28_x86_64 2025-05-07T19:50:11.1692899Z 2025-05-07T19:50:12.9940916Z * Getting build dependencies for wheel... 
2025-05-07T19:50:14.4226828Z INFO:root:running egg_info 2025-05-07T19:50:14.4259407Z INFO:root:creating fbgemm_gpu_nightly.egg-info 2025-05-07T19:50:14.4260101Z INFO:root:writing fbgemm_gpu_nightly.egg-info/PKG-INFO 2025-05-07T19:50:14.4261986Z INFO:root:writing dependency_links to fbgemm_gpu_nightly.egg-info/dependency_links.txt 2025-05-07T19:50:14.4263920Z INFO:root:writing requirements to fbgemm_gpu_nightly.egg-info/requires.txt 2025-05-07T19:50:14.4264939Z INFO:root:writing top-level names to fbgemm_gpu_nightly.egg-info/top_level.txt 2025-05-07T19:50:14.4266123Z INFO:root:writing manifest file 'fbgemm_gpu_nightly.egg-info/SOURCES.txt' 2025-05-07T19:50:14.4332235Z INFO:root:reading manifest file 'fbgemm_gpu_nightly.egg-info/SOURCES.txt' 2025-05-07T19:50:14.4344807Z INFO:root:writing manifest file 'fbgemm_gpu_nightly.egg-info/SOURCES.txt' 2025-05-07T19:50:14.4350921Z [SETUP.PY] ARGV: ['setup.py', 'egg_info'] 2025-05-07T19:50:14.4352101Z [SETUP.PY] Parsed setup.py arguments: Namespace(verbose=False, debug=False, dryrun=False, build_target='default', build_variant='cuda', package_channel='nightly', nvml_lib_path=None, nccl_lib_path=None, use_fb_only=False, cxxprefix=None) 2025-05-07T19:50:14.4353253Z [SETUP.PY] Other arguments: ['egg_info'] 2025-05-07T19:50:14.4353791Z [SETUP.PY] CUDA CUB directory environment variable not set. Using default CUB location. 2025-05-07T19:50:14.4354419Z [SETUP.PY] Using CUDA = /github/home/miniconda/envs/build_binary 2025-05-07T19:50:14.4355294Z [SETUP.PY] Generating version file at: /__w/FBGEMM/FBGEMM/fbgemm_gpu/fbgemm_gpu/docs/version.py 2025-05-07T19:50:14.4355920Z [SETUP.PY] Setting the FBGEMM build target: default ... 2025-05-07T19:50:14.4356351Z [SETUP.PY] Setting the FBGEMM build variant: cuda ... 
2025-05-07T19:50:14.4357680Z [SETUP.PY] Passing CMake arguments: ['-DCMAKE_PREFIX_PATH=/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch', '-D_GLIBCXX_USE_CXX11_ABI=1', '-DFBGEMM_BUILD_TARGET=default', '-DFBGEMM_BUILD_VARIANT=cuda', "-DCMAKE_C_FLAGS=''", "-DCMAKE_CXX_FLAGS=''"] 2025-05-07T19:50:14.7744749Z * Building wheel... 2025-05-07T19:50:16.2020721Z [SETUP.PY] ARGV: ['setup.py', 'bdist_wheel', '--dist-dir', '/__w/FBGEMM/FBGEMM/fbgemm_gpu/dist/.tmp-wvfgktr9', '--verbose', '--build-target=default', '--build-variant=cuda', '--nvml_lib_path=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so', '--nccl_lib_path=/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2', '-DTORCH_CUDA_ARCH_LIST=7.0;8.0;9.0;9.0a', '-DCUDA_TOOLKIT_ROOT_DIR=/github/home/miniconda/envs/build_binary/targets/x86_64-linux', '-DCUDAToolkit_ROOT=/github/home/miniconda/envs/build_binary/targets/x86_64-linux', '-DCMAKE_CXX_STANDARD=20', '--debug', '--package_channel=nightly', '--python-tag=py39', '--plat-name=manylinux_2_28_x86_64'] 2025-05-07T19:50:16.2025293Z [SETUP.PY] Parsed setup.py arguments: Namespace(verbose=True, debug=True, dryrun=False, build_target='default', build_variant='cuda', package_channel='nightly', nvml_lib_path='/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so', nccl_lib_path='/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2', use_fb_only=False, cxxprefix=None) 2025-05-07T19:50:16.2028563Z [SETUP.PY] Other arguments: ['bdist_wheel', '--dist-dir', '/__w/FBGEMM/FBGEMM/fbgemm_gpu/dist/.tmp-wvfgktr9', '-DTORCH_CUDA_ARCH_LIST=7.0;8.0;9.0;9.0a', '-DCUDA_TOOLKIT_ROOT_DIR=/github/home/miniconda/envs/build_binary/targets/x86_64-linux', '-DCUDAToolkit_ROOT=/github/home/miniconda/envs/build_binary/targets/x86_64-linux', '-DCMAKE_CXX_STANDARD=20', '--python-tag=py39', '--plat-name=manylinux_2_28_x86_64'] 2025-05-07T19:50:16.2030531Z 
[SETUP.PY] CUDA CUB directory environment variable not set. Using default CUB location. 2025-05-07T19:50:16.2031165Z [SETUP.PY] Using CUDA = /github/home/miniconda/envs/build_binary 2025-05-07T19:50:16.2031783Z [SETUP.PY] Generating version file at: /__w/FBGEMM/FBGEMM/fbgemm_gpu/fbgemm_gpu/docs/version.py 2025-05-07T19:50:16.2032418Z [SETUP.PY] Setting the FBGEMM build target: default ... 2025-05-07T19:50:16.2032865Z [SETUP.PY] Setting the FBGEMM build variant: cuda ... 2025-05-07T19:50:16.2037973Z [SETUP.PY] Passing CMake arguments: ['-DCMAKE_PREFIX_PATH=/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch', '-D_GLIBCXX_USE_CXX11_ABI=1', '-DCMAKE_VERBOSE_MAKEFILE=ON', '-DCMAKE_EXPORT_COMPILE_COMMANDS=TRUE', '-DFBGEMM_BUILD_TARGET=default', '-DFBGEMM_BUILD_VARIANT=cuda', '-DNVML_LIB_PATH=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so', '-DNCCL_INCLUDE_DIRS=/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include', '-DNCCL_LIBRARIES=/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2', "-DCMAKE_C_FLAGS='-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib'", "-DCMAKE_CXX_FLAGS='-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib'", '-DTORCH_CUDA_ARCH_LIST=7.0;8.0;9.0;9.0a', '-DCUDA_TOOLKIT_ROOT_DIR=/github/home/miniconda/envs/build_binary/targets/x86_64-linux', '-DCUDAToolkit_ROOT=/github/home/miniconda/envs/build_binary/targets/x86_64-linux', '-DCMAKE_CXX_STANDARD=20'] 2025-05-07T19:50:16.2042920Z 2025-05-07T19:50:16.2042924Z 2025-05-07T19:50:16.2043357Z -------------------------------------------------------------------------------- 2025-05-07T19:50:16.2043773Z -- Trying 'Ninja' generator 2025-05-07T19:50:16.2044091Z -------------------------------- 2025-05-07T19:50:16.2044377Z 
--------------------------- 2025-05-07T19:50:16.2044672Z ---------------------- 2025-05-07T19:50:16.2044909Z ----------------- 2025-05-07T19:50:16.2045167Z ------------ 2025-05-07T19:50:16.2045379Z ------- 2025-05-07T19:50:16.2045615Z -- 2025-05-07T19:50:16.2475536Z CMake Deprecation Warning at CMakeLists.txt:1 (cmake_minimum_required): 2025-05-07T19:50:16.2476224Z Compatibility with CMake < 3.10 will be removed from a future version of 2025-05-07T19:50:16.2477100Z CMake. 2025-05-07T19:50:16.2477442Z Not searching for unused variables given on the command line. 2025-05-07T19:50:16.2477803Z 2025-05-07T19:50:16.2478042Z Update the VERSION argument <min> value. Or, use the <min>...<max> syntax 2025-05-07T19:50:16.2478632Z to tell CMake that the project requires at least <min> but has been updated 2025-05-07T19:50:16.2479206Z to work with policies introduced by <max> or earlier. 2025-05-07T19:50:16.2479481Z 2025-05-07T19:50:16.2479486Z 2025-05-07T19:50:16.2932354Z -- The C compiler identification is GNU 11.4.0 2025-05-07T19:50:16.3022145Z -- Detecting C compiler ABI info 2025-05-07T19:50:16.3969217Z -- Detecting C compiler ABI info - done 2025-05-07T19:50:16.4151821Z -- Check for working C compiler: /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-cc - skipped 2025-05-07T19:50:16.4153735Z -- Detecting C compile features 2025-05-07T19:50:16.4157348Z -- Detecting C compile features - done 2025-05-07T19:50:16.4936645Z -- The CXX compiler identification is GNU 11.4.0 2025-05-07T19:50:16.5010579Z -- Detecting CXX compiler ABI info 2025-05-07T19:50:16.5959769Z -- Detecting CXX compiler ABI info - done 2025-05-07T19:50:16.6158391Z -- Check for working CXX compiler: /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ - skipped 2025-05-07T19:50:16.6159631Z -- Detecting CXX compile features 2025-05-07T19:50:16.6165511Z -- Detecting CXX compile features - done 2025-05-07T19:50:16.6231026Z -- Configuring done (0.4s) 2025-05-07T19:50:16.6279455Z -- Generating done (0.0s) 
2025-05-07T19:50:16.6291253Z -- Build files have been written to: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_cmake_test_compile/build 2025-05-07T19:50:16.6341787Z -- 2025-05-07T19:50:16.6342058Z ------- 2025-05-07T19:50:16.6342305Z ------------ 2025-05-07T19:50:16.6342539Z ----------------- 2025-05-07T19:50:16.6342806Z ---------------------- 2025-05-07T19:50:16.6343061Z --------------------------- 2025-05-07T19:50:16.6343357Z -------------------------------- 2025-05-07T19:50:16.6343706Z -- Trying 'Ninja' generator - success 2025-05-07T19:50:16.6344096Z -------------------------------------------------------------------------------- 2025-05-07T19:50:16.6344396Z 2025-05-07T19:50:16.6366912Z Configuring Project 2025-05-07T19:50:16.6367306Z Working directory: 2025-05-07T19:50:16.6367766Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build 2025-05-07T19:50:16.6368245Z Command: 2025-05-07T19:50:16.6388573Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/cmake/data/bin/cmake /__w/FBGEMM/FBGEMM/fbgemm_gpu -G Ninja -DCMAKE_MAKE_PROGRAM:FILEPATH=/github/home/miniconda/envs/build_binary/bin/ninja --no-warn-unused-cli -DCMAKE_INSTALL_PREFIX:PATH=/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install -DPYTHON_VERSION_STRING:STRING=3.9.22 -DSKBUILD:INTERNAL=TRUE -DCMAKE_MODULE_PATH:PATH=/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/skbuild/resources/cmake -DPYTHON_EXECUTABLE:PATH=/github/home/miniconda/envs/build_binary/bin/python -DPYTHON_INCLUDE_DIR:PATH=/github/home/miniconda/envs/build_binary/include/python3.9 -DPYTHON_LIBRARY:PATH=/github/home/miniconda/envs/build_binary/lib/libpython3.9.so -DPython_EXECUTABLE:PATH=/github/home/miniconda/envs/build_binary/bin/python -DPython_ROOT_DIR:PATH=/github/home/miniconda/envs/build_binary -DPython_FIND_REGISTRY:STRING=NEVER -DPython_INCLUDE_DIR:PATH=/github/home/miniconda/envs/build_binary/include/python3.9 
-DPython_NumPy_INCLUDE_DIRS:PATH=/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/numpy/_core/include -DPython3_EXECUTABLE:PATH=/github/home/miniconda/envs/build_binary/bin/python -DPython3_ROOT_DIR:PATH=/github/home/miniconda/envs/build_binary -DPython3_FIND_REGISTRY:STRING=NEVER -DPython3_INCLUDE_DIR:PATH=/github/home/miniconda/envs/build_binary/include/python3.9 -DPython3_NumPy_INCLUDE_DIRS:PATH=/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/numpy/_core/include -DCMAKE_MAKE_PROGRAM:FILEPATH=/github/home/miniconda/envs/build_binary/bin/ninja -DCMAKE_AR=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-ar -DCMAKE_CXX_COMPILER_AR=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-gcc-ar -DCMAKE_C_COMPILER_AR=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-gcc-ar -DCMAKE_RANLIB=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-ranlib -DCMAKE_CXX_COMPILER_RANLIB=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-gcc-ranlib -DCMAKE_C_COMPILER_RANLIB=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-gcc-ranlib -DCMAKE_LINKER=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-ld -DCMAKE_STRIP=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-strip -DCMAKE_BUILD_TYPE=Release -DCMAKE_PREFIX_PATH=/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch -D_GLIBCXX_USE_CXX11_ABI=1 -DCMAKE_VERBOSE_MAKEFILE=ON -DCMAKE_EXPORT_COMPILE_COMMANDS=TRUE -DFBGEMM_BUILD_TARGET=default -DFBGEMM_BUILD_VARIANT=cuda -DNVML_LIB_PATH=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -DNCCL_INCLUDE_DIRS=/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -DNCCL_LIBRARIES=/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 '-DCMAKE_C_FLAGS='"'"'-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib'"'"'' '-DCMAKE_CXX_FLAGS='"'"'-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib'"'"'' '-DTORCH_CUDA_ARCH_LIST=7.0;8.0;9.0;9.0a' -DCUDA_TOOLKIT_ROOT_DIR=/github/home/miniconda/envs/build_binary/targets/x86_64-linux -DCUDAToolkit_ROOT=/github/home/miniconda/envs/build_binary/targets/x86_64-linux -DCMAKE_CXX_STANDARD=20 '-DTORCH_CUDA_ARCH_LIST=7.0;8.0;9.0;9.0a' -DCUDA_TOOLKIT_ROOT_DIR=/github/home/miniconda/envs/build_binary/targets/x86_64-linux -DCUDAToolkit_ROOT=/github/home/miniconda/envs/build_binary/targets/x86_64-linux -DCMAKE_CXX_STANDARD=20 -DCMAKE_AR=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-ar -DCMAKE_CXX_COMPILER_AR=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-gcc-ar -DCMAKE_C_COMPILER_AR=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-gcc-ar -DCMAKE_RANLIB=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-ranlib -DCMAKE_CXX_COMPILER_RANLIB=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-gcc-ranlib -DCMAKE_C_COMPILER_RANLIB=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-gcc-ranlib -DCMAKE_LINKER=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-ld -DCMAKE_STRIP=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-strip -DCMAKE_BUILD_TYPE=Release 2025-05-07T19:50:16.6408605Z 2025-05-07T19:50:16.6902373Z 2025-05-07T19:50:16.6902391Z 2025-05-07T19:50:16.6903013Z ================================================================================ 2025-05-07T19:50:16.6903521Z Default C compiler flags 2025-05-07T19:50:16.6903902Z (values may be overridden by CMAKE_CXX_STANDARD and CXX_STANDARD): 2025-05-07T19:50:16.6904223Z 2025-05-07T19:50:16.6904736Z -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib 2025-05-07T19:50:16.6905773Z ================================================================================ 2025-05-07T19:50:16.6906056Z 2025-05-07T19:50:16.6906060Z 2025-05-07T19:50:16.6906064Z 2025-05-07T19:50:16.6906180Z ================================================================================ 2025-05-07T19:50:16.6906556Z Default C++ compiler flags 2025-05-07T19:50:16.6906932Z (values may be overridden by CMAKE_CXX_STANDARD and CXX_STANDARD): 2025-05-07T19:50:16.6907247Z 2025-05-07T19:50:16.6907730Z -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib 2025-05-07T19:50:16.6908567Z ================================================================================ 2025-05-07T19:50:16.6908832Z 2025-05-07T19:50:16.6908836Z 2025-05-07T19:50:16.6908841Z 2025-05-07T19:50:16.6908957Z ================================================================================ 2025-05-07T19:50:16.6909307Z AVX2_FLAGS: 2025-05-07T19:50:16.6909436Z 2025-05-07T19:50:16.6909525Z -mavx2 2025-05-07T19:50:16.6909759Z -mf16c 2025-05-07T19:50:16.6909959Z -mfma 2025-05-07T19:50:16.6910186Z -fopenmp 2025-05-07T19:50:16.6910421Z ================================================================================ 2025-05-07T19:50:16.6910684Z 2025-05-07T19:50:16.6910688Z 2025-05-07T19:50:16.6910692Z 2025-05-07T19:50:16.6910810Z ================================================================================ 2025-05-07T19:50:16.6911163Z AVX512_FLAGS: 2025-05-07T19:50:16.6911298Z 2025-05-07T19:50:16.6911383Z -mavx2 2025-05-07T19:50:16.6911615Z -mf16c 2025-05-07T19:50:16.6911822Z -mfma 2025-05-07T19:50:16.6912052Z -mavx512f 2025-05-07T19:50:16.6912260Z -mavx512bw 2025-05-07T19:50:16.6912491Z -mavx512dq 2025-05-07T19:50:16.6912698Z -mavx512vl 2025-05-07T19:50:16.6912930Z -fopenmp 2025-05-07T19:50:16.6913170Z 
================================================================================ 2025-05-07T19:50:16.6913436Z 2025-05-07T19:50:16.6913642Z Not searching for unused variables given on the command line. 2025-05-07T19:50:16.6913979Z 2025-05-07T19:50:16.6914012Z 2025-05-07T19:50:16.6914133Z ================================================================================ 2025-05-07T19:50:16.6914491Z The project is built using scikit-build 2025-05-07T19:50:16.6914862Z ================================================================================ 2025-05-07T19:50:16.6915102Z 2025-05-07T19:50:16.6915106Z 2025-05-07T19:50:16.6915109Z 2025-05-07T19:50:16.6915258Z ================================================================================ 2025-05-07T19:50:16.6915601Z Build Settings 2025-05-07T19:50:16.6915768Z 2025-05-07T19:50:16.6915886Z FBGEMM_BUILD_TARGET : default 2025-05-07T19:50:16.6916195Z FBGEMM_BUILD_VARIANT : cuda 2025-05-07T19:50:16.6916408Z 2025-05-07T19:50:16.6916514Z NVCC_VERBOSE : 2025-05-07T19:50:16.6916780Z CUDNN_INCLUDE_DIR : 2025-05-07T19:50:16.6917074Z CUDNN_LIBRARY : 2025-05-07T19:50:16.6917555Z NVML_LIB_PATH : /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:16.6918162Z TORCH_CUDA_ARCH_LIST : 7.0 2025-05-07T19:50:16.6918444Z 8.0 2025-05-07T19:50:16.6918635Z 9.0 2025-05-07T19:50:16.6918850Z 9.0a 2025-05-07T19:50:16.6918962Z 2025-05-07T19:50:16.6919061Z HIP_ROOT_DIR : 2025-05-07T19:50:16.6919342Z HIPCC_VERBOSE : 2025-05-07T19:50:16.6919605Z AMDGPU_TARGETS : 2025-05-07T19:50:16.6919889Z PYTORCH_ROCM_ARCH : 2025-05-07T19:50:16.6920173Z ================================================================================ 2025-05-07T19:50:16.6920435Z 2025-05-07T19:50:16.7705111Z -- The CXX compiler identification is GNU 11.4.0 2025-05-07T19:50:16.8152919Z -- The C compiler identification is GNU 11.4.0 2025-05-07T19:50:17.7575146Z -- The CUDA compiler identification is NVIDIA 12.6.85 with host compiler GNU 11.4.0 
2025-05-07T19:50:17.7666549Z -- Detecting CXX compiler ABI info 2025-05-07T19:50:17.8612748Z -- Detecting CXX compiler ABI info - done 2025-05-07T19:50:17.8804546Z -- Check for working CXX compiler: /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ - skipped 2025-05-07T19:50:17.8805243Z -- Detecting CXX compile features 2025-05-07T19:50:17.8810189Z -- Detecting CXX compile features - done 2025-05-07T19:50:17.8931468Z -- Detecting C compiler ABI info 2025-05-07T19:50:17.9799740Z -- Detecting C compiler ABI info - done 2025-05-07T19:50:17.9981595Z -- Check for working C compiler: /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-cc - skipped 2025-05-07T19:50:17.9982527Z -- Detecting C compile features 2025-05-07T19:50:17.9985867Z -- Detecting C compile features - done 2025-05-07T19:50:18.0086117Z -- Detecting CUDA compiler ABI info 2025-05-07T19:50:18.9295971Z -- Detecting CUDA compiler ABI info - done 2025-05-07T19:50:18.9861877Z -- Check for working CUDA compiler: /github/home/miniconda/envs/build_binary/bin/nvcc - skipped 2025-05-07T19:50:18.9894974Z -- Detecting CUDA compile features 2025-05-07T19:50:18.9896334Z -- Detecting CUDA compile features - done 2025-05-07T19:50:18.9987805Z -- Performing Test C_HAS_AVX_1 2025-05-07T19:50:19.2583761Z -- Performing Test C_HAS_AVX_1 - Failed 2025-05-07T19:50:19.2584794Z -- Performing Test C_HAS_AVX_2 2025-05-07T19:50:19.5302201Z -- Performing Test C_HAS_AVX_2 - Success 2025-05-07T19:50:19.5303258Z -- Performing Test C_HAS_AVX2_1 2025-05-07T19:50:19.7845643Z -- Performing Test C_HAS_AVX2_1 - Failed 2025-05-07T19:50:19.7846696Z -- Performing Test C_HAS_AVX2_2 2025-05-07T19:50:20.0521901Z -- Performing Test C_HAS_AVX2_2 - Success 2025-05-07T19:50:20.0522978Z -- Performing Test C_HAS_AVX512_1 2025-05-07T19:50:20.3074056Z -- Performing Test C_HAS_AVX512_1 - Failed 2025-05-07T19:50:20.3076328Z -- Performing Test C_HAS_AVX512_2 2025-05-07T19:50:20.5222008Z -- Performing Test C_HAS_AVX512_2 - 
Success 2025-05-07T19:50:20.5223096Z -- Performing Test CXX_HAS_AVX_1 2025-05-07T19:50:20.7777569Z -- Performing Test CXX_HAS_AVX_1 - Failed 2025-05-07T19:50:20.7778695Z -- Performing Test CXX_HAS_AVX_2 2025-05-07T19:50:21.0490751Z -- Performing Test CXX_HAS_AVX_2 - Success 2025-05-07T19:50:21.0491676Z -- Performing Test CXX_HAS_AVX2_1 2025-05-07T19:50:21.3026338Z -- Performing Test CXX_HAS_AVX2_1 - Failed 2025-05-07T19:50:21.3027486Z -- Performing Test CXX_HAS_AVX2_2 2025-05-07T19:50:21.5703705Z -- Performing Test CXX_HAS_AVX2_2 - Success 2025-05-07T19:50:21.5705580Z -- Performing Test CXX_HAS_AVX512_1 2025-05-07T19:50:21.8252801Z -- Performing Test CXX_HAS_AVX512_1 - Failed 2025-05-07T19:50:21.8253922Z -- Performing Test CXX_HAS_AVX512_2 2025-05-07T19:50:22.0401851Z -- Performing Test CXX_HAS_AVX512_2 - Success 2025-05-07T19:50:22.0583119Z -- Found CUDA: /github/home/miniconda/envs/build_binary/targets/x86_64-linux (found version "12.6") 2025-05-07T19:50:22.0619218Z -- Found CUDAToolkit: /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include (found version "12.6.85") 2025-05-07T19:50:22.0696630Z -- Performing Test CMAKE_HAVE_LIBC_PTHREAD 2025-05-07T19:50:22.1575280Z -- Performing Test CMAKE_HAVE_LIBC_PTHREAD - Failed 2025-05-07T19:50:22.1576174Z -- Looking for pthread_create in pthreads 2025-05-07T19:50:22.2342033Z -- Looking for pthread_create in pthreads - not found 2025-05-07T19:50:22.2343216Z -- Looking for pthread_create in pthread 2025-05-07T19:50:22.3223480Z -- Looking for pthread_create in pthread - found 2025-05-07T19:50:22.3235896Z -- Found Threads: TRUE 2025-05-07T19:50:22.4858849Z -- PyTorch: CUDA detected: 12.6 2025-05-07T19:50:22.4859575Z -- PyTorch: CUDA nvcc is: /github/home/miniconda/envs/build_binary/targets/x86_64-linux/bin/nvcc 2025-05-07T19:50:22.4860530Z -- PyTorch: CUDA toolkit directory: /github/home/miniconda/envs/build_binary/targets/x86_64-linux 2025-05-07T19:50:22.6058784Z -- PyTorch: Header version is: 12.6 
2025-05-07T19:50:22.7846098Z -- Found Python: /github/home/miniconda/envs/build_binary/bin/python (found version "3.9.22") found components: Interpreter 2025-05-07T19:50:22.7859811Z CMake Warning at /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/share/cmake/Caffe2/public/cuda.cmake:140 (message): 2025-05-07T19:50:22.7860769Z -- USE_CUDNN is set to 0. Compiling without cuDNN support 2025-05-07T19:50:22.7861299Z Failed to compute shorthash for libnvrtc.so 2025-05-07T19:50:22.7861641Z Call Stack (most recent call first): 2025-05-07T19:50:22.7862369Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/share/cmake/Caffe2/Caffe2Config.cmake:86 (include) 2025-05-07T19:50:22.7863506Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/share/cmake/Torch/TorchConfig.cmake:68 (find_package) 2025-05-07T19:50:22.7864545Z /__w/FBGEMM/FBGEMM/cmake/modules/PyTorchSetup.cmake:14 (find_package) 2025-05-07T19:50:22.7865019Z CMakeLists.txt:112 (include) 2025-05-07T19:50:22.7865205Z 2025-05-07T19:50:22.7865209Z 2025-05-07T19:50:22.7865414Z -- USE_CUSPARSELT is set to 0. Compiling without cuSPARSELt support 2025-05-07T19:50:22.7865898Z -- USE_CUDSS is set to 0. Compiling without cuDSS support 2025-05-07T19:50:22.7866325Z -- USE_CUFILE is set to 0. Compiling without cuFile support 2025-05-07T19:50:22.7867268Z -- Added CUDA NVCC flags for: -gencode;arch=compute_70,code=sm_70;-gencode;arch=compute_80,code=sm_80;-gencode;arch=compute_90,code=sm_90;-gencode;arch=compute_90a,code=sm_90a 2025-05-07T19:50:22.8211790Z CMake Warning at /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/share/cmake/Torch/TorchConfig.cmake:22 (message): 2025-05-07T19:50:22.8214272Z static library kineto_LIBRARY-NOTFOUND not found. 
2025-05-07T19:50:22.8215335Z Call Stack (most recent call first): 2025-05-07T19:50:22.8217382Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/share/cmake/Torch/TorchConfig.cmake:125 (append_torchlib_if_found) 2025-05-07T19:50:22.8218360Z /__w/FBGEMM/FBGEMM/cmake/modules/PyTorchSetup.cmake:14 (find_package) 2025-05-07T19:50:22.8218858Z CMakeLists.txt:112 (include) 2025-05-07T19:50:22.8219052Z 2025-05-07T19:50:22.8219056Z 2025-05-07T19:50:22.8219083Z 2025-05-07T19:50:22.8219087Z 2025-05-07T19:50:22.8219212Z ================================================================================ 2025-05-07T19:50:22.8219557Z PyTorch Flags: 2025-05-07T19:50:22.8219950Z 2025-05-07T19:50:22.8220168Z TORCH_INCLUDE_DIRS: 2025-05-07T19:50:22.8220741Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include 2025-05-07T19:50:22.8221553Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:22.8222184Z 2025-05-07T19:50:22.8222393Z TORCH_LIBRARIES: 2025-05-07T19:50:22.8222656Z torch 2025-05-07T19:50:22.8222871Z torch_library 2025-05-07T19:50:22.8223348Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so 2025-05-07T19:50:22.8224079Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:22.8224800Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:22.8225367Z 2025-05-07T19:50:22.8225560Z TORCH_CUDA_OPTIONS: 2025-05-07T19:50:22.8225829Z --expt-relaxed-constexpr 2025-05-07T19:50:22.8226104Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:22.8226405Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:22.8226839Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:22.8227256Z ================================================================================ 2025-05-07T19:50:22.8227477Z 2025-05-07T19:50:22.8227847Z -- Found Torch: 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so 2025-05-07T19:50:22.8228323Z 2025-05-07T19:50:22.8228328Z 2025-05-07T19:50:22.8228433Z ================================================================================ 2025-05-07T19:50:22.8228741Z NCCL Flags 2025-05-07T19:50:22.8228857Z 2025-05-07T19:50:22.8229227Z NCCL_INCLUDE_DIRS=/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include 2025-05-07T19:50:22.8230299Z NCCL_LIBRARIES=/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:22.8230924Z ================================================================================ 2025-05-07T19:50:22.8231146Z 2025-05-07T19:50:22.8231150Z 2025-05-07T19:50:22.8231154Z 2025-05-07T19:50:22.8231272Z ================================================================================ 2025-05-07T19:50:22.8231614Z CUDA Driver Path 2025-05-07T19:50:22.8231755Z 2025-05-07T19:50:22.8232134Z CUDA_DRIVER_LIBRARIES=/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:22.8232789Z ================================================================================ 2025-05-07T19:50:22.8233009Z 2025-05-07T19:50:22.8233322Z -- Found NVML_LIB_PATH: /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:22.8248960Z 2025-05-07T19:50:22.8249170Z 2025-05-07T19:50:22.8249514Z ================================================================================ 2025-05-07T19:50:22.8250045Z GPU CPP Library Target: asmjit (SHARED) 2025-05-07T19:50:22.8250391Z 2025-05-07T19:50:22.8250596Z CPU_SRCS: 2025-05-07T19:50:22.8250722Z 2025-05-07T19:50:22.8250842Z 2025-05-07T19:50:22.8251044Z GPU_SRCS: 2025-05-07T19:50:22.8251200Z 2025-05-07T19:50:22.8251410Z 2025-05-07T19:50:22.8251615Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:22.8251790Z 2025-05-07T19:50:22.8251884Z 2025-05-07T19:50:22.8252090Z HIP_SPECIFIC_SRCS: 
2025-05-07T19:50:22.8252310Z 2025-05-07T19:50:22.8252396Z 2025-05-07T19:50:22.8252618Z OTHER_SRCS: 2025-05-07T19:50:22.8253023Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64assembler.cpp 2025-05-07T19:50:22.8253686Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64builder.cpp 2025-05-07T19:50:22.8254331Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64compiler.cpp 2025-05-07T19:50:22.8254965Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64emithelper.cpp 2025-05-07T19:50:22.8255632Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64formatter.cpp 2025-05-07T19:50:22.8256241Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64func.cpp 2025-05-07T19:50:22.8256874Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64instapi.cpp 2025-05-07T19:50:22.8257492Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64instdb.cpp 2025-05-07T19:50:22.8258125Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64operand.cpp 2025-05-07T19:50:22.8258758Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64rapass.cpp 2025-05-07T19:50:22.8259392Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/armformatter.cpp 2025-05-07T19:50:22.8260154Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/archtraits.cpp 2025-05-07T19:50:22.8260792Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/assembler.cpp 2025-05-07T19:50:22.8261435Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/builder.cpp 2025-05-07T19:50:22.8262075Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/codeholder.cpp 2025-05-07T19:50:22.8262713Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/codewriter.cpp 2025-05-07T19:50:22.8263360Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/compiler.cpp 2025-05-07T19:50:22.8263976Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/constpool.cpp 2025-05-07T19:50:22.8264610Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/cpuinfo.cpp 2025-05-07T19:50:22.8265227Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/emithelper.cpp 2025-05-07T19:50:22.8265873Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/emitter.cpp 2025-05-07T19:50:22.8266523Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/emitterutils.cpp 2025-05-07T19:50:22.8267163Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/environment.cpp 2025-05-07T19:50:22.8268013Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/errorhandler.cpp 2025-05-07T19:50:22.8268660Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/formatter.cpp 2025-05-07T19:50:22.8269297Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/func.cpp 2025-05-07T19:50:22.8269953Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/funcargscontext.cpp 2025-05-07T19:50:22.8270595Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/globals.cpp 2025-05-07T19:50:22.8271214Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/inst.cpp 2025-05-07T19:50:22.8271867Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/instdb.cpp 2025-05-07T19:50:22.8272509Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/jitallocator.cpp 2025-05-07T19:50:22.8273140Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/jitruntime.cpp 2025-05-07T19:50:22.8273785Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/logger.cpp 2025-05-07T19:50:22.8274410Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/operand.cpp 2025-05-07T19:50:22.8275013Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/osutils.cpp 2025-05-07T19:50:22.8275768Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/ralocal.cpp 2025-05-07T19:50:22.8276446Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/rapass.cpp 2025-05-07T19:50:22.8277043Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/rastack.cpp 2025-05-07T19:50:22.8277635Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/string.cpp 2025-05-07T19:50:22.8278192Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/support.cpp 2025-05-07T19:50:22.8278773Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/target.cpp 2025-05-07T19:50:22.8279322Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/type.cpp 2025-05-07T19:50:22.8279908Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/virtmem.cpp 2025-05-07T19:50:22.8280488Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zone.cpp 2025-05-07T19:50:22.8281042Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zonehash.cpp 2025-05-07T19:50:22.8281647Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zonelist.cpp 2025-05-07T19:50:22.8282811Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zonestack.cpp 2025-05-07T19:50:22.8283517Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zonetree.cpp 2025-05-07T19:50:22.8284142Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zonevector.cpp 2025-05-07T19:50:22.8284809Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86assembler.cpp 2025-05-07T19:50:22.8285464Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86builder.cpp 2025-05-07T19:50:22.8286091Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86compiler.cpp 2025-05-07T19:50:22.8286763Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86emithelper.cpp 2025-05-07T19:50:22.8287407Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86formatter.cpp 2025-05-07T19:50:22.8288039Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86func.cpp 2025-05-07T19:50:22.8288637Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86instapi.cpp 2025-05-07T19:50:22.8289272Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86instdb.cpp 2025-05-07T19:50:22.8289916Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86operand.cpp 2025-05-07T19:50:22.8290523Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86rapass.cpp 2025-05-07T19:50:22.8290999Z 2025-05-07T19:50:22.8291197Z CC_FLAGS: 2025-05-07T19:50:22.8291349Z 2025-05-07T19:50:22.8291436Z 2025-05-07T19:50:22.8291631Z NVCC_FLAGS: 2025-05-07T19:50:22.8291781Z 2025-05-07T19:50:22.8291867Z 2025-05-07T19:50:22.8292243Z HIPCC_FLAGS: 2025-05-07T19:50:22.8292381Z 2025-05-07T19:50:22.8292466Z 2025-05-07T19:50:22.8292696Z INCLUDE_DIRS: 2025-05-07T19:50:22.8292942Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:22.8293302Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:22.8293600Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:22.8293955Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:22.8294467Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include 2025-05-07T19:50:22.8295395Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:22.8296123Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:22.8296532Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:22.8296987Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:22.8297453Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:22.8297996Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:22.8298450Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:22.8299014Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include 2025-05-07T19:50:22.8299522Z 2025-05-07T19:50:22.8299804Z Selected Source Files: 2025-05-07T19:50:22.8299959Z 2025-05-07T19:50:22.8300247Z 2025-05-07T19:50:22.8300463Z HIPified Source Files: 2025-05-07T19:50:22.8300628Z 2025-05-07T19:50:22.8300814Z 2025-05-07T19:50:22.8301030Z Library Dependencies: 2025-05-07T19:50:22.8301321Z torch 2025-05-07T19:50:22.8301529Z torch_library 2025-05-07T19:50:22.8302004Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so 2025-05-07T19:50:22.8302705Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:22.8303425Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:22.8304244Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:22.8304987Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:22.8305611Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:22.8306019Z 2025-05-07T19:50:22.8306248Z Output Library: 2025-05-07T19:50:22.8306472Z asmjit 2025-05-07T19:50:22.8306698Z 2025-05-07T19:50:22.8306908Z Destination Directory: 2025-05-07T19:50:22.8307187Z fbgemm_gpu 2025-05-07T19:50:22.8307455Z ================================================================================ 2025-05-07T19:50:22.8307698Z 2025-05-07T19:50:22.8307741Z 2025-05-07T19:50:22.8307745Z 2025-05-07T19:50:22.8307898Z ================================================================================ 2025-05-07T19:50:22.8308257Z GPU CPP Library Target: fbgemm (SHARED) 2025-05-07T19:50:22.8308592Z 2025-05-07T19:50:22.8308792Z CPU_SRCS: 2025-05-07T19:50:22.8308923Z 2025-05-07T19:50:22.8309034Z 2025-05-07T19:50:22.8309233Z 
GPU_SRCS: 2025-05-07T19:50:22.8309383Z 2025-05-07T19:50:22.8309470Z 2025-05-07T19:50:22.8309676Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:22.8309854Z 2025-05-07T19:50:22.8309940Z 2025-05-07T19:50:22.8310141Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:22.8310317Z 2025-05-07T19:50:22.8310402Z 2025-05-07T19:50:22.8310627Z OTHER_SRCS: 2025-05-07T19:50:22.8310908Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDM.cc 2025-05-07T19:50:22.8311392Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDMAutovec.cc 2025-05-07T19:50:22.8311880Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDMNBit.cc 2025-05-07T19:50:22.8312334Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/QuantUtils.cc 2025-05-07T19:50:22.8312861Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/RefImplementations.cc 2025-05-07T19:50:22.8313348Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/RowWiseSparseAdagradFused.cc 2025-05-07T19:50:22.8313815Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/SparseAdagrad.cc 2025-05-07T19:50:22.8314310Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/Utils.cc 2025-05-07T19:50:22.8314724Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDMAvx2.cc 2025-05-07T19:50:22.8315137Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/QuantUtilsAvx2.cc 2025-05-07T19:50:22.8315574Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDMAvx2.cc 2025-05-07T19:50:22.8315986Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/QuantUtilsAvx2.cc 2025-05-07T19:50:22.8316435Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDMAvx512.cc 2025-05-07T19:50:22.8316796Z 2025-05-07T19:50:22.8317012Z CC_FLAGS: 2025-05-07T19:50:22.8317195Z 2025-05-07T19:50:22.8317303Z 2025-05-07T19:50:22.8317491Z NVCC_FLAGS: 2025-05-07T19:50:22.8317611Z 2025-05-07T19:50:22.8317718Z 2025-05-07T19:50:22.8317905Z HIPCC_FLAGS: 2025-05-07T19:50:22.8318030Z 2025-05-07T19:50:22.8318144Z 2025-05-07T19:50:22.8318335Z INCLUDE_DIRS: 2025-05-07T19:50:22.8318597Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:22.8318908Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:22.8319219Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:22.8319525Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:22.8320033Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include 2025-05-07T19:50:22.8320823Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:22.8321439Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:22.8321874Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:22.8322297Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:22.8322795Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:22.8323304Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:22.8323793Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:22.8324365Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include 2025-05-07T19:50:22.8324861Z 2025-05-07T19:50:22.8325101Z Selected Source Files: 2025-05-07T19:50:22.8325257Z 2025-05-07T19:50:22.8325350Z 2025-05-07T19:50:22.8325587Z HIPified Source Files: 2025-05-07T19:50:22.8325744Z 2025-05-07T19:50:22.8325831Z 2025-05-07T19:50:22.8326069Z Library Dependencies: 2025-05-07T19:50:22.8326304Z torch 2025-05-07T19:50:22.8326540Z torch_library 2025-05-07T19:50:22.8326968Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so 2025-05-07T19:50:22.8327650Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:22.8328350Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:22.8329106Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:22.8329845Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 
2025-05-07T19:50:22.8330293Z asmjit 2025-05-07T19:50:22.8330642Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:22.8331052Z 2025-05-07T19:50:22.8331237Z Output Library: 2025-05-07T19:50:22.8331477Z fbgemm 2025-05-07T19:50:22.8331665Z 2025-05-07T19:50:22.8331890Z Destination Directory: 2025-05-07T19:50:22.8332132Z fbgemm_gpu 2025-05-07T19:50:22.8332382Z ================================================================================ 2025-05-07T19:50:22.8332607Z 2025-05-07T19:50:22.8332611Z 2025-05-07T19:50:22.8332614Z 2025-05-07T19:50:22.8332727Z ================================================================================ 2025-05-07T19:50:22.8333083Z Running code generation script ... 2025-05-07T19:50:22.8333829Z /github/home/miniconda/envs/build_binary/bin/python /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_backward_split.py --opensource 2025-05-07T19:50:22.8334575Z ================================================================================ 2025-05-07T19:50:22.8334804Z 2025-05-07T19:50:23.4588486Z [ARGS PARSE] Parsed arguments: Namespace(install_dir='.', is_fbcode=False, is_rocm=False) 2025-05-07T19:50:23.4591104Z [GENERAATE BACKWARD SPLIT]: ['/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_backward_split.py', '--opensource'] 2025-05-07T19:50:23.4593423Z Written: gen_embedding_backward_dense_split_weighted_vbe_cuda.cu 2025-05-07T19:50:23.4594872Z Written: gen_embedding_backward_dense_split_weighted_cuda.cu 2025-05-07T19:50:23.4596072Z Written: gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:23.4596577Z Written: gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:23.4597190Z Written: gen_embedding_backward_dense_split_unweighted_cuda.cu 2025-05-07T19:50:23.4597665Z Written: gen_embedding_backward_dense_split_weighted_vbe_meta.cpp 2025-05-07T19:50:23.4598164Z Written: gen_embedding_backward_dense_split_weighted_meta.cpp 2025-05-07T19:50:23.4598655Z 
Written: gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:23.4599197Z Written: gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:23.4599717Z Written: gen_embedding_backward_dense_split_unweighted_meta.cpp 2025-05-07T19:50:23.4600222Z Written: gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:23.4600777Z Written: gen_embedding_backward_dense_split_weighted_kernel_cta.cu 2025-05-07T19:50:23.4601306Z Written: gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:23.4601885Z Written: gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:23.4602418Z Written: gen_embedding_backward_dense_split_unweighted_kernel_cta.cu 2025-05-07T19:50:23.4602963Z Written: gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:23.4603502Z Written: gen_embedding_backward_dense_split_weighted_kernel_warp.cu 2025-05-07T19:50:23.4604028Z Written: gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:23.4604611Z Written: gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:23.4605133Z Written: gen_embedding_backward_dense_split_unweighted_kernel_warp.cu 2025-05-07T19:50:23.4605651Z Written: gen_embedding_optimizer_dense_split_device_kernel.cuh 2025-05-07T19:50:23.4606075Z Written: gen_embedding_backward_split_dense.cpp 2025-05-07T19:50:23.4606481Z Written: gen_embedding_backward_dense_split_cpu.cpp 2025-05-07T19:50:23.4606931Z Written: gen_embedding_backward_adagrad_split_weighted_cuda.cu 2025-05-07T19:50:23.4607428Z Written: gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:23.4607966Z Written: gen_embedding_backward_adagrad_split_unweighted_cuda.cu 2025-05-07T19:50:23.4608437Z Written: gen_embedding_backward_adagrad_split_weighted_meta.cpp 2025-05-07T19:50:23.4608973Z Written: gen_embedding_backward_adagrad_split_unweighted_nobag_meta.cpp 
2025-05-07T19:50:23.4609491Z Written: gen_embedding_backward_adagrad_split_unweighted_meta.cpp 2025-05-07T19:50:23.4610017Z Written: gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:50:23.4610573Z Written: gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:23.4611121Z Written: gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:50:23.4611664Z Written: gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:50:23.4612207Z Written: gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:23.4612787Z Written: gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:50:23.4613292Z Written: gen_embedding_optimizer_adagrad_split_device_kernel.cuh 2025-05-07T19:50:23.4613751Z Written: gen_embedding_backward_split_adagrad.cpp 2025-05-07T19:50:23.4614171Z Written: gen_embedding_split_adagrad_pt2_autograd.cpp 2025-05-07T19:50:23.4614621Z Written: gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:23.4615054Z Written: lookup_adagrad.py 2025-05-07T19:50:23.4615473Z Written: gen_embedding_backward_adagrad_split_cpu.cpp 2025-05-07T19:50:23.4615914Z Written: gen_embedding_backward_split_adagrad_cpu.cpp 2025-05-07T19:50:23.4616362Z Written: gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:23.4616871Z Written: gen_embedding_backward_adam_split_weighted_vbe_cuda.cu 2025-05-07T19:50:23.4617361Z Written: gen_embedding_backward_adam_split_weighted_cuda.cu 2025-05-07T19:50:23.4617830Z Written: gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:23.4618349Z Written: gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:23.4618886Z Written: gen_embedding_backward_adam_split_unweighted_cuda.cu 2025-05-07T19:50:23.4619371Z Written: gen_embedding_backward_adam_split_weighted_vbe_meta.cpp 2025-05-07T19:50:23.4619947Z Written: 
gen_embedding_backward_adam_split_weighted_meta.cpp 2025-05-07T19:50:23.4620657Z Written: gen_embedding_backward_adam_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:23.4621223Z Written: gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:23.4621750Z Written: gen_embedding_backward_adam_split_unweighted_meta.cpp 2025-05-07T19:50:23.4622287Z Written: gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:23.4622863Z Written: gen_embedding_backward_adam_split_weighted_kernel_cta.cu 2025-05-07T19:50:23.4623454Z Written: gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:23.4624044Z Written: gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:23.4624635Z Written: gen_embedding_backward_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:50:23.4625195Z Written: gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:23.4625782Z Written: gen_embedding_backward_adam_split_weighted_kernel_warp.cu 2025-05-07T19:50:23.4626355Z Written: gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:23.4626986Z Written: gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:23.4627597Z Written: gen_embedding_backward_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:50:23.4628133Z Written: gen_embedding_optimizer_adam_split_device_kernel.cuh 2025-05-07T19:50:23.4628621Z Written: gen_embedding_backward_split_adam.cpp 2025-05-07T19:50:23.4629030Z Written: gen_embedding_split_adam_pt2_autograd.cpp 2025-05-07T19:50:23.4629528Z Written: gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:50:23.4629958Z Written: lookup_adam.py 2025-05-07T19:50:23.4630308Z Written: gen_embedding_backward_split_adam_cpu.cpp 2025-05-07T19:50:23.4630801Z Written: gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:50:23.4631292Z Written: gen_embedding_backward_lamb_split_weighted_cuda.cu 
2025-05-07T19:50:23.4631827Z Written: gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:23.4632346Z Written: gen_embedding_backward_lamb_split_unweighted_cuda.cu 2025-05-07T19:50:23.4632956Z Written: gen_embedding_backward_lamb_split_weighted_meta.cpp 2025-05-07T19:50:23.4633438Z Written: gen_embedding_backward_lamb_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:23.4633951Z Written: gen_embedding_backward_lamb_split_unweighted_meta.cpp 2025-05-07T19:50:23.4634452Z Written: gen_embedding_backward_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:50:23.4634963Z Written: gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:23.4635515Z Written: gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:50:23.4636008Z Written: gen_embedding_backward_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:50:23.4636551Z Written: gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:23.4637085Z Written: gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:50:23.4637591Z Written: gen_embedding_optimizer_lamb_split_device_kernel.cuh 2025-05-07T19:50:23.4638025Z Written: gen_embedding_backward_split_lamb.cpp 2025-05-07T19:50:23.4638390Z Written: gen_embedding_split_lamb_pt2_autograd.cpp 2025-05-07T19:50:23.4638922Z Written: gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:50:23.4639312Z Written: lookup_lamb.py 2025-05-07T19:50:23.4639633Z Written: gen_embedding_backward_split_lamb_cpu.cpp 2025-05-07T19:50:23.4640055Z Written: gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:50:23.4640548Z Written: gen_embedding_backward_lars_sgd_split_weighted_cuda.cu 2025-05-07T19:50:23.4641050Z Written: gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:23.4641586Z Written: gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu 2025-05-07T19:50:23.4642158Z Written: 
gen_embedding_backward_lars_sgd_split_weighted_meta.cpp 2025-05-07T19:50:23.4642673Z Written: gen_embedding_backward_lars_sgd_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:23.4643215Z Written: gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp 2025-05-07T19:50:23.4643720Z Written: gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:50:23.4644296Z Written: gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:23.4644853Z Written: gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:50:23.4645414Z Written: gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:50:23.4645986Z Written: gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:23.4646555Z Written: gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:50:23.4647090Z Written: gen_embedding_optimizer_lars_sgd_split_device_kernel.cuh 2025-05-07T19:50:23.4647531Z Written: gen_embedding_backward_split_lars_sgd.cpp 2025-05-07T19:50:23.4647950Z Written: gen_embedding_split_lars_sgd_pt2_autograd.cpp 2025-05-07T19:50:23.4648402Z Written: gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:23.4648837Z Written: lookup_lars_sgd.py 2025-05-07T19:50:23.4649188Z Written: gen_embedding_backward_split_lars_sgd_cpu.cpp 2025-05-07T19:50:23.4649640Z Written: gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:23.4650190Z Written: gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu 2025-05-07T19:50:23.4650780Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:23.4651407Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu 2025-05-07T19:50:23.4651985Z Written: gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp 2025-05-07T19:50:23.4652606Z Written: 
gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:23.4653243Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp 2025-05-07T19:50:23.4653837Z Written: gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu 2025-05-07T19:50:23.4654504Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:23.4655156Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:50:23.4655797Z Written: gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu 2025-05-07T19:50:23.4656466Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:23.4657124Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:50:23.5782446Z Written: gen_embedding_optimizer_partial_rowwise_adam_split_device_kernel.cuh 2025-05-07T19:50:23.5784133Z Written: gen_embedding_backward_split_partial_rowwise_adam.cpp 2025-05-07T19:50:23.5785747Z Written: gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp 2025-05-07T19:50:23.5787170Z Written: gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:50:23.5787711Z Written: lookup_partial_rowwise_adam.py 2025-05-07T19:50:23.5788171Z Written: gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp 2025-05-07T19:50:23.5789034Z Written: gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:50:23.5789657Z Written: gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu 2025-05-07T19:50:23.5790320Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:23.5790966Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu 2025-05-07T19:50:23.5791614Z Written: gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp 
2025-05-07T19:50:23.5792254Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:23.5793165Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp 2025-05-07T19:50:23.5793820Z Written: gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:50:23.5794597Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:23.5795276Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:50:23.5796104Z Written: gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:50:23.5796813Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:23.5797526Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:50:23.5798164Z Written: gen_embedding_optimizer_partial_rowwise_lamb_split_device_kernel.cuh 2025-05-07T19:50:23.5798754Z Written: gen_embedding_backward_split_partial_rowwise_lamb.cpp 2025-05-07T19:50:23.5799262Z Written: gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp 2025-05-07T19:50:23.5799873Z Written: gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:50:23.5800362Z Written: lookup_partial_rowwise_lamb.py 2025-05-07T19:50:23.5800810Z Written: gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp 2025-05-07T19:50:23.5801402Z Written: gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:50:23.5801998Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu 2025-05-07T19:50:23.5802602Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu 2025-05-07T19:50:23.5803262Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu 2025-05-07T19:50:23.5803809Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu 
2025-05-07T19:50:23.5804358Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu 2025-05-07T19:50:23.5804961Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:23.5805569Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu 2025-05-07T19:50:23.5806133Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:23.5806718Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu 2025-05-07T19:50:23.5807260Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu 2025-05-07T19:50:23.5807845Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp 2025-05-07T19:50:23.5808408Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp 2025-05-07T19:50:23.5808987Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_meta.cpp 2025-05-07T19:50:23.5809551Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp 2025-05-07T19:50:23.5810115Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_meta.cpp 2025-05-07T19:50:23.5810747Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:23.5811333Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp 2025-05-07T19:50:23.5811940Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:23.5812503Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_meta.cpp 2025-05-07T19:50:23.5813200Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp 2025-05-07T19:50:23.5813798Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:23.5814393Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:23.5814997Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu 
2025-05-07T19:50:23.5815557Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:50:23.5816178Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:23.5816831Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:23.5817523Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:23.5818147Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:23.5818913Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu 2025-05-07T19:50:23.5819539Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:50:23.5820439Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:23.5821199Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:23.5821843Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu 2025-05-07T19:50:23.5822448Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:50:23.5823109Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:23.5823785Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:23.5824481Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:23.5825161Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:23.5825802Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu 2025-05-07T19:50:23.5826457Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:50:23.5827114Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu 
2025-05-07T19:50:23.5827799Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu 2025-05-07T19:50:23.5828476Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:50:23.5829185Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu 2025-05-07T19:50:23.5829895Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:50:23.5830560Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu 2025-05-07T19:50:23.5831262Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:50:23.5831948Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu 2025-05-07T19:50:23.5832718Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu 2025-05-07T19:50:23.5833310Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu 2025-05-07T19:50:23.5833884Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu 2025-05-07T19:50:23.5834492Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu 2025-05-07T19:50:23.5835043Z Written: gen_embedding_optimizer_rowwise_adagrad_ssd_device_kernel.cuh 2025-05-07T19:50:23.5835608Z Written: gen_embedding_optimizer_rowwise_adagrad_split_device_kernel.cuh 2025-05-07T19:50:23.5836093Z Written: gen_embedding_backward_ssd_rowwise_adagrad.cpp 2025-05-07T19:50:23.5836547Z Written: gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:23.5837074Z Written: gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:23.5837514Z Written: lookup_rowwise_adagrad_ssd.py 2025-05-07T19:50:23.5837990Z Written: gen_embedding_backward_split_rowwise_adagrad.cpp 2025-05-07T19:50:23.5838438Z Written: gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:23.5838968Z Written: 
gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:23.5839411Z Written: lookup_rowwise_adagrad.py 2025-05-07T19:50:23.5839809Z Written: gen_embedding_backward_rowwise_adagrad_split_cpu.cpp 2025-05-07T19:50:23.5840272Z Written: gen_embedding_backward_split_rowwise_adagrad_cpu.cpp 2025-05-07T19:50:23.5840803Z Written: gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:23.5841665Z Written: gen_embedding_optimizer_approx_rowwise_adagrad_split_device_kernel.cuh 2025-05-07T19:50:23.5842235Z Written: gen_embedding_backward_split_approx_rowwise_adagrad.cpp 2025-05-07T19:50:23.5842777Z Written: gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:23.5843370Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:23.5843983Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp 2025-05-07T19:50:23.5844563Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:23.5845260Z Written: gen_embedding_optimizer_rowwise_adagrad_with_weight_decay_split_device_kernel.cuh 2025-05-07T19:50:23.5845940Z Written: gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:50:23.5846537Z Written: gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:50:23.5847231Z Written: gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:50:23.5847907Z Written: gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:50:23.5848597Z Written: gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:50:23.5849366Z Written: gen_embedding_optimizer_approx_rowwise_adagrad_with_weight_decay_split_device_kernel.cuh 2025-05-07T19:50:23.5850074Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:50:23.5850760Z Written: 
gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:50:23.5851484Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:50:23.7143441Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:50:23.7145698Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:50:23.7147664Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu 2025-05-07T19:50:23.7148474Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu 2025-05-07T19:50:23.7149179Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:23.7149980Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:23.7150665Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu 2025-05-07T19:50:23.7151316Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp 2025-05-07T19:50:23.7151980Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp 2025-05-07T19:50:23.7152665Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:23.7153347Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:23.7154034Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp 2025-05-07T19:50:23.7154707Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:23.7155447Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu 2025-05-07T19:50:23.7156372Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu 
2025-05-07T19:50:23.7157141Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:23.7157844Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu 2025-05-07T19:50:23.7158574Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:23.7159268Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu 2025-05-07T19:50:23.7160083Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:23.7160847Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:23.7161553Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu 2025-05-07T19:50:23.7162257Z Written: gen_embedding_optimizer_rowwise_adagrad_with_counter_split_device_kernel.cuh 2025-05-07T19:50:23.7162852Z Written: gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp 2025-05-07T19:50:23.7163425Z Written: gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:50:23.7164060Z Written: gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:50:23.7164591Z Written: lookup_rowwise_adagrad_with_counter.py 2025-05-07T19:50:23.7165094Z Written: gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:50:23.7165704Z Written: gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:50:23.7166410Z Written: gen_embedding_optimizer_approx_rowwise_adagrad_with_counter_split_device_kernel.cuh 2025-05-07T19:50:23.7167052Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp 2025-05-07T19:50:23.7167668Z Written: gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:50:23.7168355Z Written: 
gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:50:23.7169013Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:50:23.7169688Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:50:23.7170341Z Written: gen_embedding_optimizer_rowwise_weighted_adagrad_split_device_kernel.cuh 2025-05-07T19:50:23.7170924Z Written: gen_embedding_backward_split_rowwise_weighted_adagrad.cpp 2025-05-07T19:50:23.7171463Z Written: gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp 2025-05-07T19:50:23.7172030Z Written: gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:23.7172630Z Written: gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp 2025-05-07T19:50:23.7173201Z Written: gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:23.7173771Z Written: gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu 2025-05-07T19:50:23.7174221Z Written: gen_embedding_backward_sgd_split_weighted_cuda.cu 2025-05-07T19:50:23.7174705Z Written: gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:23.7175207Z Written: gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:23.7175673Z Written: gen_embedding_backward_sgd_split_unweighted_cuda.cu 2025-05-07T19:50:23.7176158Z Written: gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp 2025-05-07T19:50:23.7176614Z Written: gen_embedding_backward_sgd_split_weighted_meta.cpp 2025-05-07T19:50:23.7177113Z Written: gen_embedding_backward_sgd_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:23.7177606Z Written: gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:23.7178105Z Written: gen_embedding_backward_sgd_split_unweighted_meta.cpp 2025-05-07T19:50:23.7178619Z Written: gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu 
2025-05-07T19:50:23.7179214Z Written: gen_embedding_backward_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:50:23.7179859Z Written: gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:23.7180611Z Written: gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:23.7181215Z Written: gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:50:23.7181760Z Written: gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:23.7182544Z Written: gen_embedding_backward_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:50:23.7183127Z Written: gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:23.7183831Z Written: gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:23.7184422Z Written: gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:50:23.7184945Z Written: gen_embedding_optimizer_sgd_split_device_kernel.cuh 2025-05-07T19:50:23.7185420Z Written: gen_embedding_backward_split_sgd.cpp 2025-05-07T19:50:23.7185828Z Written: gen_embedding_split_sgd_pt2_autograd.cpp 2025-05-07T19:50:23.7186324Z Written: gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:23.7186771Z Written: lookup_sgd.py 2025-05-07T19:50:23.7187083Z Written: gen_embedding_backward_sgd_split_cpu.cpp 2025-05-07T19:50:23.7187512Z Written: gen_embedding_backward_split_sgd_cpu.cpp 2025-05-07T19:50:23.7188058Z Written: gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:23.7188568Z Written: gen_embedding_optimizer_approx_sgd_split_device_kernel.cuh 2025-05-07T19:50:23.7189028Z Written: gen_embedding_backward_split_approx_sgd.cpp 2025-05-07T19:50:23.7189472Z Written: gen_embedding_split_approx_sgd_pt2_autograd.cpp 2025-05-07T19:50:23.7189942Z Written: gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:23.7190440Z Written: gen_embedding_backward_split_approx_sgd_cpu.cpp 
2025-05-07T19:50:23.7190934Z Written: gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:23.7191413Z Written: gen_embedding_backward_none_split_weighted_cuda.cu 2025-05-07T19:50:23.7191909Z Written: gen_embedding_backward_none_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:23.7192390Z Written: gen_embedding_backward_none_split_unweighted_cuda.cu 2025-05-07T19:50:23.7192869Z Written: gen_embedding_backward_none_split_weighted_meta.cpp 2025-05-07T19:50:23.7193347Z Written: gen_embedding_backward_none_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:23.7193857Z Written: gen_embedding_backward_none_split_unweighted_meta.cpp 2025-05-07T19:50:23.7194361Z Written: gen_embedding_backward_none_split_weighted_kernel_cta.cu 2025-05-07T19:50:23.7194879Z Written: gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:23.7195431Z Written: gen_embedding_backward_none_split_unweighted_kernel_cta.cu 2025-05-07T19:50:23.7195930Z Written: gen_embedding_backward_none_split_weighted_kernel_warp.cu 2025-05-07T19:50:23.7196479Z Written: gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:23.7197014Z Written: gen_embedding_backward_none_split_unweighted_kernel_warp.cu 2025-05-07T19:50:23.7197526Z Written: gen_embedding_optimizer_none_split_device_kernel.cuh 2025-05-07T19:50:23.7197966Z Written: gen_embedding_backward_split_none.cpp 2025-05-07T19:50:23.7198328Z Written: gen_embedding_split_none_pt2_autograd.cpp 2025-05-07T19:50:23.7198778Z Written: gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp 2025-05-07T19:50:23.7199163Z Written: lookup_none.py 2025-05-07T19:50:23.7199483Z Written: gen_embedding_backward_split_none_cpu.cpp 2025-05-07T19:50:23.7199909Z Written: gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp 2025-05-07T19:50:23.7200415Z Written: gen_embedding_backward_split_weighted_device_kernel_hip.hip 2025-05-07T19:50:23.7200979Z Written: 
gen_embedding_backward_split_unweighted_nobag_device_kernel_hip.hip 2025-05-07T19:50:23.7201531Z Written: gen_embedding_backward_split_unweighted_device_kernel_hip.hip 2025-05-07T19:50:23.7202162Z Written: gen_embedding_backward_ssd_weighted_vbe_device_kernel.cuh 2025-05-07T19:50:23.7202665Z Written: gen_embedding_backward_split_weighted_vbe_device_kernel.cuh 2025-05-07T19:50:23.7203181Z Written: gen_embedding_backward_ssd_weighted_device_kernel.cuh 2025-05-07T19:50:23.7203657Z Written: gen_embedding_backward_split_weighted_device_kernel.cuh 2025-05-07T19:50:23.7204196Z Written: gen_embedding_backward_ssd_unweighted_nobag_device_kernel.cuh 2025-05-07T19:50:23.7204760Z Written: gen_embedding_backward_split_unweighted_nobag_device_kernel.cuh 2025-05-07T19:50:23.7205296Z Written: gen_embedding_backward_ssd_unweighted_vbe_device_kernel.cuh 2025-05-07T19:50:23.7205898Z Written: gen_embedding_backward_split_unweighted_vbe_device_kernel.cuh 2025-05-07T19:50:23.7206398Z Written: gen_embedding_backward_ssd_unweighted_device_kernel.cuh 2025-05-07T19:50:23.7217113Z Written: gen_embedding_backward_split_unweighted_device_kernel.cuh 2025-05-07T19:50:23.7217681Z Written: gen_embedding_backward_split_common_device_kernel.cuh 2025-05-07T19:50:23.7218194Z Written: gen_embedding_backward_split_grad_embedding_ops.cu 2025-05-07T19:50:23.7218685Z Written: gen_embedding_backward_dense_indice_weights_codegen_cuda.cu 2025-05-07T19:50:23.7219221Z Written: gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu 2025-05-07T19:50:23.7219860Z Written: gen_embedding_backward_split_indice_weights_codegen_cuda.cu 2025-05-07T19:50:23.7220510Z Written: pt2_arg_utils.h 2025-05-07T19:50:23.7220825Z Written: __init__.py 2025-05-07T19:50:23.7221172Z Written: lookup_args_ssd.py 2025-05-07T19:50:23.7221492Z Written: lookup_args.py 2025-05-07T19:50:23.7240798Z 2025-05-07T19:50:23.7240887Z 2025-05-07T19:50:23.7241410Z ================================================================================ 
2025-05-07T19:50:23.7242567Z Running code generation script ... 2025-05-07T19:50:23.7244976Z /github/home/miniconda/envs/build_binary/bin/python /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_embedding_optimizer.py --opensource 2025-05-07T19:50:23.7246972Z ================================================================================ 2025-05-07T19:50:23.7247211Z 2025-05-07T19:50:23.8323359Z [ARGS PARSE] Parsed arguments: Namespace(install_dir='.', is_fbcode=False, is_rocm=False) 2025-05-07T19:50:23.8325980Z [GENERATE OPTIMIZERS]: ['/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_embedding_optimizer.py', '--opensource'] 2025-05-07T19:50:23.8328270Z Written: gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu 2025-05-07T19:50:23.8328787Z Written: gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu 2025-05-07T19:50:23.8329268Z Written: gen_embedding_optimizer_rowwise_adagrad_split.cpp 2025-05-07T19:50:23.8329792Z Written: gen_embedding_optimizer_rowwise_adagrad_split_device_kernel.cuh 2025-05-07T19:50:23.8330278Z Written: split_embedding_optimizer_rowwise_adagrad.py 2025-05-07T19:50:23.8330684Z Written: optimizer_args.py 2025-05-07T19:50:23.8394546Z 2025-05-07T19:50:23.8395118Z 2025-05-07T19:50:23.8395682Z ================================================================================ 2025-05-07T19:50:23.8396769Z Running code generation script ... 
2025-05-07T19:50:23.8399130Z /github/home/miniconda/envs/build_binary/bin/python /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_forward_quantized.py --opensource 2025-05-07T19:50:23.8400362Z ================================================================================ 2025-05-07T19:50:23.8400599Z 2025-05-07T19:50:23.9655125Z [ARGS PARSE] Parsed arguments: Namespace(install_dir='.', is_fbcode=False, is_rocm=False) 2025-05-07T19:50:23.9657814Z [GENERATE FORWARD QUANTIZED]: ['/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_forward_quantized.py', '--opensource'] 2025-05-07T19:50:23.9659728Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu 2025-05-07T19:50:23.9660701Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu 2025-05-07T19:50:23.9661663Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu 2025-05-07T19:50:23.9662378Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu 2025-05-07T19:50:23.9663104Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu 2025-05-07T19:50:23.9663830Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu 2025-05-07T19:50:23.9664570Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu 2025-05-07T19:50:23.9665373Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu 2025-05-07T19:50:23.9666229Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu 2025-05-07T19:50:23.9667030Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu 2025-05-07T19:50:23.9667807Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu 2025-05-07T19:50:23.9668600Z Written: 
gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu 2025-05-07T19:50:23.9669355Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu 2025-05-07T19:50:23.9670109Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu 2025-05-07T19:50:23.9670863Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu 2025-05-07T19:50:23.9671584Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu 2025-05-07T19:50:23.9672450Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu 2025-05-07T19:50:23.9673116Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu 2025-05-07T19:50:23.9673789Z Written: gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu 2025-05-07T19:50:23.9674460Z Written: gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu 2025-05-07T19:50:23.9675107Z Written: gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu 2025-05-07T19:50:23.9675696Z Written: gen_embedding_forward_quantized_weighted_codegen_cpu.cpp 2025-05-07T19:50:23.9676199Z Written: gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp 2025-05-07T19:50:23.9726525Z 2025-05-07T19:50:23.9726545Z 2025-05-07T19:50:23.9727302Z ================================================================================ 2025-05-07T19:50:23.9727749Z Running code generation script ... 
2025-05-07T19:50:23.9728676Z /github/home/miniconda/envs/build_binary/bin/python /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_forward_split.py --opensource 2025-05-07T19:50:23.9729492Z ================================================================================ 2025-05-07T19:50:23.9729731Z 2025-05-07T19:50:24.3776101Z [ARGS PARSE] Parsed arguments: Namespace(install_dir='.', is_fbcode=False, is_rocm=False) 2025-05-07T19:50:24.3778724Z [GENERATE FORWARD SPLIT]: ['/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_forward_split.py', '--opensource'] 2025-05-07T19:50:24.3780383Z Written: gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:24.3780927Z Written: gen_embedding_forward_dense_weighted_codegen_cuda.cu 2025-05-07T19:50:24.3781448Z Written: gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:24.3782311Z Written: gen_embedding_forward_dense_unweighted_codegen_cuda.cu 2025-05-07T19:50:24.3782824Z Written: gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:24.3783379Z Written: gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:24.3783905Z Written: gen_embedding_forward_ssd_weighted_codegen_cuda.cu 2025-05-07T19:50:24.3784391Z Written: gen_embedding_forward_split_weighted_codegen_cuda.cu 2025-05-07T19:50:24.3784926Z Written: gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:24.3785676Z Written: gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:24.3786238Z Written: gen_embedding_forward_ssd_unweighted_codegen_cuda.cu 2025-05-07T19:50:24.3786748Z Written: gen_embedding_forward_split_unweighted_codegen_cuda.cu 2025-05-07T19:50:24.3787315Z Written: gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu 2025-05-07T19:50:24.3788041Z Written: gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu 2025-05-07T19:50:24.3788677Z Written: gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu 
2025-05-07T19:50:24.3789226Z Written: gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu 2025-05-07T19:50:24.3789817Z Written: gen_embedding_forward_dense_weighted_vbe_codegen_meta.cpp 2025-05-07T19:50:24.3790328Z Written: gen_embedding_forward_dense_weighted_codegen_meta.cpp 2025-05-07T19:50:24.3790819Z Written: gen_embedding_forward_dense_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:50:24.3791346Z Written: gen_embedding_forward_dense_unweighted_codegen_meta.cpp 2025-05-07T19:50:24.3791856Z Written: gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp 2025-05-07T19:50:24.3792334Z Written: gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp 2025-05-07T19:50:24.3792830Z Written: gen_embedding_forward_ssd_weighted_codegen_meta.cpp 2025-05-07T19:50:24.3793287Z Written: gen_embedding_forward_split_weighted_codegen_meta.cpp 2025-05-07T19:50:24.3793791Z Written: gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:50:24.3794287Z Written: gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:50:24.3794798Z Written: gen_embedding_forward_ssd_unweighted_codegen_meta.cpp 2025-05-07T19:50:24.3795302Z Written: gen_embedding_forward_split_unweighted_codegen_meta.cpp 2025-05-07T19:50:24.3795760Z Written: gen_embedding_forward_dense_weighted_vbe_kernel.cu 2025-05-07T19:50:24.3796211Z Written: gen_embedding_forward_dense_weighted_kernel.cu 2025-05-07T19:50:24.3796650Z Written: gen_embedding_forward_dense_unweighted_nobag_kernel.cu 2025-05-07T19:50:24.3797139Z Written: gen_embedding_forward_dense_unweighted_vbe_kernel.cu 2025-05-07T19:50:24.3797578Z Written: gen_embedding_forward_dense_unweighted_kernel.cu 2025-05-07T19:50:24.3798021Z Written: gen_embedding_forward_ssd_weighted_vbe_kernel.cu 2025-05-07T19:50:24.3798471Z Written: gen_embedding_forward_split_weighted_vbe_kernel.cu 2025-05-07T19:50:24.3798888Z Written: gen_embedding_forward_ssd_weighted_kernel.cu 2025-05-07T19:50:24.3799500Z Written: 
gen_embedding_forward_split_weighted_kernel.cu 2025-05-07T19:50:24.3799951Z Written: gen_embedding_forward_ssd_unweighted_nobag_kernel.cu 2025-05-07T19:50:24.3800465Z Written: gen_embedding_forward_split_unweighted_nobag_kernel.cu 2025-05-07T19:50:24.3800940Z Written: gen_embedding_forward_ssd_unweighted_vbe_kernel.cu 2025-05-07T19:50:24.3801429Z Written: gen_embedding_forward_split_unweighted_vbe_kernel.cu 2025-05-07T19:50:24.3801901Z Written: gen_embedding_forward_ssd_unweighted_kernel.cu 2025-05-07T19:50:24.3802334Z Written: gen_embedding_forward_split_unweighted_kernel.cu 2025-05-07T19:50:24.3802832Z Written: gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu 2025-05-07T19:50:24.3803306Z Written: gen_embedding_forward_split_weighted_gwd_kernel.cu 2025-05-07T19:50:24.3803801Z Written: gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu 2025-05-07T19:50:24.3804285Z Written: gen_embedding_forward_split_unweighted_gwd_kernel.cu 2025-05-07T19:50:24.3804780Z Written: gen_embedding_forward_split_weighted_v2_kernel.cu 2025-05-07T19:50:24.3805240Z Written: gen_embedding_forward_split_unweighted_v2_kernel.cu 2025-05-07T19:50:24.3805778Z Written: gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:24.3806349Z Written: gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:24.3806882Z Written: gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:24.3807438Z Written: gen_embedding_forward_split_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:24.3807941Z Written: gen_embedding_forward_split_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.3808484Z Written: gen_embedding_forward_split_pt2_cpu_wrapper.cpp 2025-05-07T19:50:24.3808928Z Written: gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.3861828Z 2025-05-07T19:50:24.3862223Z 2025-05-07T19:50:24.3869607Z ================================================================================ 2025-05-07T19:50:24.3870036Z Running code 
generation script ... 2025-05-07T19:50:24.3870767Z /github/home/miniconda/envs/build_binary/bin/python /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_index_select.py --opensource 2025-05-07T19:50:24.3871709Z ================================================================================ 2025-05-07T19:50:24.3871945Z 2025-05-07T19:50:24.6786855Z [ARGS PARSE] Parsed arguments: Namespace(install_dir='.', is_fbcode=False, is_rocm=False) 2025-05-07T19:50:24.6788799Z [INDEX SELECT GENERATOR]: ['/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_index_select.py', '--opensource'] 2025-05-07T19:50:24.6789588Z Written: gen_batch_index_select_dim0_forward_codegen_cuda.cu 2025-05-07T19:50:24.6790205Z Written: gen_batch_index_select_dim0_forward_kernel.cu 2025-05-07T19:50:24.6790650Z Written: gen_batch_index_select_dim0_forward_kernel_small.cu 2025-05-07T19:50:24.6791247Z Written: gen_batch_index_select_dim0_backward_codegen_cuda.cu 2025-05-07T19:50:24.6791695Z Written: gen_batch_index_select_dim0_backward_kernel_cta.cu 2025-05-07T19:50:24.6792160Z Written: gen_batch_index_select_dim0_backward_kernel_warp.cu 2025-05-07T19:50:24.6792661Z Written: gen_embedding_backward_split_batch_index_select_device_kernel.cuh 2025-05-07T19:50:24.6793216Z Written: gen_embedding_backward_split_grad_index_select.cu 2025-05-07T19:50:24.6793709Z Written: gen_embedding_backward_split_common_device_kernel.cuh 2025-05-07T19:50:24.6867816Z -- Adding merge_pooled_embeddings sources 2025-05-07T19:50:24.6883715Z 2025-05-07T19:50:24.6883798Z 2025-05-07T19:50:24.6884399Z ================================================================================ 2025-05-07T19:50:24.6885668Z GPU CPP Library Target: fbgemm_gpu_tbe_cache (SHARED) 2025-05-07T19:50:24.6886761Z 2025-05-07T19:50:24.6887321Z CPU_SRCS: 2025-05-07T19:50:24.6888479Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cpp 2025-05-07T19:50:24.6890502Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cpp 2025-05-07T19:50:24.6892609Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cpp 2025-05-07T19:50:24.6893200Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cpp 2025-05-07T19:50:24.6893843Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cpp 2025-05-07T19:50:24.6894323Z 2025-05-07T19:50:24.6894539Z GPU_SRCS: 2025-05-07T19:50:24.6894888Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_find.cu 2025-05-07T19:50:24.6895489Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate.cu 2025-05-07T19:50:24.6896102Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cu 2025-05-07T19:50:24.6896757Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cu 2025-05-07T19:50:24.6897369Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_find.cu 2025-05-07T19:50:24.6897939Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate.cu 2025-05-07T19:50:24.6898570Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cu 2025-05-07T19:50:24.6899142Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cu 2025-05-07T19:50:24.6899860Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/reset_weight_momentum.cu 2025-05-07T19:50:24.6900811Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cu 2025-05-07T19:50:24.6901308Z 2025-05-07T19:50:24.6901542Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:24.6901696Z 2025-05-07T19:50:24.6901784Z 2025-05-07T19:50:24.6902251Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:24.6902407Z 2025-05-07T19:50:24.6902502Z 2025-05-07T19:50:24.6902720Z OTHER_SRCS: 2025-05-07T19:50:24.6902853Z 2025-05-07T19:50:24.6902942Z 2025-05-07T19:50:24.6903174Z CC_FLAGS: 2025-05-07T19:50:24.6903299Z 
2025-05-07T19:50:24.6903408Z 2025-05-07T19:50:24.6903592Z NVCC_FLAGS: 2025-05-07T19:50:24.6903861Z --expt-relaxed-constexpr 2025-05-07T19:50:24.6904151Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:24.6904474Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:24.6904788Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:24.6905088Z 2025-05-07T19:50:24.6905388Z HIPCC_FLAGS: 2025-05-07T19:50:24.6905559Z 2025-05-07T19:50:24.6905651Z 2025-05-07T19:50:24.6905858Z INCLUDE_DIRS: 2025-05-07T19:50:24.6906151Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:24.6906600Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:24.6906920Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:24.6907263Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:24.6907750Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include 2025-05-07T19:50:24.6908536Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:24.6909166Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:24.6909594Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:24.6910016Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:24.6910504Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:24.6911039Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:24.6911488Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:24.6912055Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include 2025-05-07T19:50:24.6912544Z 2025-05-07T19:50:24.6912773Z Selected Source Files: 2025-05-07T19:50:24.6913197Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cpp 2025-05-07T19:50:24.6913861Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cpp 2025-05-07T19:50:24.6914490Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cpp 2025-05-07T19:50:24.6915102Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cpp 2025-05-07T19:50:24.6915723Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cpp 2025-05-07T19:50:24.6916332Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_find.cu 2025-05-07T19:50:24.6916935Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate.cu 2025-05-07T19:50:24.6917542Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cu 2025-05-07T19:50:24.6918189Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cu 2025-05-07T19:50:24.6918799Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_find.cu 2025-05-07T19:50:24.6919373Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate.cu 2025-05-07T19:50:24.6919997Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cu 2025-05-07T19:50:24.6920565Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cu 2025-05-07T19:50:24.6921154Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/reset_weight_momentum.cu 2025-05-07T19:50:24.6921782Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cu 2025-05-07T19:50:24.6922261Z 2025-05-07T19:50:24.6922490Z HIPified Source Files: 2025-05-07T19:50:24.6922645Z 2025-05-07T19:50:24.6922725Z 2025-05-07T19:50:24.6922952Z Library Dependencies: 2025-05-07T19:50:24.6923187Z torch 2025-05-07T19:50:24.6923407Z torch_library 2025-05-07T19:50:24.6923829Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so 2025-05-07T19:50:24.6924565Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:24.6925242Z 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:24.6926030Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:24.6926776Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:24.6927356Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:24.6927776Z 2025-05-07T19:50:24.6927971Z Output Library: 2025-05-07T19:50:24.6928228Z fbgemm_gpu_tbe_cache 2025-05-07T19:50:24.6929576Z 2025-05-07T19:50:24.6929810Z Destination Directory: 2025-05-07T19:50:24.6930051Z fbgemm_gpu 2025-05-07T19:50:24.6930323Z ================================================================================ 2025-05-07T19:50:24.6930555Z 2025-05-07T19:50:24.7372123Z 2025-05-07T19:50:24.7372243Z 2025-05-07T19:50:24.7372880Z ================================================================================ 2025-05-07T19:50:24.7374145Z GPU CPP Library Target: fbgemm_gpu_tbe_inference (SHARED) 2025-05-07T19:50:24.7375207Z 2025-05-07T19:50:24.7375845Z CPU_SRCS: 2025-05-07T19:50:24.7376706Z codegen/inference/embedding_forward_quantized_host_cpu.cpp 2025-05-07T19:50:24.7378106Z gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp 2025-05-07T19:50:24.7379404Z gen_embedding_forward_quantized_weighted_codegen_cpu.cpp 2025-05-07T19:50:24.7380687Z 2025-05-07T19:50:24.7381202Z GPU_SRCS: 2025-05-07T19:50:24.7381504Z codegen/inference/embedding_forward_quantized_host.cpp 2025-05-07T19:50:24.7382209Z codegen/inference/embedding_forward_quantized_split_lookup.cu 2025-05-07T19:50:24.7382789Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu 2025-05-07T19:50:24.7383450Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu 2025-05-07T19:50:24.7384074Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu 2025-05-07T19:50:24.7384722Z 
gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu 2025-05-07T19:50:24.7385368Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu 2025-05-07T19:50:24.7385990Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu 2025-05-07T19:50:24.7386661Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu 2025-05-07T19:50:24.7387346Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu 2025-05-07T19:50:24.7388135Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu 2025-05-07T19:50:24.7388761Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu 2025-05-07T19:50:24.7389405Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu 2025-05-07T19:50:24.7390053Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu 2025-05-07T19:50:24.7390662Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu 2025-05-07T19:50:24.7391276Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu 2025-05-07T19:50:24.7391862Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu 2025-05-07T19:50:24.7392471Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu 2025-05-07T19:50:24.7393076Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu 2025-05-07T19:50:24.7393663Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu 2025-05-07T19:50:24.7394246Z gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu 2025-05-07T19:50:24.7394799Z gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu 2025-05-07T19:50:24.7395388Z gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu 
2025-05-07T19:50:24.7395799Z 2025-05-07T19:50:24.7396223Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:24.7396375Z 2025-05-07T19:50:24.7396479Z 2025-05-07T19:50:24.7396658Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:24.7396786Z 2025-05-07T19:50:24.7396880Z 2025-05-07T19:50:24.7397051Z OTHER_SRCS: 2025-05-07T19:50:24.7397164Z 2025-05-07T19:50:24.7397263Z 2025-05-07T19:50:24.7397436Z CC_FLAGS: 2025-05-07T19:50:24.7397569Z 2025-05-07T19:50:24.7397644Z 2025-05-07T19:50:24.7397816Z NVCC_FLAGS: 2025-05-07T19:50:24.7398048Z --expt-relaxed-constexpr 2025-05-07T19:50:24.7398295Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:24.7398587Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:24.7398948Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:24.7399186Z 2025-05-07T19:50:24.7399381Z HIPCC_FLAGS: 2025-05-07T19:50:24.7399499Z 2025-05-07T19:50:24.7399571Z 2025-05-07T19:50:24.7399765Z INCLUDE_DIRS: 2025-05-07T19:50:24.7399982Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:24.7400303Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:24.7400568Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:24.7400876Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:24.7401338Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include 2025-05-07T19:50:24.7402086Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:24.7402704Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:24.7403086Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:24.7403492Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:24.7403929Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:24.7404423Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:24.7404844Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:24.7405367Z 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include 2025-05-07T19:50:24.7405839Z 2025-05-07T19:50:24.7406027Z Selected Source Files: 2025-05-07T19:50:24.7406345Z codegen/inference/embedding_forward_quantized_host_cpu.cpp 2025-05-07T19:50:24.7406768Z gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp 2025-05-07T19:50:24.7407191Z gen_embedding_forward_quantized_weighted_codegen_cpu.cpp 2025-05-07T19:50:24.7407592Z codegen/inference/embedding_forward_quantized_host.cpp 2025-05-07T19:50:24.7408031Z codegen/inference/embedding_forward_quantized_split_lookup.cu 2025-05-07T19:50:24.7408556Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu 2025-05-07T19:50:24.7409127Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu 2025-05-07T19:50:24.7409698Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu 2025-05-07T19:50:24.7410255Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu 2025-05-07T19:50:24.7410827Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu 2025-05-07T19:50:24.7411388Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu 2025-05-07T19:50:24.7411984Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu 2025-05-07T19:50:24.7412628Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu 2025-05-07T19:50:24.7413254Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu 2025-05-07T19:50:24.7413889Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu 2025-05-07T19:50:24.7414509Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu 2025-05-07T19:50:24.7415142Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu 2025-05-07T19:50:24.7415761Z 
gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu 2025-05-07T19:50:24.7416435Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu 2025-05-07T19:50:24.7417046Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu 2025-05-07T19:50:24.7417624Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu 2025-05-07T19:50:24.7418242Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu 2025-05-07T19:50:24.7418829Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu 2025-05-07T19:50:24.7419397Z gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu 2025-05-07T19:50:24.7420307Z gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu 2025-05-07T19:50:24.7421014Z gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu 2025-05-07T19:50:24.7421460Z 2025-05-07T19:50:24.7421670Z HIPified Source Files: 2025-05-07T19:50:24.7421841Z 2025-05-07T19:50:24.7421920Z 2025-05-07T19:50:24.7422116Z Library Dependencies: 2025-05-07T19:50:24.7422358Z torch 2025-05-07T19:50:24.7422543Z torch_library 2025-05-07T19:50:24.7422988Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so 2025-05-07T19:50:24.7423676Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:24.7424371Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:24.7425172Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:24.7425913Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:24.7426509Z asmjit 2025-05-07T19:50:24.7426680Z fbgemm 2025-05-07T19:50:24.7426865Z fbgemm_gpu_tbe_cache 2025-05-07T19:50:24.7427079Z fbgemm_gpu_config 
2025-05-07T19:50:24.7427424Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:24.7427819Z 2025-05-07T19:50:24.7428000Z Output Library: 2025-05-07T19:50:24.7428230Z fbgemm_gpu_tbe_inference 2025-05-07T19:50:24.7428458Z 2025-05-07T19:50:24.7428644Z Destination Directory: 2025-05-07T19:50:24.7428860Z fbgemm_gpu 2025-05-07T19:50:24.7429097Z ================================================================================ 2025-05-07T19:50:24.7429310Z 2025-05-07T19:50:24.9731019Z 2025-05-07T19:50:24.9731143Z 2025-05-07T19:50:24.9731698Z ================================================================================ 2025-05-07T19:50:24.9732933Z GPU CPP Library Target: fbgemm_gpu_config (SHARED) 2025-05-07T19:50:24.9733907Z 2025-05-07T19:50:24.9734416Z CPU_SRCS: 2025-05-07T19:50:24.9735076Z src/config/feature_gates.cpp 2025-05-07T19:50:24.9735768Z 2025-05-07T19:50:24.9736291Z GPU_SRCS: 2025-05-07T19:50:24.9736610Z 2025-05-07T19:50:24.9736843Z 2025-05-07T19:50:24.9737350Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:24.9737762Z 2025-05-07T19:50:24.9737987Z 2025-05-07T19:50:24.9738423Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:24.9738574Z 2025-05-07T19:50:24.9738654Z 2025-05-07T19:50:24.9738838Z OTHER_SRCS: 2025-05-07T19:50:24.9738988Z 2025-05-07T19:50:24.9739067Z 2025-05-07T19:50:24.9739246Z CC_FLAGS: 2025-05-07T19:50:24.9739385Z 2025-05-07T19:50:24.9739464Z 2025-05-07T19:50:24.9739840Z NVCC_FLAGS: 2025-05-07T19:50:24.9739973Z 2025-05-07T19:50:24.9740064Z 2025-05-07T19:50:24.9740261Z HIPCC_FLAGS: 2025-05-07T19:50:24.9740398Z 2025-05-07T19:50:24.9740472Z 2025-05-07T19:50:24.9740736Z INCLUDE_DIRS: 2025-05-07T19:50:24.9740964Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:24.9741294Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:24.9741576Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:24.9741903Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:24.9742408Z 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include 2025-05-07T19:50:24.9743236Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:24.9743903Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:24.9744652Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:24.9745117Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:24.9745603Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:24.9746171Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:24.9746652Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:24.9747237Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include 2025-05-07T19:50:24.9747764Z 2025-05-07T19:50:24.9748051Z Selected Source Files: 2025-05-07T19:50:24.9748341Z src/config/feature_gates.cpp 2025-05-07T19:50:24.9748590Z 2025-05-07T19:50:24.9748814Z HIPified Source Files: 2025-05-07T19:50:24.9748969Z 2025-05-07T19:50:24.9749043Z 2025-05-07T19:50:24.9749257Z Library Dependencies: 2025-05-07T19:50:24.9749575Z torch 2025-05-07T19:50:24.9749762Z torch_library 2025-05-07T19:50:24.9750222Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so 2025-05-07T19:50:24.9750918Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:24.9751629Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:24.9752437Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:24.9753186Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:24.9753894Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:24.9754302Z 
2025-05-07T19:50:24.9754474Z Output Library: 2025-05-07T19:50:24.9754692Z fbgemm_gpu_config 2025-05-07T19:50:24.9755070Z 2025-05-07T19:50:24.9755264Z Destination Directory: 2025-05-07T19:50:24.9755494Z fbgemm_gpu 2025-05-07T19:50:24.9755727Z ================================================================================ 2025-05-07T19:50:24.9755958Z 2025-05-07T19:50:24.9755963Z 2025-05-07T19:50:24.9755970Z 2025-05-07T19:50:24.9756097Z ================================================================================ 2025-05-07T19:50:24.9756469Z GPU CPP Library Target: fbgemm_gpu_tbe_utils (SHARED) 2025-05-07T19:50:24.9756813Z 2025-05-07T19:50:24.9756987Z CPU_SRCS: 2025-05-07T19:50:24.9757290Z src/split_embeddings_utils/split_embeddings_utils_cpu.cpp 2025-05-07T19:50:24.9757741Z src/split_embeddings_utils/split_embeddings_utils_meta.cpp 2025-05-07T19:50:24.9758117Z 2025-05-07T19:50:24.9758303Z GPU_SRCS: 2025-05-07T19:50:24.9758580Z src/split_embeddings_utils/split_embeddings_utils.cpp 2025-05-07T19:50:24.9758995Z src/split_embeddings_utils/generate_vbe_metadata.cu 2025-05-07T19:50:24.9759383Z src/split_embeddings_utils/get_infos_metadata.cu 2025-05-07T19:50:24.9759760Z src/split_embeddings_utils/radix_sort_pairs.cu 2025-05-07T19:50:24.9760152Z src/split_embeddings_utils/transpose_embedding_input.cu 2025-05-07T19:50:24.9760504Z 2025-05-07T19:50:24.9760693Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:24.9760850Z 2025-05-07T19:50:24.9760922Z 2025-05-07T19:50:24.9761111Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:24.9761264Z 2025-05-07T19:50:24.9761333Z 2025-05-07T19:50:24.9761525Z OTHER_SRCS: 2025-05-07T19:50:24.9761639Z 2025-05-07T19:50:24.9761719Z 2025-05-07T19:50:24.9761905Z CC_FLAGS: 2025-05-07T19:50:24.9762015Z 2025-05-07T19:50:24.9762089Z 2025-05-07T19:50:24.9762274Z NVCC_FLAGS: 2025-05-07T19:50:24.9762384Z 2025-05-07T19:50:24.9762457Z 2025-05-07T19:50:24.9762645Z HIPCC_FLAGS: 2025-05-07T19:50:24.9762769Z 2025-05-07T19:50:24.9762844Z 2025-05-07T19:50:24.9763046Z INCLUDE_DIRS: 
2025-05-07T19:50:24.9763283Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:24.9763607Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:24.9763888Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:24.9764185Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:24.9764677Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include 2025-05-07T19:50:24.9765520Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:24.9766194Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:24.9766592Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:24.9767019Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:24.9767496Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:24.9768021Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:24.9768554Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:24.9769108Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include 2025-05-07T19:50:24.9769629Z 2025-05-07T19:50:24.9769818Z Selected Source Files: 2025-05-07T19:50:24.9770171Z src/split_embeddings_utils/split_embeddings_utils_cpu.cpp 2025-05-07T19:50:24.9770622Z src/split_embeddings_utils/split_embeddings_utils_meta.cpp 2025-05-07T19:50:24.9771099Z src/split_embeddings_utils/split_embeddings_utils.cpp 2025-05-07T19:50:24.9771538Z src/split_embeddings_utils/generate_vbe_metadata.cu 2025-05-07T19:50:24.9771929Z src/split_embeddings_utils/get_infos_metadata.cu 2025-05-07T19:50:24.9772317Z src/split_embeddings_utils/radix_sort_pairs.cu 2025-05-07T19:50:24.9772704Z src/split_embeddings_utils/transpose_embedding_input.cu 2025-05-07T19:50:24.9773061Z 2025-05-07T19:50:24.9773255Z HIPified Source Files: 2025-05-07T19:50:24.9773432Z 2025-05-07T19:50:24.9773508Z 2025-05-07T19:50:24.9773691Z 
Library Dependencies: 2025-05-07T19:50:24.9773940Z torch 2025-05-07T19:50:24.9774127Z torch_library 2025-05-07T19:50:24.9774566Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so 2025-05-07T19:50:24.9775263Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:24.9775952Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:24.9776769Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:24.9777510Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:24.9778138Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:24.9778562Z 2025-05-07T19:50:24.9778749Z Output Library: 2025-05-07T19:50:24.9778989Z fbgemm_gpu_tbe_utils 2025-05-07T19:50:24.9779202Z 2025-05-07T19:50:24.9779410Z Destination Directory: 2025-05-07T19:50:24.9779759Z fbgemm_gpu 2025-05-07T19:50:24.9780006Z ================================================================================ 2025-05-07T19:50:24.9780237Z 2025-05-07T19:50:24.9780241Z 2025-05-07T19:50:24.9780246Z 2025-05-07T19:50:24.9780354Z ================================================================================ 2025-05-07T19:50:24.9780787Z GPU CPP Library Target: fbgemm_gpu_sparse_async_cumsum (SHARED) 2025-05-07T19:50:24.9781182Z 2025-05-07T19:50:24.9781355Z CPU_SRCS: 2025-05-07T19:50:24.9781596Z src/sparse_ops/sparse_async_cumsum.cpp 2025-05-07T19:50:24.9781869Z 2025-05-07T19:50:24.9782284Z GPU_SRCS: 2025-05-07T19:50:24.9782491Z src/sparse_ops/sparse_async_cumsum.cu 2025-05-07T19:50:24.9782775Z 2025-05-07T19:50:24.9782949Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:24.9783097Z 2025-05-07T19:50:24.9783169Z 2025-05-07T19:50:24.9783349Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:24.9783508Z 2025-05-07T19:50:24.9783580Z 2025-05-07T19:50:24.9783802Z OTHER_SRCS: 
2025-05-07T19:50:24.9783928Z 2025-05-07T19:50:24.9784016Z 2025-05-07T19:50:24.9784247Z CC_FLAGS: 2025-05-07T19:50:24.9784369Z 2025-05-07T19:50:24.9784458Z 2025-05-07T19:50:24.9784680Z NVCC_FLAGS: 2025-05-07T19:50:24.9784810Z 2025-05-07T19:50:24.9784900Z 2025-05-07T19:50:24.9785132Z HIPCC_FLAGS: 2025-05-07T19:50:24.9785269Z 2025-05-07T19:50:24.9785357Z 2025-05-07T19:50:24.9785599Z INCLUDE_DIRS: 2025-05-07T19:50:24.9785861Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:24.9786367Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:24.9786718Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:24.9787053Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:24.9787602Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include 2025-05-07T19:50:24.9788422Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:24.9789135Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:24.9789571Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:24.9790138Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:24.9790669Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:24.9791217Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:24.9791735Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:24.9792324Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include 2025-05-07T19:50:24.9792882Z 2025-05-07T19:50:24.9793100Z Selected Source Files: 2025-05-07T19:50:24.9793421Z src/sparse_ops/sparse_async_cumsum.cpp 2025-05-07T19:50:24.9793761Z src/sparse_ops/sparse_async_cumsum.cu 2025-05-07T19:50:24.9794089Z 2025-05-07T19:50:24.9794303Z HIPified Source Files: 2025-05-07T19:50:24.9794495Z 2025-05-07T19:50:24.9794583Z 2025-05-07T19:50:24.9794815Z Library Dependencies: 2025-05-07T19:50:24.9795061Z 
torch 2025-05-07T19:50:24.9795292Z torch_library 2025-05-07T19:50:24.9795737Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so 2025-05-07T19:50:24.9796458Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:24.9797174Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:24.9798011Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:24.9798797Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:24.9799291Z fbgemm_gpu_tbe_utils 2025-05-07T19:50:24.9799700Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:24.9800125Z 2025-05-07T19:50:24.9800357Z Output Library: 2025-05-07T19:50:24.9800615Z fbgemm_gpu_sparse_async_cumsum 2025-05-07T19:50:24.9800915Z 2025-05-07T19:50:24.9801133Z Destination Directory: 2025-05-07T19:50:24.9801424Z fbgemm_gpu 2025-05-07T19:50:24.9801677Z ================================================================================ 2025-05-07T19:50:24.9801961Z 2025-05-07T19:50:24.9802079Z 2025-05-07T19:50:24.9802083Z 2025-05-07T19:50:24.9802205Z ================================================================================ 2025-05-07T19:50:24.9802633Z GPU CPP Library Target: fbgemm_gpu_tbe_common (SHARED) 2025-05-07T19:50:24.9802991Z 2025-05-07T19:50:24.9803222Z CPU_SRCS: 2025-05-07T19:50:24.9803499Z codegen/utils/embedding_bounds_check_host_cpu.cpp 2025-05-07T19:50:24.9803974Z codegen/training/forward/embedding_forward_split_cpu.cpp 2025-05-07T19:50:24.9804490Z codegen/training/pt2/pt2_autograd_utils.cpp 2025-05-07T19:50:24.9804812Z 2025-05-07T19:50:24.9805045Z GPU_SRCS: 2025-05-07T19:50:24.9805296Z codegen/utils/embedding_bounds_check_v1.cu 2025-05-07T19:50:24.9805684Z codegen/utils/embedding_bounds_check_v2.cu 2025-05-07T19:50:24.9806059Z 
codegen/utils/embedding_bounds_check_host.cpp 2025-05-07T19:50:24.9806415Z 2025-05-07T19:50:24.9806632Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:24.9806813Z 2025-05-07T19:50:24.9806899Z 2025-05-07T19:50:24.9807105Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:24.9807281Z 2025-05-07T19:50:24.9807366Z 2025-05-07T19:50:24.9807595Z OTHER_SRCS: 2025-05-07T19:50:24.9807720Z 2025-05-07T19:50:24.9807807Z 2025-05-07T19:50:24.9808035Z CC_FLAGS: 2025-05-07T19:50:24.9808160Z 2025-05-07T19:50:24.9808247Z 2025-05-07T19:50:24.9808471Z NVCC_FLAGS: 2025-05-07T19:50:24.9808715Z --expt-relaxed-constexpr 2025-05-07T19:50:24.9809137Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:24.9809445Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:24.9809795Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:24.9810068Z 2025-05-07T19:50:24.9810306Z HIPCC_FLAGS: 2025-05-07T19:50:24.9810446Z 2025-05-07T19:50:24.9810567Z 2025-05-07T19:50:24.9810770Z INCLUDE_DIRS: 2025-05-07T19:50:24.9811047Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:24.9811378Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:24.9811700Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:24.9812025Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:24.9812624Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include 2025-05-07T19:50:24.9813431Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:24.9814122Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:24.9814583Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:24.9815032Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:24.9815550Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:24.9816092Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:24.9816596Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
2025-05-07T19:50:24.9817170Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include 2025-05-07T19:50:24.9817714Z 2025-05-07T19:50:24.9817932Z Selected Source Files: 2025-05-07T19:50:24.9818275Z codegen/utils/embedding_bounds_check_host_cpu.cpp 2025-05-07T19:50:24.9818755Z codegen/training/forward/embedding_forward_split_cpu.cpp 2025-05-07T19:50:24.9819192Z codegen/training/pt2/pt2_autograd_utils.cpp 2025-05-07T19:50:24.9819707Z codegen/utils/embedding_bounds_check_host.cpp 2025-05-07T19:50:24.9820101Z codegen/utils/embedding_bounds_check_v1.cu 2025-05-07T19:50:24.9820496Z codegen/utils/embedding_bounds_check_v2.cu 2025-05-07T19:50:24.9820813Z 2025-05-07T19:50:24.9821058Z HIPified Source Files: 2025-05-07T19:50:24.9821221Z 2025-05-07T19:50:24.9821309Z 2025-05-07T19:50:24.9821552Z Library Dependencies: 2025-05-07T19:50:24.9821808Z torch 2025-05-07T19:50:24.9822044Z torch_library 2025-05-07T19:50:24.9822487Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so 2025-05-07T19:50:24.9823168Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:24.9823900Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:24.9824704Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:24.9825460Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:24.9825930Z fbgemm 2025-05-07T19:50:24.9826145Z fbgemm_gpu_config 2025-05-07T19:50:24.9826520Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:24.9826926Z 2025-05-07T19:50:24.9827135Z Output Library: 2025-05-07T19:50:24.9827361Z fbgemm_gpu_tbe_common 2025-05-07T19:50:24.9827597Z 2025-05-07T19:50:24.9827795Z Destination Directory: 2025-05-07T19:50:24.9828059Z fbgemm_gpu 2025-05-07T19:50:24.9828293Z 
================================================================================ 2025-05-07T19:50:24.9828544Z 2025-05-07T19:50:24.9828548Z 2025-05-07T19:50:24.9828552Z 2025-05-07T19:50:24.9828669Z ================================================================================ 2025-05-07T19:50:24.9829104Z GPU CPP Library Target: fbgemm_gpu_tbe_optimizers (SHARED) 2025-05-07T19:50:24.9829464Z 2025-05-07T19:50:24.9829654Z CPU_SRCS: 2025-05-07T19:50:24.9829771Z 2025-05-07T19:50:24.9829849Z 2025-05-07T19:50:24.9830051Z GPU_SRCS: 2025-05-07T19:50:24.9830328Z gen_embedding_optimizer_rowwise_adagrad_split.cpp 2025-05-07T19:50:24.9830738Z gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu 2025-05-07T19:50:24.9831264Z gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu 2025-05-07T19:50:24.9831623Z 2025-05-07T19:50:24.9831834Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:24.9831969Z 2025-05-07T19:50:24.9832044Z 2025-05-07T19:50:24.9832370Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:24.9832509Z 2025-05-07T19:50:24.9832586Z 2025-05-07T19:50:24.9832772Z OTHER_SRCS: 2025-05-07T19:50:24.9832886Z 2025-05-07T19:50:24.9832967Z 2025-05-07T19:50:24.9833151Z CC_FLAGS: 2025-05-07T19:50:24.9833258Z 2025-05-07T19:50:24.9833339Z 2025-05-07T19:50:24.9833527Z NVCC_FLAGS: 2025-05-07T19:50:24.9833799Z --expt-relaxed-constexpr 2025-05-07T19:50:24.9834072Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:24.9834370Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:24.9834667Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:24.9834919Z 2025-05-07T19:50:24.9835102Z HIPCC_FLAGS: 2025-05-07T19:50:24.9835230Z 2025-05-07T19:50:24.9835323Z 2025-05-07T19:50:24.9835492Z INCLUDE_DIRS: 2025-05-07T19:50:24.9835742Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:24.9836043Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:24.9836325Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:24.9836619Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:24.9837111Z 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include 2025-05-07T19:50:24.9837899Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:24.9838529Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:24.9838945Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:24.9839375Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:24.9839860Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:24.9840373Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:24.9840835Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:24.9841399Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include 2025-05-07T19:50:24.9842081Z 2025-05-07T19:50:24.9842294Z Selected Source Files: 2025-05-07T19:50:24.9842592Z gen_embedding_optimizer_rowwise_adagrad_split.cpp 2025-05-07T19:50:24.9843002Z gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu 2025-05-07T19:50:24.9843432Z gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu 2025-05-07T19:50:24.9843785Z 2025-05-07T19:50:24.9843982Z HIPified Source Files: 2025-05-07T19:50:24.9844151Z 2025-05-07T19:50:24.9844226Z 2025-05-07T19:50:24.9844426Z Library Dependencies: 2025-05-07T19:50:24.9844649Z torch 2025-05-07T19:50:24.9844857Z torch_library 2025-05-07T19:50:24.9845276Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so 2025-05-07T19:50:24.9845953Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:24.9846658Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:24.9847466Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:24.9848224Z 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:24.9848831Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:24.9849236Z 2025-05-07T19:50:24.9849415Z Output Library: 2025-05-07T19:50:24.9849647Z fbgemm_gpu_tbe_optimizers 2025-05-07T19:50:24.9849885Z 2025-05-07T19:50:24.9850086Z Destination Directory: 2025-05-07T19:50:24.9850323Z fbgemm_gpu 2025-05-07T19:50:24.9850567Z ================================================================================ 2025-05-07T19:50:24.9850794Z 2025-05-07T19:50:24.9850798Z 2025-05-07T19:50:24.9850802Z 2025-05-07T19:50:24.9850928Z ================================================================================ 2025-05-07T19:50:24.9851346Z GPU CPP Library Target: fbgemm_gpu_tbe_training_forward (SHARED) 2025-05-07T19:50:24.9851798Z 2025-05-07T19:50:24.9851976Z CPU_SRCS: 2025-05-07T19:50:24.9852242Z gen_embedding_forward_split_pt2_cpu_wrapper.cpp 2025-05-07T19:50:24.9852556Z 2025-05-07T19:50:24.9852753Z GPU_SRCS: 2025-05-07T19:50:24.9852990Z gen_embedding_forward_split_weighted_kernel.cu 2025-05-07T19:50:24.9853378Z gen_embedding_forward_dense_weighted_kernel.cu 2025-05-07T19:50:24.9853738Z gen_embedding_forward_ssd_weighted_kernel.cu 2025-05-07T19:50:24.9854134Z gen_embedding_forward_split_unweighted_nobag_kernel.cu 2025-05-07T19:50:24.9854571Z gen_embedding_forward_dense_unweighted_nobag_kernel.cu 2025-05-07T19:50:24.9855036Z gen_embedding_forward_ssd_unweighted_nobag_kernel.cu 2025-05-07T19:50:24.9855439Z gen_embedding_forward_split_unweighted_kernel.cu 2025-05-07T19:50:24.9855814Z gen_embedding_forward_dense_unweighted_kernel.cu 2025-05-07T19:50:24.9856199Z gen_embedding_forward_ssd_unweighted_kernel.cu 2025-05-07T19:50:24.9856583Z gen_embedding_forward_split_weighted_codegen_cuda.cu 2025-05-07T19:50:24.9857008Z gen_embedding_forward_split_weighted_codegen_meta.cpp 2025-05-07T19:50:24.9857441Z gen_embedding_forward_split_unweighted_codegen_cuda.cu 
2025-05-07T19:50:24.9857859Z gen_embedding_forward_split_unweighted_codegen_meta.cpp 2025-05-07T19:50:24.9858280Z gen_embedding_forward_dense_weighted_codegen_cuda.cu 2025-05-07T19:50:24.9858687Z gen_embedding_forward_dense_weighted_codegen_meta.cpp 2025-05-07T19:50:24.9859115Z gen_embedding_forward_dense_unweighted_codegen_cuda.cu 2025-05-07T19:50:24.9859534Z gen_embedding_forward_dense_unweighted_codegen_meta.cpp 2025-05-07T19:50:24.9860073Z gen_embedding_forward_ssd_weighted_codegen_cuda.cu 2025-05-07T19:50:24.9860467Z gen_embedding_forward_ssd_weighted_codegen_meta.cpp 2025-05-07T19:50:24.9860890Z gen_embedding_forward_ssd_unweighted_codegen_cuda.cu 2025-05-07T19:50:24.9861309Z gen_embedding_forward_ssd_unweighted_codegen_meta.cpp 2025-05-07T19:50:24.9861736Z gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:24.9862190Z gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp 2025-05-07T19:50:24.9862619Z gen_embedding_forward_split_weighted_vbe_kernel.cu 2025-05-07T19:50:24.9863018Z gen_embedding_forward_split_weighted_v2_kernel.cu 2025-05-07T19:50:24.9863425Z gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu 2025-05-07T19:50:24.9863877Z gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu 2025-05-07T19:50:24.9864322Z gen_embedding_forward_split_weighted_gwd_kernel.cu 2025-05-07T19:50:24.9864737Z gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:24.9865152Z gen_embedding_forward_dense_weighted_vbe_kernel.cu 2025-05-07T19:50:24.9865550Z gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu 2025-05-07T19:50:24.9865971Z gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:24.9866390Z gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp 2025-05-07T19:50:24.9866809Z gen_embedding_forward_ssd_weighted_vbe_kernel.cu 2025-05-07T19:50:24.9867215Z gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:24.9867686Z 
gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:50:24.9868133Z gen_embedding_forward_split_unweighted_vbe_kernel.cu 2025-05-07T19:50:24.9868707Z gen_embedding_forward_split_unweighted_v2_kernel.cu 2025-05-07T19:50:24.9869133Z gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu 2025-05-07T19:50:24.9869585Z gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu 2025-05-07T19:50:24.9870048Z gen_embedding_forward_split_unweighted_gwd_kernel.cu 2025-05-07T19:50:24.9870465Z gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:24.9870901Z gen_embedding_forward_dense_unweighted_vbe_kernel.cu 2025-05-07T19:50:24.9871330Z gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu 2025-05-07T19:50:24.9871755Z gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:24.9872208Z gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:50:24.9872680Z gen_embedding_forward_ssd_unweighted_vbe_kernel.cu 2025-05-07T19:50:24.9873119Z gen_embedding_forward_split_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:24.9873581Z gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:24.9874052Z gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:24.9874464Z gen_embedding_forward_split_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.9874852Z gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.9875173Z 2025-05-07T19:50:24.9875358Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:24.9875497Z 2025-05-07T19:50:24.9875585Z 2025-05-07T19:50:24.9875869Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:24.9876031Z 2025-05-07T19:50:24.9876106Z 2025-05-07T19:50:24.9876282Z OTHER_SRCS: 2025-05-07T19:50:24.9876420Z 2025-05-07T19:50:24.9876495Z 2025-05-07T19:50:24.9876682Z CC_FLAGS: 2025-05-07T19:50:24.9876816Z 2025-05-07T19:50:24.9876890Z 2025-05-07T19:50:24.9877086Z NVCC_FLAGS: 2025-05-07T19:50:24.9877301Z --expt-relaxed-constexpr 2025-05-07T19:50:24.9877579Z 
-D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:24.9877853Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:24.9878158Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:24.9878413Z 2025-05-07T19:50:24.9878601Z HIPCC_FLAGS: 2025-05-07T19:50:24.9878725Z 2025-05-07T19:50:24.9878801Z 2025-05-07T19:50:24.9878984Z INCLUDE_DIRS: 2025-05-07T19:50:24.9879210Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:24.9879532Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:24.9879808Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:24.9880127Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:24.9880734Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include 2025-05-07T19:50:24.9881500Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:24.9882311Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:24.9882889Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:24.9883329Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:24.9883810Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:24.9884358Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:24.9884833Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:24.9885393Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include 2025-05-07T19:50:24.9885907Z 2025-05-07T19:50:24.9886095Z Selected Source Files: 2025-05-07T19:50:24.9886385Z gen_embedding_forward_split_pt2_cpu_wrapper.cpp 2025-05-07T19:50:24.9886775Z gen_embedding_forward_split_weighted_codegen_meta.cpp 2025-05-07T19:50:24.9887194Z gen_embedding_forward_split_unweighted_codegen_meta.cpp 2025-05-07T19:50:24.9887609Z gen_embedding_forward_dense_weighted_codegen_meta.cpp 2025-05-07T19:50:24.9888046Z gen_embedding_forward_dense_unweighted_codegen_meta.cpp 
2025-05-07T19:50:24.9888471Z gen_embedding_forward_ssd_weighted_codegen_meta.cpp 2025-05-07T19:50:24.9888874Z gen_embedding_forward_ssd_unweighted_codegen_meta.cpp 2025-05-07T19:50:24.9889297Z gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp 2025-05-07T19:50:24.9889717Z gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp 2025-05-07T19:50:24.9890168Z gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:50:24.9890613Z gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:50:24.9891025Z gen_embedding_forward_split_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.9891414Z gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.9891769Z gen_embedding_forward_split_weighted_kernel.cu 2025-05-07T19:50:24.9892140Z gen_embedding_forward_dense_weighted_kernel.cu 2025-05-07T19:50:24.9892487Z gen_embedding_forward_ssd_weighted_kernel.cu 2025-05-07T19:50:24.9892871Z gen_embedding_forward_split_unweighted_nobag_kernel.cu 2025-05-07T19:50:24.9893291Z gen_embedding_forward_dense_unweighted_nobag_kernel.cu 2025-05-07T19:50:24.9893829Z gen_embedding_forward_ssd_unweighted_nobag_kernel.cu 2025-05-07T19:50:24.9894221Z gen_embedding_forward_split_unweighted_kernel.cu 2025-05-07T19:50:24.9894615Z gen_embedding_forward_dense_unweighted_kernel.cu 2025-05-07T19:50:24.9895013Z gen_embedding_forward_ssd_unweighted_kernel.cu 2025-05-07T19:50:24.9895408Z gen_embedding_forward_split_weighted_codegen_cuda.cu 2025-05-07T19:50:24.9895846Z gen_embedding_forward_split_unweighted_codegen_cuda.cu 2025-05-07T19:50:24.9896264Z gen_embedding_forward_dense_weighted_codegen_cuda.cu 2025-05-07T19:50:24.9896695Z gen_embedding_forward_dense_unweighted_codegen_cuda.cu 2025-05-07T19:50:24.9897176Z gen_embedding_forward_ssd_weighted_codegen_cuda.cu 2025-05-07T19:50:24.9897593Z gen_embedding_forward_ssd_unweighted_codegen_cuda.cu 2025-05-07T19:50:24.9898032Z gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:24.9898450Z 
gen_embedding_forward_split_weighted_vbe_kernel.cu 2025-05-07T19:50:24.9898856Z gen_embedding_forward_split_weighted_v2_kernel.cu 2025-05-07T19:50:24.9899272Z gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu 2025-05-07T19:50:24.9899812Z gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu 2025-05-07T19:50:24.9900240Z gen_embedding_forward_split_weighted_gwd_kernel.cu 2025-05-07T19:50:24.9900672Z gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:24.9901074Z gen_embedding_forward_dense_weighted_vbe_kernel.cu 2025-05-07T19:50:24.9901511Z gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu 2025-05-07T19:50:24.9901955Z gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:24.9902353Z gen_embedding_forward_ssd_weighted_vbe_kernel.cu 2025-05-07T19:50:24.9902778Z gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:24.9903211Z gen_embedding_forward_split_unweighted_vbe_kernel.cu 2025-05-07T19:50:24.9903640Z gen_embedding_forward_split_unweighted_v2_kernel.cu 2025-05-07T19:50:24.9904079Z gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu 2025-05-07T19:50:24.9904570Z gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu 2025-05-07T19:50:24.9905044Z gen_embedding_forward_split_unweighted_gwd_kernel.cu 2025-05-07T19:50:24.9905472Z gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:24.9905916Z gen_embedding_forward_dense_unweighted_vbe_kernel.cu 2025-05-07T19:50:24.9906339Z gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu 2025-05-07T19:50:24.9906778Z gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:24.9907189Z gen_embedding_forward_ssd_unweighted_vbe_kernel.cu 2025-05-07T19:50:24.9907629Z gen_embedding_forward_split_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:24.9908098Z gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:24.9908568Z 
gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:24.9908946Z 2025-05-07T19:50:24.9909132Z HIPified Source Files: 2025-05-07T19:50:24.9909295Z 2025-05-07T19:50:24.9909396Z 2025-05-07T19:50:24.9909588Z Library Dependencies: 2025-05-07T19:50:24.9909840Z torch 2025-05-07T19:50:24.9910033Z torch_library 2025-05-07T19:50:24.9910483Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so 2025-05-07T19:50:24.9911166Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:24.9911889Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:24.9912714Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:24.9913453Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:24.9913953Z fbgemm_gpu_tbe_common 2025-05-07T19:50:24.9914314Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:24.9914729Z 2025-05-07T19:50:24.9914914Z Output Library: 2025-05-07T19:50:24.9915159Z fbgemm_gpu_tbe_training_forward 2025-05-07T19:50:24.9915416Z 2025-05-07T19:50:24.9915717Z Destination Directory: 2025-05-07T19:50:24.9915993Z fbgemm_gpu 2025-05-07T19:50:24.9916224Z ================================================================================ 2025-05-07T19:50:24.9916465Z 2025-05-07T19:50:24.9916659Z 2025-05-07T19:50:24.9916674Z 2025-05-07T19:50:24.9916785Z ================================================================================ 2025-05-07T19:50:24.9917222Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward_pt2 (SHARED) 2025-05-07T19:50:24.9917629Z 2025-05-07T19:50:24.9917803Z CPU_SRCS: 2025-05-07T19:50:24.9918049Z gen_embedding_split_adagrad_pt2_autograd.cpp 2025-05-07T19:50:24.9918505Z gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:24.9918880Z 
gen_embedding_split_sgd_pt2_autograd.cpp 2025-05-07T19:50:24.9919220Z gen_embedding_split_adam_pt2_autograd.cpp 2025-05-07T19:50:24.9919561Z gen_embedding_split_lamb_pt2_autograd.cpp 2025-05-07T19:50:24.9919924Z gen_embedding_split_lars_sgd_pt2_autograd.cpp 2025-05-07T19:50:24.9920336Z gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp 2025-05-07T19:50:24.9920803Z gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp 2025-05-07T19:50:24.9921188Z gen_embedding_split_none_pt2_autograd.cpp 2025-05-07T19:50:24.9921628Z gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:50:24.9922087Z gen_embedding_split_approx_sgd_pt2_autograd.cpp 2025-05-07T19:50:24.9922506Z gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:24.9923035Z gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:50:24.9923631Z gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:50:24.9924325Z gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:50:24.9924853Z gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp 2025-05-07T19:50:24.9925308Z gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:24.9925749Z gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:24.9926223Z gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:24.9926700Z gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:24.9927122Z gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:50:24.9927550Z gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:50:24.9927981Z gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:24.9928484Z gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:50:24.9929054Z gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:50:24.9929540Z 
gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp 2025-05-07T19:50:24.9930073Z gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:50:24.9930615Z gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:24.9931135Z gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:24.9931762Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:50:24.9932461Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:50:24.9933154Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:50:24.9933772Z gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:24.9934208Z 2025-05-07T19:50:24.9934402Z GPU_SRCS: 2025-05-07T19:50:24.9934701Z gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.9935181Z gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.9935666Z gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.9936088Z gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.9936526Z gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.9936969Z gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.9949953Z gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.9950543Z gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.9951051Z gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.9951564Z gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.9952124Z gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.9952654Z gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp 
2025-05-07T19:50:24.9953341Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.9954044Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.9954720Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.9955346Z gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.9955890Z gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.9956272Z 2025-05-07T19:50:24.9956474Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:24.9956617Z 2025-05-07T19:50:24.9956688Z 2025-05-07T19:50:24.9956891Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:24.9957032Z 2025-05-07T19:50:24.9957107Z 2025-05-07T19:50:24.9957291Z OTHER_SRCS: 2025-05-07T19:50:24.9957406Z 2025-05-07T19:50:24.9957479Z 2025-05-07T19:50:24.9957661Z CC_FLAGS: 2025-05-07T19:50:24.9957769Z 2025-05-07T19:50:24.9957848Z 2025-05-07T19:50:24.9958044Z NVCC_FLAGS: 2025-05-07T19:50:24.9958271Z --expt-relaxed-constexpr 2025-05-07T19:50:24.9958534Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:24.9958831Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:24.9959120Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:24.9959378Z 2025-05-07T19:50:24.9959550Z HIPCC_FLAGS: 2025-05-07T19:50:24.9959677Z 2025-05-07T19:50:24.9959747Z 2025-05-07T19:50:24.9959918Z INCLUDE_DIRS: 2025-05-07T19:50:24.9960157Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:24.9960466Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:24.9960750Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:24.9961067Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:24.9961564Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include 2025-05-07T19:50:24.9962370Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:24.9963120Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:24.9963536Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:24.9963956Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:24.9964438Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:24.9964962Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:24.9965400Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:24.9965947Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include 2025-05-07T19:50:24.9966634Z 2025-05-07T19:50:24.9966835Z Selected Source Files: 2025-05-07T19:50:24.9967105Z gen_embedding_split_adagrad_pt2_autograd.cpp 2025-05-07T19:50:24.9967488Z gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:24.9967933Z gen_embedding_split_sgd_pt2_autograd.cpp 2025-05-07T19:50:24.9968269Z gen_embedding_split_adam_pt2_autograd.cpp 2025-05-07T19:50:24.9968605Z gen_embedding_split_lamb_pt2_autograd.cpp 2025-05-07T19:50:24.9968964Z gen_embedding_split_lars_sgd_pt2_autograd.cpp 2025-05-07T19:50:24.9969348Z gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp 2025-05-07T19:50:24.9969787Z gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp 2025-05-07T19:50:24.9970171Z gen_embedding_split_none_pt2_autograd.cpp 2025-05-07T19:50:24.9970587Z gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:50:24.9972369Z gen_embedding_split_approx_sgd_pt2_autograd.cpp 2025-05-07T19:50:24.9972802Z gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:24.9973316Z gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:50:24.9973898Z gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:50:24.9974663Z gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:50:24.9975190Z 
gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp 2025-05-07T19:50:24.9975621Z gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:24.9976132Z gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:24.9976599Z gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:24.9977062Z gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:24.9977459Z gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:50:24.9977874Z gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:50:24.9978299Z gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:24.9978784Z gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:50:24.9979336Z gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:50:24.9979943Z gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp 2025-05-07T19:50:24.9980457Z gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:50:24.9980992Z gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:24.9981517Z gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:24.9982307Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:50:24.9982985Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:50:24.9983667Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:50:24.9984270Z gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:24.9984802Z gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.9985268Z gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.9985742Z gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.9986164Z 
gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.9986576Z gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.9987013Z gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.9987507Z gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.9988060Z gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.9988542Z gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.9989051Z gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.9989605Z gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.9990110Z gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.9990737Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.9991423Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.9992107Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.9992716Z gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.9993271Z gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.9993669Z 2025-05-07T19:50:24.9993969Z HIPified Source Files: 2025-05-07T19:50:24.9994117Z 2025-05-07T19:50:24.9994202Z 2025-05-07T19:50:24.9994381Z Library Dependencies: 2025-05-07T19:50:24.9994607Z torch 2025-05-07T19:50:24.9994783Z torch_library 2025-05-07T19:50:24.9995344Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so 2025-05-07T19:50:24.9996001Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:24.9996695Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so 
2025-05-07T19:50:24.9997471Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:24.9998191Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:24.9998749Z fbgemm 2025-05-07T19:50:24.9998936Z fbgemm_gpu_config 2025-05-07T19:50:24.9999164Z fbgemm_gpu_tbe_cache 2025-05-07T19:50:24.9999382Z fbgemm_gpu_tbe_common 2025-05-07T19:50:24.9999611Z fbgemm_gpu_tbe_utils 2025-05-07T19:50:24.9999845Z fbgemm_gpu_sparse_async_cumsum 2025-05-07T19:50:25.0000237Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:25.0000633Z 2025-05-07T19:50:25.0000821Z Output Library: 2025-05-07T19:50:25.0001062Z fbgemm_gpu_tbe_training_backward_pt2 2025-05-07T19:50:25.0001323Z 2025-05-07T19:50:25.0001509Z Destination Directory: 2025-05-07T19:50:25.0001731Z fbgemm_gpu 2025-05-07T19:50:25.0001946Z ================================================================================ 2025-05-07T19:50:25.0002169Z 2025-05-07T19:50:25.0002173Z 2025-05-07T19:50:25.0002177Z 2025-05-07T19:50:25.0002277Z ================================================================================ 2025-05-07T19:50:25.0002690Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward (SHARED) 2025-05-07T19:50:25.0003069Z 2025-05-07T19:50:25.0003236Z CPU_SRCS: 2025-05-07T19:50:25.0003557Z codegen/training/backward/embedding_backward_dense_host_cpu.cpp 2025-05-07T19:50:25.0003975Z gen_embedding_backward_dense_split_cpu.cpp 2025-05-07T19:50:25.0004314Z gen_embedding_backward_split_adagrad_cpu.cpp 2025-05-07T19:50:25.0004670Z gen_embedding_backward_split_rowwise_adagrad_cpu.cpp 2025-05-07T19:50:25.0005035Z gen_embedding_backward_split_sgd_cpu.cpp 2025-05-07T19:50:25.0005349Z gen_embedding_backward_split_adam_cpu.cpp 2025-05-07T19:50:25.0005672Z gen_embedding_backward_split_lamb_cpu.cpp 2025-05-07T19:50:25.0006018Z gen_embedding_backward_split_lars_sgd_cpu.cpp 2025-05-07T19:50:25.0006406Z 
gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp 2025-05-07T19:50:25.0006838Z gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp 2025-05-07T19:50:25.0007202Z gen_embedding_backward_split_none_cpu.cpp 2025-05-07T19:50:25.0007609Z gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:50:25.0008032Z gen_embedding_backward_split_approx_sgd_cpu.cpp 2025-05-07T19:50:25.0008438Z gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp 2025-05-07T19:50:25.0008928Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:50:25.0009508Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:50:25.0010082Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:50:25.0010583Z gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp 2025-05-07T19:50:25.0011008Z gen_embedding_backward_adagrad_split_cpu.cpp 2025-05-07T19:50:25.0011375Z gen_embedding_backward_rowwise_adagrad_split_cpu.cpp 2025-05-07T19:50:25.0011749Z gen_embedding_backward_sgd_split_cpu.cpp 2025-05-07T19:50:25.0012030Z 2025-05-07T19:50:25.0012224Z GPU_SRCS: 2025-05-07T19:50:25.0012463Z gen_embedding_backward_split_grad_embedding_ops.cu 2025-05-07T19:50:25.0012882Z gen_embedding_backward_split_indice_weights_codegen_cuda.cu 2025-05-07T19:50:25.0013336Z gen_embedding_backward_dense_indice_weights_codegen_cuda.cu 2025-05-07T19:50:25.0013768Z gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu 2025-05-07T19:50:25.0014211Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu 2025-05-07T19:50:25.0014668Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu 2025-05-07T19:50:25.0015229Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu 2025-05-07T19:50:25.0015728Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu 2025-05-07T19:50:25.0016276Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu 
2025-05-07T19:50:25.0016837Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:25.0017345Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu 2025-05-07T19:50:25.0017842Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu 2025-05-07T19:50:25.0018351Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu 2025-05-07T19:50:25.0018891Z gen_embedding_backward_adagrad_split_weighted_cuda.cu 2025-05-07T19:50:25.0019315Z gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:50:25.0019883Z gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:50:25.0020533Z gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:25.0021036Z gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:25.0021582Z gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:25.0022059Z gen_embedding_backward_adagrad_split_unweighted_cuda.cu 2025-05-07T19:50:25.0022510Z gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:50:25.0022986Z gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:50:25.0023471Z gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu 2025-05-07T19:50:25.0023980Z gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:50:25.0024516Z gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:50:25.0025062Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:25.0025628Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:25.0026232Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:25.0026784Z gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu 2025-05-07T19:50:25.0027324Z gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu 
2025-05-07T19:50:25.0027879Z gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:50:25.0028348Z gen_embedding_backward_sgd_split_weighted_cuda.cu 2025-05-07T19:50:25.0028759Z gen_embedding_backward_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:50:25.0029186Z gen_embedding_backward_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:50:25.0029630Z gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:25.0030094Z gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:25.0030598Z gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:25.0031054Z gen_embedding_backward_sgd_split_unweighted_cuda.cu 2025-05-07T19:50:25.0031470Z gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:50:25.0031932Z gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:50:25.0032464Z gen_embedding_backward_adam_split_weighted_cuda.cu 2025-05-07T19:50:25.0032866Z gen_embedding_backward_adam_split_weighted_kernel_cta.cu 2025-05-07T19:50:25.0033284Z gen_embedding_backward_adam_split_weighted_kernel_warp.cu 2025-05-07T19:50:25.0033727Z gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:25.0034184Z gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:25.0034684Z gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:25.0035142Z gen_embedding_backward_adam_split_unweighted_cuda.cu 2025-05-07T19:50:25.0035557Z gen_embedding_backward_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:50:25.0036007Z gen_embedding_backward_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:50:25.0036410Z gen_embedding_backward_lamb_split_weighted_cuda.cu 2025-05-07T19:50:25.0036815Z gen_embedding_backward_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:50:25.0037301Z gen_embedding_backward_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:50:25.0037738Z 
gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:25.0038200Z gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:25.0038691Z gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:25.0039136Z gen_embedding_backward_lamb_split_unweighted_cuda.cu 2025-05-07T19:50:25.0039545Z gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:50:25.0039993Z gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:50:25.0040468Z gen_embedding_backward_lars_sgd_split_weighted_cuda.cu 2025-05-07T19:50:25.0040889Z gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:50:25.0041356Z gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:50:25.0041810Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:25.0042315Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:25.0042831Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:25.0043422Z gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu 2025-05-07T19:50:25.0043833Z gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:50:25.0044297Z gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:50:25.0044775Z gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu 2025-05-07T19:50:25.0045261Z gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu 2025-05-07T19:50:25.0045788Z gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu 2025-05-07T19:50:25.0046313Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:25.0046884Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:25.0047468Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:25.0048024Z 
gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu 2025-05-07T19:50:25.0048552Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:50:25.0049100Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:50:25.0049607Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu 2025-05-07T19:50:25.0050120Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:50:25.0050648Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:50:25.0051180Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:25.0051743Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:25.0052320Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:25.0052862Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu 2025-05-07T19:50:25.0053365Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:50:25.0053909Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:50:25.0054368Z gen_embedding_backward_none_split_weighted_cuda.cu 2025-05-07T19:50:25.0054750Z gen_embedding_backward_none_split_weighted_kernel_cta.cu 2025-05-07T19:50:25.0055161Z gen_embedding_backward_none_split_weighted_kernel_warp.cu 2025-05-07T19:50:25.0055563Z gen_embedding_backward_none_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:25.0056009Z gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:25.0056464Z gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:25.0056890Z gen_embedding_backward_none_split_unweighted_cuda.cu 2025-05-07T19:50:25.0057282Z gen_embedding_backward_none_split_unweighted_kernel_cta.cu 2025-05-07T19:50:25.0057699Z gen_embedding_backward_none_split_unweighted_kernel_warp.cu 
2025-05-07T19:50:25.0058258Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu 2025-05-07T19:50:25.0058809Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu 2025-05-07T19:50:25.0059392Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu 2025-05-07T19:50:25.0060235Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:25.0060907Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:25.0061607Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:25.0062309Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu 2025-05-07T19:50:25.0062928Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu 2025-05-07T19:50:25.0063569Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu 2025-05-07T19:50:25.0064027Z 2025-05-07T19:50:25.0064207Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:25.0064356Z 2025-05-07T19:50:25.0064427Z 2025-05-07T19:50:25.0064600Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:25.0064943Z gen_embedding_backward_split_unweighted_nobag_device_kernel_hip.hip 2025-05-07T19:50:25.0065444Z gen_embedding_backward_split_weighted_device_kernel_hip.hip 2025-05-07T19:50:25.0065902Z gen_embedding_backward_split_unweighted_device_kernel_hip.hip 2025-05-07T19:50:25.0066269Z 2025-05-07T19:50:25.0066447Z OTHER_SRCS: 2025-05-07T19:50:25.0066565Z 2025-05-07T19:50:25.0066634Z 2025-05-07T19:50:25.0066805Z CC_FLAGS: 2025-05-07T19:50:25.0066923Z 2025-05-07T19:50:25.0066990Z 2025-05-07T19:50:25.0067149Z NVCC_FLAGS: 2025-05-07T19:50:25.0067359Z --expt-relaxed-constexpr 2025-05-07T19:50:25.0067624Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:25.0067888Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:25.0068173Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:25.0068412Z 
2025-05-07T19:50:25.0068585Z HIPCC_FLAGS: 2025-05-07T19:50:25.0068708Z 2025-05-07T19:50:25.0068775Z 2025-05-07T19:50:25.0068947Z INCLUDE_DIRS: 2025-05-07T19:50:25.0069164Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:25.0069469Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:25.0069730Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:25.0070027Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:25.0070508Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include 2025-05-07T19:50:25.0071282Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:25.0071929Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:25.0072444Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:25.0072836Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:25.0073265Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:25.0073744Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:25.0074316Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:25.0074593Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include 2025-05-07T19:50:25.0074657Z 2025-05-07T19:50:25.0074737Z Selected Source Files: 2025-05-07T19:50:25.0074933Z codegen/training/backward/embedding_backward_dense_host_cpu.cpp 2025-05-07T19:50:25.0075037Z gen_embedding_backward_dense_split_cpu.cpp 2025-05-07T19:50:25.0075148Z gen_embedding_backward_split_adagrad_cpu.cpp 2025-05-07T19:50:25.0075293Z gen_embedding_backward_split_rowwise_adagrad_cpu.cpp 2025-05-07T19:50:25.0075395Z gen_embedding_backward_split_sgd_cpu.cpp 2025-05-07T19:50:25.0075494Z gen_embedding_backward_split_adam_cpu.cpp 2025-05-07T19:50:25.0075590Z gen_embedding_backward_split_lamb_cpu.cpp 2025-05-07T19:50:25.0075707Z gen_embedding_backward_split_lars_sgd_cpu.cpp 
2025-05-07T19:50:25.0075852Z gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp 2025-05-07T19:50:25.0076054Z gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp 2025-05-07T19:50:25.0076163Z gen_embedding_backward_split_none_cpu.cpp 2025-05-07T19:50:25.0076336Z gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:50:25.0076453Z gen_embedding_backward_split_approx_sgd_cpu.cpp 2025-05-07T19:50:25.0076613Z gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp 2025-05-07T19:50:25.0076813Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:50:25.0077031Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:50:25.0077277Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:50:25.0077449Z gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp 2025-05-07T19:50:25.0077559Z gen_embedding_backward_adagrad_split_cpu.cpp 2025-05-07T19:50:25.0077693Z gen_embedding_backward_rowwise_adagrad_split_cpu.cpp 2025-05-07T19:50:25.0077805Z gen_embedding_backward_sgd_split_cpu.cpp 2025-05-07T19:50:25.0077927Z gen_embedding_backward_split_grad_embedding_ops.cu 2025-05-07T19:50:25.0078081Z gen_embedding_backward_split_indice_weights_codegen_cuda.cu 2025-05-07T19:50:25.0078241Z gen_embedding_backward_dense_indice_weights_codegen_cuda.cu 2025-05-07T19:50:25.0078384Z gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu 2025-05-07T19:50:25.0078532Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu 2025-05-07T19:50:25.0078708Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu 2025-05-07T19:50:25.0078895Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu 2025-05-07T19:50:25.0079084Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu 2025-05-07T19:50:25.0079290Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:25.0079510Z 
gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:25.0079678Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu 2025-05-07T19:50:25.0079862Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu 2025-05-07T19:50:25.0080057Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu 2025-05-07T19:50:25.0080190Z gen_embedding_backward_adagrad_split_weighted_cuda.cu 2025-05-07T19:50:25.0080346Z gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:50:25.0080509Z gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:50:25.0080683Z gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:25.0080866Z gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:25.0081058Z gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:25.0081208Z gen_embedding_backward_adagrad_split_unweighted_cuda.cu 2025-05-07T19:50:25.0081371Z gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:50:25.0081539Z gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:50:25.0081708Z gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu 2025-05-07T19:50:25.0082040Z gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:50:25.0082230Z gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:50:25.0082718Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:25.0082951Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:25.0083186Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:25.0083375Z gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu 2025-05-07T19:50:25.0083593Z gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:50:25.0083805Z 
gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:50:25.0083936Z gen_embedding_backward_sgd_split_weighted_cuda.cu 2025-05-07T19:50:25.0084273Z gen_embedding_backward_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:50:25.0084429Z gen_embedding_backward_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:50:25.0084585Z gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:25.0084776Z gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:25.0084963Z gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:25.0085098Z gen_embedding_backward_sgd_split_unweighted_cuda.cu 2025-05-07T19:50:25.0085257Z gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:50:25.0085430Z gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:50:25.0085632Z gen_embedding_backward_adam_split_weighted_cuda.cu 2025-05-07T19:50:25.0085786Z gen_embedding_backward_adam_split_weighted_kernel_cta.cu 2025-05-07T19:50:25.0085951Z gen_embedding_backward_adam_split_weighted_kernel_warp.cu 2025-05-07T19:50:25.0086111Z gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:25.0086301Z gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:25.0086495Z gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:25.0086639Z gen_embedding_backward_adam_split_unweighted_cuda.cu 2025-05-07T19:50:25.0086800Z gen_embedding_backward_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:50:25.0086965Z gen_embedding_backward_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:50:25.0087107Z gen_embedding_backward_lamb_split_weighted_cuda.cu 2025-05-07T19:50:25.0087257Z gen_embedding_backward_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:50:25.0087414Z gen_embedding_backward_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:50:25.0087583Z gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu 
2025-05-07T19:50:25.0087769Z gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:25.0087959Z gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:25.0088110Z gen_embedding_backward_lamb_split_unweighted_cuda.cu 2025-05-07T19:50:25.0088274Z gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:50:25.0088439Z gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:50:25.0088587Z gen_embedding_backward_lars_sgd_split_weighted_cuda.cu 2025-05-07T19:50:25.0088766Z gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:50:25.0088941Z gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:50:25.0089120Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:25.0089331Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:25.0089542Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:25.0089701Z gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu 2025-05-07T19:50:25.0089889Z gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:50:25.0090076Z gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:50:25.0090274Z gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu 2025-05-07T19:50:25.0090494Z gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu 2025-05-07T19:50:25.0090724Z gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu 2025-05-07T19:50:25.0090948Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:25.0091200Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:25.0091459Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:25.0091667Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu 
2025-05-07T19:50:25.0091889Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:50:25.0092132Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:50:25.0092325Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu 2025-05-07T19:50:25.0092594Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:50:25.0092828Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:50:25.0093054Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:25.0093301Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:25.0093553Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:25.0093768Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu 2025-05-07T19:50:25.0094042Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:50:25.0094271Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:50:25.0094412Z gen_embedding_backward_none_split_weighted_cuda.cu 2025-05-07T19:50:25.0094570Z gen_embedding_backward_none_split_weighted_kernel_cta.cu 2025-05-07T19:50:25.0094847Z gen_embedding_backward_none_split_weighted_kernel_warp.cu 2025-05-07T19:50:25.0095009Z gen_embedding_backward_none_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:25.0095185Z gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:25.0095361Z gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:25.0095499Z gen_embedding_backward_none_split_unweighted_cuda.cu 2025-05-07T19:50:25.0095652Z gen_embedding_backward_none_split_unweighted_kernel_cta.cu 2025-05-07T19:50:25.0095813Z gen_embedding_backward_none_split_unweighted_kernel_warp.cu 2025-05-07T19:50:25.0096026Z 
gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu 2025-05-07T19:50:25.0096262Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu 2025-05-07T19:50:25.0096494Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu 2025-05-07T19:50:25.0096728Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:25.0097000Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:25.0097261Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:25.0097478Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu 2025-05-07T19:50:25.0097723Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu 2025-05-07T19:50:25.0097962Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu 2025-05-07T19:50:25.0098029Z 2025-05-07T19:50:25.0098112Z HIPified Source Files: 2025-05-07T19:50:25.0098130Z 2025-05-07T19:50:25.0098193Z 2025-05-07T19:50:25.0098271Z Library Dependencies: 2025-05-07T19:50:25.0098337Z torch 2025-05-07T19:50:25.0098418Z torch_library 2025-05-07T19:50:25.0098698Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so 2025-05-07T19:50:25.0098930Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:25.0099244Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:25.0099565Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:25.0099889Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:25.0099959Z fbgemm 2025-05-07T19:50:25.0100216Z fbgemm_gpu_config 2025-05-07T19:50:25.0100301Z fbgemm_gpu_tbe_cache 2025-05-07T19:50:25.0100381Z 
fbgemm_gpu_tbe_common 2025-05-07T19:50:25.0100473Z fbgemm_gpu_tbe_utils 2025-05-07T19:50:25.0100569Z fbgemm_gpu_sparse_async_cumsum 2025-05-07T19:50:25.0100777Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:25.0100845Z 2025-05-07T19:50:25.0100932Z Output Library: 2025-05-07T19:50:25.0101053Z fbgemm_gpu_tbe_training_backward 2025-05-07T19:50:25.0101119Z 2025-05-07T19:50:25.0101216Z Destination Directory: 2025-05-07T19:50:25.0101362Z fbgemm_gpu 2025-05-07T19:50:25.0101468Z ================================================================================ 2025-05-07T19:50:25.0101473Z 2025-05-07T19:50:25.0101477Z 2025-05-07T19:50:25.0101481Z 2025-05-07T19:50:25.0101585Z ================================================================================ 2025-05-07T19:50:25.0101786Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward_gwd (SHARED) 2025-05-07T19:50:25.0101854Z 2025-05-07T19:50:25.0101928Z CPU_SRCS: 2025-05-07T19:50:25.0101941Z 2025-05-07T19:50:25.0102012Z 2025-05-07T19:50:25.0102146Z GPU_SRCS: 2025-05-07T19:50:25.0102342Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu 2025-05-07T19:50:25.0102570Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu 2025-05-07T19:50:25.0102792Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu 2025-05-07T19:50:25.0102997Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu 2025-05-07T19:50:25.0103233Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu 2025-05-07T19:50:25.0103458Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu 2025-05-07T19:50:25.0103663Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu 2025-05-07T19:50:25.0103902Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:50:25.0104137Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:50:25.0104349Z 
gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu 2025-05-07T19:50:25.0104592Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:50:25.0104840Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:50:25.0104909Z 2025-05-07T19:50:25.0104989Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:25.0104994Z 2025-05-07T19:50:25.0105068Z 2025-05-07T19:50:25.0105149Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:25.0105153Z 2025-05-07T19:50:25.0105228Z 2025-05-07T19:50:25.0105303Z OTHER_SRCS: 2025-05-07T19:50:25.0105317Z 2025-05-07T19:50:25.0105383Z 2025-05-07T19:50:25.0105456Z CC_FLAGS: 2025-05-07T19:50:25.0105461Z 2025-05-07T19:50:25.0105527Z 2025-05-07T19:50:25.0105609Z NVCC_FLAGS: 2025-05-07T19:50:25.0105704Z --expt-relaxed-constexpr 2025-05-07T19:50:25.0105794Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:25.0105904Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:25.0105993Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:25.0106064Z 2025-05-07T19:50:25.0106141Z HIPCC_FLAGS: 2025-05-07T19:50:25.0106148Z 2025-05-07T19:50:25.0106225Z 2025-05-07T19:50:25.0106297Z INCLUDE_DIRS: 2025-05-07T19:50:25.0106407Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:25.0106504Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:25.0106602Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:25.0106702Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:25.0106993Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include 2025-05-07T19:50:25.0107387Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:25.0107526Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:25.0107699Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:25.0107850Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:25.0108052Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:25.0108252Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:25.0108403Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:25.0108704Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include 2025-05-07T19:50:25.0108776Z 2025-05-07T19:50:25.0108879Z Selected Source Files: 2025-05-07T19:50:25.0109078Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu 2025-05-07T19:50:25.0109354Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu 2025-05-07T19:50:25.0109583Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu 2025-05-07T19:50:25.0109782Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu 2025-05-07T19:50:25.0110005Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu 2025-05-07T19:50:25.0110228Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu 2025-05-07T19:50:25.0110437Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu 2025-05-07T19:50:25.0110714Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:50:25.0110942Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:50:25.0111166Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu 2025-05-07T19:50:25.0111404Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:50:25.0111647Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:50:25.0111726Z 2025-05-07T19:50:25.0111812Z HIPified Source Files: 2025-05-07T19:50:25.0111816Z 2025-05-07T19:50:25.0111887Z 2025-05-07T19:50:25.0111976Z Library Dependencies: 2025-05-07T19:50:25.0112053Z torch 2025-05-07T19:50:25.0112131Z torch_library 2025-05-07T19:50:25.0112550Z 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so 2025-05-07T19:50:25.0112795Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:25.0113099Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:25.0113430Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:25.0113694Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:25.0113785Z fbgemm_gpu_tbe_training_backward 2025-05-07T19:50:25.0113982Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:25.0114049Z 2025-05-07T19:50:25.0114133Z Output Library: 2025-05-07T19:50:25.0114226Z fbgemm_gpu_tbe_training_backward_gwd 2025-05-07T19:50:25.0114296Z 2025-05-07T19:50:25.0114389Z Destination Directory: 2025-05-07T19:50:25.0114465Z fbgemm_gpu 2025-05-07T19:50:25.0114568Z ================================================================================ 2025-05-07T19:50:25.0114572Z 2025-05-07T19:50:25.0114575Z 2025-05-07T19:50:25.0114579Z 2025-05-07T19:50:25.0114685Z ================================================================================ 2025-05-07T19:50:25.0114875Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward_vbe (SHARED) 2025-05-07T19:50:25.0114942Z 2025-05-07T19:50:25.0115013Z CPU_SRCS: 2025-05-07T19:50:25.0115030Z 2025-05-07T19:50:25.0115095Z 2025-05-07T19:50:25.0115161Z GPU_SRCS: 2025-05-07T19:50:25.0115341Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp 2025-05-07T19:50:25.0115533Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp 2025-05-07T19:50:25.0115720Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:25.0115898Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp 2025-05-07T19:50:25.0116146Z 
gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp 2025-05-07T19:50:25.0116377Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:25.0116515Z gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp 2025-05-07T19:50:25.0116661Z gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:25.0116816Z gen_embedding_backward_dense_split_weighted_vbe_meta.cpp 2025-05-07T19:50:25.0116966Z gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:25.0117102Z gen_embedding_backward_adam_split_weighted_vbe_meta.cpp 2025-05-07T19:50:25.0117257Z gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:25.0117497Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu 2025-05-07T19:50:25.0117700Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:25.0117914Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:25.0118091Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu 2025-05-07T19:50:25.0118283Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:25.0118487Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:25.0118725Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:25.0118937Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:25.0119144Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:25.0119327Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu 2025-05-07T19:50:25.0119529Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:25.0119734Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:25.0119963Z 
gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu 2025-05-07T19:50:25.0120207Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:25.0120454Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:25.0120688Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:25.0120945Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:25.0121207Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:25.0121349Z gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu 2025-05-07T19:50:25.0121502Z gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:25.0121663Z gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:25.0121801Z gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:25.0121971Z gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:25.0122139Z gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:25.0122274Z gen_embedding_backward_dense_split_weighted_vbe_cuda.cu 2025-05-07T19:50:25.0122444Z gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:25.0122608Z gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:25.0122755Z gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:25.0122932Z gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:25.0123104Z gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:25.0123241Z gen_embedding_backward_adam_split_weighted_vbe_cuda.cu 2025-05-07T19:50:25.0123401Z gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:25.0123575Z gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu 
2025-05-07T19:50:25.0123716Z gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:25.0123883Z gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:25.0124064Z gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:25.0124128Z 2025-05-07T19:50:25.0124205Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:25.0124209Z 2025-05-07T19:50:25.0124285Z 2025-05-07T19:50:25.0124358Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:25.0124365Z 2025-05-07T19:50:25.0124427Z 2025-05-07T19:50:25.0124495Z OTHER_SRCS: 2025-05-07T19:50:25.0124500Z 2025-05-07T19:50:25.0124572Z 2025-05-07T19:50:25.0124639Z CC_FLAGS: 2025-05-07T19:50:25.0124642Z 2025-05-07T19:50:25.0124703Z 2025-05-07T19:50:25.0124783Z NVCC_FLAGS: 2025-05-07T19:50:25.0124868Z --expt-relaxed-constexpr 2025-05-07T19:50:25.0124951Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:25.0125092Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:25.0125189Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:25.0125252Z 2025-05-07T19:50:25.0125327Z HIPCC_FLAGS: 2025-05-07T19:50:25.0125330Z 2025-05-07T19:50:25.0125407Z 2025-05-07T19:50:25.0125479Z INCLUDE_DIRS: 2025-05-07T19:50:25.0125576Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:25.0125658Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:25.0125762Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:25.0125856Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:25.0126113Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include 2025-05-07T19:50:25.0126544Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:25.0126672Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:25.0126820Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:25.0126975Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:25.0127159Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:25.0127344Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:25.0127474Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:25.0127766Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include 2025-05-07T19:50:25.0127828Z 2025-05-07T19:50:25.0127908Z Selected Source Files: 2025-05-07T19:50:25.0128106Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp 2025-05-07T19:50:25.0128285Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp 2025-05-07T19:50:25.0128472Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:25.0128659Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp 2025-05-07T19:50:25.0128893Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp 2025-05-07T19:50:25.0129131Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:25.0129269Z gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp 2025-05-07T19:50:25.0129428Z gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:25.0129570Z gen_embedding_backward_dense_split_weighted_vbe_meta.cpp 2025-05-07T19:50:25.0129722Z gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:25.0129868Z gen_embedding_backward_adam_split_weighted_vbe_meta.cpp 2025-05-07T19:50:25.0130019Z gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:25.0130197Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu 2025-05-07T19:50:25.0130410Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:25.0130613Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:25.0130783Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu 2025-05-07T19:50:25.0130973Z 
gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:25.0131177Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:25.0131360Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:25.0131574Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:25.0131792Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:25.0131972Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu 2025-05-07T19:50:25.0132180Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:25.0132389Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:25.0132611Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu 2025-05-07T19:50:25.0132859Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:25.0133168Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:25.0133400Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:25.0133656Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:25.0133915Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:25.0134053Z gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu 2025-05-07T19:50:25.0134210Z gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:25.0134879Z gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:25.0135033Z gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:25.0135194Z gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:25.0135361Z gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu 
2025-05-07T19:50:25.0135511Z gen_embedding_backward_dense_split_weighted_vbe_cuda.cu 2025-05-07T19:50:25.0135672Z gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:25.0135836Z gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:25.0135992Z gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:25.0136165Z gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:25.0136336Z gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:25.0136476Z gen_embedding_backward_adam_split_weighted_vbe_cuda.cu 2025-05-07T19:50:25.0136644Z gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:25.0136803Z gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:25.0136943Z gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:25.0137116Z gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:25.0137286Z gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:25.0137354Z 2025-05-07T19:50:25.0137437Z HIPified Source Files: 2025-05-07T19:50:25.0137450Z 2025-05-07T19:50:25.0137511Z 2025-05-07T19:50:25.0137590Z Library Dependencies: 2025-05-07T19:50:25.0137652Z torch 2025-05-07T19:50:25.0137726Z torch_library 2025-05-07T19:50:25.0138003Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so 2025-05-07T19:50:25.0138235Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:25.0138537Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:25.0138859Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:25.0139111Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:25.0139211Z fbgemm_gpu_tbe_training_backward 
2025-05-07T19:50:25.0139404Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:25.0139474Z 2025-05-07T19:50:25.0139547Z Output Library: 2025-05-07T19:50:25.0139755Z fbgemm_gpu_tbe_training_backward_vbe 2025-05-07T19:50:25.0139839Z 2025-05-07T19:50:25.0139933Z Destination Directory: 2025-05-07T19:50:25.0140012Z fbgemm_gpu 2025-05-07T19:50:25.0140324Z ================================================================================ 2025-05-07T19:50:25.0140330Z 2025-05-07T19:50:25.0140334Z 2025-05-07T19:50:25.0140338Z 2025-05-07T19:50:25.0140451Z ================================================================================ 2025-05-07T19:50:25.0140771Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward_dense (SHARED) 2025-05-07T19:50:25.0140857Z 2025-05-07T19:50:25.0140945Z CPU_SRCS: 2025-05-07T19:50:25.0140949Z 2025-05-07T19:50:25.0141029Z 2025-05-07T19:50:25.0141134Z GPU_SRCS: 2025-05-07T19:50:25.0141287Z gen_embedding_backward_dense_split_weighted_cuda.cu 2025-05-07T19:50:25.0141444Z gen_embedding_backward_dense_split_weighted_meta.cpp 2025-05-07T19:50:25.0141687Z gen_embedding_backward_dense_split_weighted_kernel_cta.cu 2025-05-07T19:50:25.0141863Z gen_embedding_backward_dense_split_weighted_kernel_warp.cu 2025-05-07T19:50:25.0142042Z gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:25.0142230Z gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:25.0142450Z gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:25.0142656Z gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:25.0142813Z gen_embedding_backward_dense_split_unweighted_cuda.cu 2025-05-07T19:50:25.0143048Z gen_embedding_backward_dense_split_unweighted_meta.cpp 2025-05-07T19:50:25.0143232Z gen_embedding_backward_dense_split_unweighted_kernel_cta.cu 2025-05-07T19:50:25.0143413Z gen_embedding_backward_dense_split_unweighted_kernel_warp.cu 
2025-05-07T19:50:25.0143546Z gen_embedding_backward_split_dense.cpp 2025-05-07T19:50:25.0143624Z 2025-05-07T19:50:25.0143721Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:25.0143728Z 2025-05-07T19:50:25.0143813Z 2025-05-07T19:50:25.0143930Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:25.0143935Z 2025-05-07T19:50:25.0144015Z 2025-05-07T19:50:25.0144104Z OTHER_SRCS: 2025-05-07T19:50:25.0144108Z 2025-05-07T19:50:25.0144214Z 2025-05-07T19:50:25.0144298Z CC_FLAGS: 2025-05-07T19:50:25.0144302Z 2025-05-07T19:50:25.0144383Z 2025-05-07T19:50:25.0144494Z NVCC_FLAGS: 2025-05-07T19:50:25.0144601Z --expt-relaxed-constexpr 2025-05-07T19:50:25.0144703Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:25.0144809Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:25.0144940Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:25.0145019Z 2025-05-07T19:50:25.0145103Z HIPCC_FLAGS: 2025-05-07T19:50:25.0145107Z 2025-05-07T19:50:25.0145208Z 2025-05-07T19:50:25.0145298Z INCLUDE_DIRS: 2025-05-07T19:50:25.0145414Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:25.0145523Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:25.0145650Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:25.0145763Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:25.0146050Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include 2025-05-07T19:50:25.0146462Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:25.0146608Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:25.0146774Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:25.0146935Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:25.0147165Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:25.0147372Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:25.0147526Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:25.0147850Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include 2025-05-07T19:50:25.0147930Z 2025-05-07T19:50:25.0148031Z Selected Source Files: 2025-05-07T19:50:25.0148213Z gen_embedding_backward_dense_split_weighted_meta.cpp 2025-05-07T19:50:25.0148392Z gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:25.0148546Z gen_embedding_backward_dense_split_unweighted_meta.cpp 2025-05-07T19:50:25.0148664Z gen_embedding_backward_split_dense.cpp 2025-05-07T19:50:25.0148830Z gen_embedding_backward_dense_split_weighted_cuda.cu 2025-05-07T19:50:25.0148994Z gen_embedding_backward_dense_split_weighted_kernel_cta.cu 2025-05-07T19:50:25.0149165Z gen_embedding_backward_dense_split_weighted_kernel_warp.cu 2025-05-07T19:50:25.0149359Z gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:25.0149560Z gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:25.0149766Z gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:25.0149944Z gen_embedding_backward_dense_split_unweighted_cuda.cu 2025-05-07T19:50:25.0150180Z gen_embedding_backward_dense_split_unweighted_kernel_cta.cu 2025-05-07T19:50:25.0150367Z gen_embedding_backward_dense_split_unweighted_kernel_warp.cu 2025-05-07T19:50:25.0150451Z 2025-05-07T19:50:25.0150568Z HIPified Source Files: 2025-05-07T19:50:25.0150572Z 2025-05-07T19:50:25.0150656Z 2025-05-07T19:50:25.0150746Z Library Dependencies: 2025-05-07T19:50:25.0150834Z torch 2025-05-07T19:50:25.0150913Z torch_library 2025-05-07T19:50:25.0151214Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so 2025-05-07T19:50:25.0151470Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:25.0151865Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so 
2025-05-07T19:50:25.0152223Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:25.0152611Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:25.0152733Z fbgemm_gpu_tbe_training_backward 2025-05-07T19:50:25.0152935Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:25.0153010Z 2025-05-07T19:50:25.0153103Z Output Library: 2025-05-07T19:50:25.0153203Z fbgemm_gpu_tbe_training_backward_dense 2025-05-07T19:50:25.0153277Z 2025-05-07T19:50:25.0153365Z Destination Directory: 2025-05-07T19:50:25.0153611Z fbgemm_gpu 2025-05-07T19:50:25.0153715Z ================================================================================ 2025-05-07T19:50:25.0153719Z 2025-05-07T19:50:25.0153723Z 2025-05-07T19:50:25.0153726Z 2025-05-07T19:50:25.0153831Z ================================================================================ 2025-05-07T19:50:25.0154068Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward_split_host (SHARED) 2025-05-07T19:50:25.0154140Z 2025-05-07T19:50:25.0154215Z CPU_SRCS: 2025-05-07T19:50:25.0154219Z 2025-05-07T19:50:25.0154306Z 2025-05-07T19:50:25.0154377Z GPU_SRCS: 2025-05-07T19:50:25.0154491Z gen_embedding_backward_split_adagrad.cpp 2025-05-07T19:50:25.0154638Z gen_embedding_backward_split_rowwise_adagrad.cpp 2025-05-07T19:50:25.0154740Z gen_embedding_backward_split_sgd.cpp 2025-05-07T19:50:25.0154841Z gen_embedding_backward_split_adam.cpp 2025-05-07T19:50:25.0154944Z gen_embedding_backward_split_lamb.cpp 2025-05-07T19:50:25.0155075Z gen_embedding_backward_split_lars_sgd.cpp 2025-05-07T19:50:25.0155216Z gen_embedding_backward_split_partial_rowwise_adam.cpp 2025-05-07T19:50:25.0155360Z gen_embedding_backward_split_partial_rowwise_lamb.cpp 2025-05-07T19:50:25.0155485Z gen_embedding_backward_split_none.cpp 2025-05-07T19:50:25.0155658Z gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp 
2025-05-07T19:50:25.0155769Z gen_embedding_backward_split_approx_sgd.cpp 2025-05-07T19:50:25.0155920Z gen_embedding_backward_split_approx_rowwise_adagrad.cpp 2025-05-07T19:50:25.0156144Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp 2025-05-07T19:50:25.0156360Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:50:25.0156544Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:50:25.0156721Z gen_embedding_backward_split_rowwise_weighted_adagrad.cpp 2025-05-07T19:50:25.0156841Z gen_embedding_backward_ssd_rowwise_adagrad.cpp 2025-05-07T19:50:25.0156985Z gen_embedding_backward_adagrad_split_weighted_meta.cpp 2025-05-07T19:50:25.0157157Z gen_embedding_backward_adagrad_split_unweighted_meta.cpp 2025-05-07T19:50:25.0157336Z gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp 2025-05-07T19:50:25.0157514Z gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp 2025-05-07T19:50:25.0157648Z gen_embedding_backward_sgd_split_weighted_meta.cpp 2025-05-07T19:50:25.0157810Z gen_embedding_backward_sgd_split_unweighted_meta.cpp 2025-05-07T19:50:25.0157939Z gen_embedding_backward_adam_split_weighted_meta.cpp 2025-05-07T19:50:25.0158077Z gen_embedding_backward_adam_split_unweighted_meta.cpp 2025-05-07T19:50:25.0158280Z gen_embedding_backward_lamb_split_weighted_meta.cpp 2025-05-07T19:50:25.0158425Z gen_embedding_backward_lamb_split_unweighted_meta.cpp 2025-05-07T19:50:25.0158566Z gen_embedding_backward_lars_sgd_split_weighted_meta.cpp 2025-05-07T19:50:25.0158738Z gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp 2025-05-07T19:50:25.0158931Z gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp 2025-05-07T19:50:25.0159130Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp 2025-05-07T19:50:25.0159317Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp 2025-05-07T19:50:25.0159529Z 
gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp 2025-05-07T19:50:25.0159731Z gen_embedding_backward_none_split_weighted_meta.cpp 2025-05-07T19:50:25.0159876Z gen_embedding_backward_none_split_unweighted_meta.cpp 2025-05-07T19:50:25.0160118Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp 2025-05-07T19:50:25.0160355Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp 2025-05-07T19:50:25.0160428Z 2025-05-07T19:50:25.0160529Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:25.0160533Z 2025-05-07T19:50:25.0160608Z 2025-05-07T19:50:25.0160697Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:25.0160700Z 2025-05-07T19:50:25.0160769Z 2025-05-07T19:50:25.0160862Z OTHER_SRCS: 2025-05-07T19:50:25.0160866Z 2025-05-07T19:50:25.0160940Z 2025-05-07T19:50:25.0161016Z CC_FLAGS: 2025-05-07T19:50:25.0161020Z 2025-05-07T19:50:25.0161114Z 2025-05-07T19:50:25.0161193Z NVCC_FLAGS: 2025-05-07T19:50:25.0161292Z --expt-relaxed-constexpr 2025-05-07T19:50:25.0161396Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:25.0161502Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:25.0161598Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:25.0161666Z 2025-05-07T19:50:25.0161758Z HIPCC_FLAGS: 2025-05-07T19:50:25.0161762Z 2025-05-07T19:50:25.0161833Z 2025-05-07T19:50:25.0161913Z INCLUDE_DIRS: 2025-05-07T19:50:25.0162027Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:25.0162125Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:25.0162227Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:25.0162325Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:25.0162594Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include 2025-05-07T19:50:25.0162966Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:25.0163104Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:25.0163273Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:25.0163425Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:25.0163621Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:25.0163820Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:25.0163966Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:25.0164253Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include 2025-05-07T19:50:25.0164333Z 2025-05-07T19:50:25.0164436Z Selected Source Files: 2025-05-07T19:50:25.0164549Z gen_embedding_backward_split_adagrad.cpp 2025-05-07T19:50:25.0164678Z gen_embedding_backward_split_rowwise_adagrad.cpp 2025-05-07T19:50:25.0164802Z gen_embedding_backward_split_sgd.cpp 2025-05-07T19:50:25.0164905Z gen_embedding_backward_split_adam.cpp 2025-05-07T19:50:25.0165012Z gen_embedding_backward_split_lamb.cpp 2025-05-07T19:50:25.0165126Z gen_embedding_backward_split_lars_sgd.cpp 2025-05-07T19:50:25.0165284Z gen_embedding_backward_split_partial_rowwise_adam.cpp 2025-05-07T19:50:25.0165428Z gen_embedding_backward_split_partial_rowwise_lamb.cpp 2025-05-07T19:50:25.0165530Z gen_embedding_backward_split_none.cpp 2025-05-07T19:50:25.0165719Z gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp 2025-05-07T19:50:25.0165834Z gen_embedding_backward_split_approx_sgd.cpp 2025-05-07T19:50:25.0165987Z gen_embedding_backward_split_approx_rowwise_adagrad.cpp 2025-05-07T19:50:25.0166251Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp 2025-05-07T19:50:25.0166468Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:50:25.0166660Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:50:25.0166820Z gen_embedding_backward_split_rowwise_weighted_adagrad.cpp 2025-05-07T19:50:25.0166960Z gen_embedding_backward_ssd_rowwise_adagrad.cpp 
2025-05-07T19:50:25.0167103Z gen_embedding_backward_adagrad_split_weighted_meta.cpp 2025-05-07T19:50:25.0167259Z gen_embedding_backward_adagrad_split_unweighted_meta.cpp 2025-05-07T19:50:25.0167502Z gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp 2025-05-07T19:50:25.0167685Z gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp 2025-05-07T19:50:25.0167816Z gen_embedding_backward_sgd_split_weighted_meta.cpp 2025-05-07T19:50:25.0167965Z gen_embedding_backward_sgd_split_unweighted_meta.cpp 2025-05-07T19:50:25.0168098Z gen_embedding_backward_adam_split_weighted_meta.cpp 2025-05-07T19:50:25.0168239Z gen_embedding_backward_adam_split_unweighted_meta.cpp 2025-05-07T19:50:25.0168370Z gen_embedding_backward_lamb_split_weighted_meta.cpp 2025-05-07T19:50:25.0168520Z gen_embedding_backward_lamb_split_unweighted_meta.cpp 2025-05-07T19:50:25.0168659Z gen_embedding_backward_lars_sgd_split_weighted_meta.cpp 2025-05-07T19:50:25.0168815Z gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp 2025-05-07T19:50:25.0169014Z gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp 2025-05-07T19:50:25.0169210Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp 2025-05-07T19:50:25.0169398Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp 2025-05-07T19:50:25.0169605Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp 2025-05-07T19:50:25.0169732Z gen_embedding_backward_none_split_weighted_meta.cpp 2025-05-07T19:50:25.0169866Z gen_embedding_backward_none_split_unweighted_meta.cpp 2025-05-07T19:50:25.0170083Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp 2025-05-07T19:50:25.0170324Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp 2025-05-07T19:50:25.0170389Z 2025-05-07T19:50:25.0170467Z HIPified Source Files: 2025-05-07T19:50:25.0170471Z 2025-05-07T19:50:25.0170548Z 2025-05-07T19:50:25.0170632Z Library Dependencies: 
2025-05-07T19:50:25.0170697Z torch 2025-05-07T19:50:25.0170772Z torch_library 2025-05-07T19:50:25.0171065Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so 2025-05-07T19:50:25.0171297Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:25.0171601Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:25.0171935Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:25.0172193Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:25.0172269Z fbgemm_gpu_config 2025-05-07T19:50:25.0172358Z fbgemm_gpu_tbe_utils 2025-05-07T19:50:25.0172552Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:25.0172774Z 2025-05-07T19:50:25.0172851Z Output Library: 2025-05-07T19:50:25.0172969Z fbgemm_gpu_tbe_training_backward_split_host 2025-05-07T19:50:25.0173037Z 2025-05-07T19:50:25.0173120Z Destination Directory: 2025-05-07T19:50:25.0173201Z fbgemm_gpu 2025-05-07T19:50:25.0173305Z ================================================================================ 2025-05-07T19:50:25.0173313Z 2025-05-07T19:50:25.0173316Z 2025-05-07T19:50:25.0173320Z 2025-05-07T19:50:25.0173414Z ================================================================================ 2025-05-07T19:50:25.0173585Z GPU CPP Library Target: fbgemm_gpu_tbe_index_select (SHARED) 2025-05-07T19:50:25.0173654Z 2025-05-07T19:50:25.0173731Z CPU_SRCS: 2025-05-07T19:50:25.0173970Z codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp 2025-05-07T19:50:25.0174152Z codegen/training/index_select/batch_index_select_dim0_ops.cpp 2025-05-07T19:50:25.0174224Z 2025-05-07T19:50:25.0174292Z GPU_SRCS: 2025-05-07T19:50:25.0174470Z codegen/training/index_select/batch_index_select_dim0_host.cpp 2025-05-07T19:50:25.0174600Z 
gen_batch_index_select_dim0_forward_codegen_cuda.cu 2025-05-07T19:50:25.0174718Z gen_batch_index_select_dim0_forward_kernel.cu 2025-05-07T19:50:25.0174844Z gen_batch_index_select_dim0_forward_kernel_small.cu 2025-05-07T19:50:25.0174976Z gen_batch_index_select_dim0_backward_codegen_cuda.cu 2025-05-07T19:50:25.0175152Z gen_batch_index_select_dim0_backward_kernel_cta.cu 2025-05-07T19:50:25.0175272Z gen_batch_index_select_dim0_backward_kernel_warp.cu 2025-05-07T19:50:25.0175398Z gen_embedding_backward_split_grad_index_select.cu 2025-05-07T19:50:25.0175464Z 2025-05-07T19:50:25.0175551Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:25.0175555Z 2025-05-07T19:50:25.0175631Z 2025-05-07T19:50:25.0175712Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:25.0175716Z 2025-05-07T19:50:25.0175778Z 2025-05-07T19:50:25.0175846Z OTHER_SRCS: 2025-05-07T19:50:25.0175860Z 2025-05-07T19:50:25.0175923Z 2025-05-07T19:50:25.0175989Z CC_FLAGS: 2025-05-07T19:50:25.0175992Z 2025-05-07T19:50:25.0176059Z 2025-05-07T19:50:25.0176137Z NVCC_FLAGS: 2025-05-07T19:50:25.0176221Z --expt-relaxed-constexpr 2025-05-07T19:50:25.0176306Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:25.0176416Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:25.0176504Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:25.0176565Z 2025-05-07T19:50:25.0176645Z HIPCC_FLAGS: 2025-05-07T19:50:25.0176649Z 2025-05-07T19:50:25.0176720Z 2025-05-07T19:50:25.0176788Z INCLUDE_DIRS: 2025-05-07T19:50:25.0176882Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:25.0176973Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:25.0177064Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:25.0177154Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:25.0177419Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include 2025-05-07T19:50:25.0177788Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:25.0177915Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:25.0178060Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:25.0178214Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:25.0178397Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:25.0178582Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:25.0178717Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:25.0178992Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include 2025-05-07T19:50:25.0179055Z 2025-05-07T19:50:25.0179137Z Selected Source Files: 2025-05-07T19:50:25.0179336Z codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp 2025-05-07T19:50:25.0179507Z codegen/training/index_select/batch_index_select_dim0_ops.cpp 2025-05-07T19:50:25.0179761Z codegen/training/index_select/batch_index_select_dim0_host.cpp 2025-05-07T19:50:25.0179891Z gen_batch_index_select_dim0_forward_codegen_cuda.cu 2025-05-07T19:50:25.0180000Z gen_batch_index_select_dim0_forward_kernel.cu 2025-05-07T19:50:25.0180302Z gen_batch_index_select_dim0_forward_kernel_small.cu 2025-05-07T19:50:25.0180438Z gen_batch_index_select_dim0_backward_codegen_cuda.cu 2025-05-07T19:50:25.0180576Z gen_batch_index_select_dim0_backward_kernel_cta.cu 2025-05-07T19:50:25.0180710Z gen_batch_index_select_dim0_backward_kernel_warp.cu 2025-05-07T19:50:25.0180835Z gen_embedding_backward_split_grad_index_select.cu 2025-05-07T19:50:25.0180918Z 2025-05-07T19:50:25.0181002Z HIPified Source Files: 2025-05-07T19:50:25.0181006Z 2025-05-07T19:50:25.0181071Z 2025-05-07T19:50:25.0181163Z Library Dependencies: 2025-05-07T19:50:25.0181233Z torch 2025-05-07T19:50:25.0181368Z torch_library 2025-05-07T19:50:25.0181671Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so 2025-05-07T19:50:25.0182072Z 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:25.0182405Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:25.0182753Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:25.0183039Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:25.0183239Z fbgemm_gpu_sparse_async_cumsum 2025-05-07T19:50:25.0183328Z fbgemm_gpu_tbe_utils 2025-05-07T19:50:25.0183555Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:25.0183626Z 2025-05-07T19:50:25.0183708Z Output Library: 2025-05-07T19:50:25.0183803Z fbgemm_gpu_tbe_index_select 2025-05-07T19:50:25.0183878Z 2025-05-07T19:50:25.0183966Z Destination Directory: 2025-05-07T19:50:25.0184045Z fbgemm_gpu 2025-05-07T19:50:25.0184163Z ================================================================================ 2025-05-07T19:50:25.0184168Z 2025-05-07T19:50:25.0184172Z 2025-05-07T19:50:25.0184175Z 2025-05-07T19:50:25.0184283Z ================================================================================ 2025-05-07T19:50:25.0184471Z GPU CPP Library Target: fbgemm_gpu_embedding_inplace_ops (SHARED) 2025-05-07T19:50:25.0184559Z 2025-05-07T19:50:25.0184633Z CPU_SRCS: 2025-05-07T19:50:25.0184808Z src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp 2025-05-07T19:50:25.0184881Z 2025-05-07T19:50:25.0184964Z GPU_SRCS: 2025-05-07T19:50:25.0185131Z src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp 2025-05-07T19:50:25.0185281Z src/embedding_inplace_ops/embedding_inplace_update.cu 2025-05-07T19:50:25.0185360Z 2025-05-07T19:50:25.0185443Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:25.0185447Z 2025-05-07T19:50:25.0185518Z 2025-05-07T19:50:25.0185600Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:25.0185604Z 2025-05-07T19:50:25.0185684Z 2025-05-07T19:50:25.0185756Z OTHER_SRCS: 
2025-05-07T19:50:25.0185760Z 2025-05-07T19:50:25.0185826Z 2025-05-07T19:50:25.0185914Z CC_FLAGS: 2025-05-07T19:50:25.0185918Z 2025-05-07T19:50:25.0185984Z 2025-05-07T19:50:25.0186056Z NVCC_FLAGS: 2025-05-07T19:50:25.0186148Z --expt-relaxed-constexpr 2025-05-07T19:50:25.0186246Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:25.0186343Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:25.0186439Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:25.0186515Z 2025-05-07T19:50:25.0186595Z HIPCC_FLAGS: 2025-05-07T19:50:25.0186599Z 2025-05-07T19:50:25.0186667Z 2025-05-07T19:50:25.0186742Z INCLUDE_DIRS: 2025-05-07T19:50:25.0186851Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:25.0186940Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:25.0187039Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:25.0187151Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:25.0187434Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include 2025-05-07T19:50:25.0187824Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:25.0187969Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:25.0188124Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:25.0188278Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:25.0188474Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:25.0188686Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:25.0188825Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:25.0189123Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include 2025-05-07T19:50:25.0189207Z 2025-05-07T19:50:25.0189289Z Selected Source Files: 2025-05-07T19:50:25.0189544Z src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp 2025-05-07T19:50:25.0189725Z 
src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp 2025-05-07T19:50:25.0189879Z src/embedding_inplace_ops/embedding_inplace_update.cu 2025-05-07T19:50:25.0189947Z 2025-05-07T19:50:25.0190035Z HIPified Source Files: 2025-05-07T19:50:25.0190039Z 2025-05-07T19:50:25.0190119Z 2025-05-07T19:50:25.0190210Z Library Dependencies: 2025-05-07T19:50:25.0190282Z torch 2025-05-07T19:50:25.0190370Z torch_library 2025-05-07T19:50:25.0190665Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so 2025-05-07T19:50:25.0190967Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:25.0191289Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:25.0191646Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:25.0191916Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:25.0192131Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:25.0192204Z 2025-05-07T19:50:25.0192399Z Output Library: 2025-05-07T19:50:25.0192492Z fbgemm_gpu_embedding_inplace_ops 2025-05-07T19:50:25.0192576Z 2025-05-07T19:50:25.0192654Z Destination Directory: 2025-05-07T19:50:25.0192721Z fbgemm_gpu 2025-05-07T19:50:25.0192819Z ================================================================================ 2025-05-07T19:50:25.0192823Z 2025-05-07T19:50:25.0192838Z 2025-05-07T19:50:25.0192842Z 2025-05-07T19:50:25.0192939Z ================================================================================ 2025-05-07T19:50:25.0193053Z GPU CPP Library Target: fbgemm_gpu_py (SHARED) 2025-05-07T19:50:25.0193114Z 2025-05-07T19:50:25.0193192Z CPU_SRCS: 2025-05-07T19:50:25.0193281Z src/memory_utils/memory_utils.cpp 2025-05-07T19:50:25.0193372Z src/memory_utils/memory_utils_ops.cpp 2025-05-07T19:50:25.0193564Z 
src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp 2025-05-07T19:50:25.0193760Z src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp 2025-05-07T19:50:25.0193946Z src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp 2025-05-07T19:50:25.0194147Z src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp 2025-05-07T19:50:25.0194358Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp 2025-05-07T19:50:25.0194574Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp 2025-05-07T19:50:25.0194707Z src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp 2025-05-07T19:50:25.0194845Z src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp 2025-05-07T19:50:25.0194963Z src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp 2025-05-07T19:50:25.0195068Z src/input_combine_ops/input_combine_cpu.cpp 2025-05-07T19:50:25.0195214Z src/layout_transform_ops/layout_transform_ops_cpu.cpp 2025-05-07T19:50:25.0195309Z src/quantize_ops/quantize_ops_cpu.cpp 2025-05-07T19:50:25.0195409Z src/quantize_ops/quantize_ops_meta.cpp 2025-05-07T19:50:25.0195525Z src/sparse_ops/sparse_async_batched_cumsum.cpp 2025-05-07T19:50:25.0195627Z src/sparse_ops/sparse_ops_cpu.cpp 2025-05-07T19:50:25.0195716Z src/sparse_ops/sparse_ops_meta.cpp 2025-05-07T19:50:25.0195798Z src/tbe/eeg/eeg_models.cpp 2025-05-07T19:50:25.0195883Z src/tbe/eeg/eeg_utils.cpp 2025-05-07T19:50:25.0195977Z src/tbe/eeg/indices_estimator_ops.cpp 2025-05-07T19:50:25.0196065Z src/tbe/eeg/indices_estimator.cpp 2025-05-07T19:50:25.0196155Z src/tbe/eeg/indices_generator_ops.cpp 2025-05-07T19:50:25.0196255Z src/tbe/eeg/indices_generator.cpp 2025-05-07T19:50:25.0196482Z src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp 2025-05-07T19:50:25.0196621Z src/layout_transform_ops/layout_transform_ops_gpu.cpp 2025-05-07T19:50:25.0196829Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp 2025-05-07T19:50:25.0197049Z 
src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp 2025-05-07T19:50:25.0197200Z src/quantize_ops/quantize_ops_gpu.cpp 2025-05-07T19:50:25.0197297Z src/sparse_ops/sparse_ops_gpu.cpp 2025-05-07T19:50:25.0197390Z src/metric_ops/metric_ops_host.cpp 2025-05-07T19:50:25.0197496Z src/input_combine_ops/input_combine_gpu.cpp 2025-05-07T19:50:25.0197677Z src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp 2025-05-07T19:50:25.0197765Z src/topology_utils.cpp 2025-05-07T19:50:25.0197830Z 2025-05-07T19:50:25.0197902Z GPU_SRCS: 2025-05-07T19:50:25.0198016Z src/histogram_binning_calibration_ops.cu 2025-05-07T19:50:25.0198114Z src/input_combine_ops/input_combine.cu 2025-05-07T19:50:25.0198363Z src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu 2025-05-07T19:50:25.0198450Z src/memory_utils/memory_utils.cu 2025-05-07T19:50:25.0198551Z src/memory_utils/memory_utils_ops.cu 2025-05-07T19:50:25.0198728Z src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu 2025-05-07T19:50:25.0198897Z src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu 2025-05-07T19:50:25.0199030Z src/jagged_tensor_ops/dense_to_jagged_forward.cu 2025-05-07T19:50:25.0199157Z src/jagged_tensor_ops/jagged_dense_bmm_forward.cu 2025-05-07T19:50:25.0199396Z src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu 2025-05-07T19:50:25.0199575Z src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu 2025-05-07T19:50:25.0199737Z src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu 2025-05-07T19:50:25.0199869Z src/jagged_tensor_ops/jagged_index_add_2d_forward.cu 2025-05-07T19:50:25.0200009Z src/jagged_tensor_ops/jagged_index_select_2d_forward.cu 2025-05-07T19:50:25.0200145Z src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu 2025-05-07T19:50:25.0200264Z src/jagged_tensor_ops/jagged_softmax_backward.cu 2025-05-07T19:50:25.0200389Z src/jagged_tensor_ops/jagged_softmax_forward.cu 2025-05-07T19:50:25.0200508Z 
src/jagged_tensor_ops/jagged_tensor_ops.cu 2025-05-07T19:50:25.0200655Z src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu 2025-05-07T19:50:25.0200798Z src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu 2025-05-07T19:50:25.0200920Z src/jagged_tensor_ops/jagged_unique_indices.cu 2025-05-07T19:50:25.0201059Z src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu 2025-05-07T19:50:25.0201178Z src/layout_transform_ops/layout_transform_ops.cu 2025-05-07T19:50:25.0201264Z src/metric_ops/metric_ops.cu 2025-05-07T19:50:25.0201477Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu 2025-05-07T19:50:25.0201657Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu 2025-05-07T19:50:25.0201828Z src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu 2025-05-07T19:50:25.0201939Z src/quantize_ops/quantize_bfloat16.cu 2025-05-07T19:50:25.0202043Z src/quantize_ops/quantize_fp8_rowwise.cu 2025-05-07T19:50:25.0202159Z src/quantize_ops/quantize_fused_8bit_rowwise.cu 2025-05-07T19:50:25.0202276Z src/quantize_ops/quantize_fused_nbit_rowwise.cu 2025-05-07T19:50:25.0202384Z src/quantize_ops/quantize_hfp8.cu 2025-05-07T19:50:25.0202476Z src/quantize_ops/quantize_msfp.cu 2025-05-07T19:50:25.0202588Z src/quantize_ops/quantize_padded_fp8_rowwise.cu 2025-05-07T19:50:25.0202688Z src/quantize_ops/quantize_mx.cu 2025-05-07T19:50:25.0202798Z src/sparse_ops/sparse_async_batched_cumsum.cu 2025-05-07T19:50:25.0202918Z src/sparse_ops/sparse_block_bucketize_features.cu 2025-05-07T19:50:25.0203041Z src/sparse_ops/sparse_bucketize_features.cu 2025-05-07T19:50:25.0203162Z src/sparse_ops/sparse_batched_unary_embeddings.cu 2025-05-07T19:50:25.0203292Z src/sparse_ops/sparse_compute_frequency_sequence.cu 2025-05-07T19:50:25.0203423Z src/sparse_ops/sparse_expand_into_jagged_permute.cu 2025-05-07T19:50:25.0203528Z src/sparse_ops/sparse_group_index.cu 2025-05-07T19:50:25.0203616Z src/sparse_ops/sparse_index_add.cu 2025-05-07T19:50:25.0203709Z 
src/sparse_ops/sparse_index_select.cu 2025-05-07T19:50:25.0203816Z src/sparse_ops/sparse_invert_permute.cu 2025-05-07T19:50:25.0203931Z src/sparse_ops/sparse_pack_segments_backward.cu 2025-05-07T19:50:25.0204094Z src/sparse_ops/sparse_pack_segments_forward.cu 2025-05-07T19:50:25.0204188Z src/sparse_ops/sparse_permute_1d.cu 2025-05-07T19:50:25.0204292Z src/sparse_ops/sparse_permute_2d.cu 2025-05-07T19:50:25.0204381Z src/sparse_ops/sparse_permute102.cu 2025-05-07T19:50:25.0204486Z src/sparse_ops/sparse_permute_embeddings.cu 2025-05-07T19:50:25.0204585Z src/sparse_ops/sparse_range.cu 2025-05-07T19:50:25.0204687Z src/sparse_ops/sparse_reorder_batched_ad.cu 2025-05-07T19:50:25.0204784Z src/sparse_ops/sparse_segment_sum_csr.cu 2025-05-07T19:50:25.0204872Z src/sparse_ops/sparse_zipf.cu 2025-05-07T19:50:25.0205058Z 2025-05-07T19:50:25.0205135Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:25.0205139Z 2025-05-07T19:50:25.0205202Z 2025-05-07T19:50:25.0205289Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:25.0205293Z 2025-05-07T19:50:25.0205357Z 2025-05-07T19:50:25.0205432Z OTHER_SRCS: 2025-05-07T19:50:25.0205436Z 2025-05-07T19:50:25.0205510Z 2025-05-07T19:50:25.0205577Z CC_FLAGS: 2025-05-07T19:50:25.0205581Z 2025-05-07T19:50:25.0205647Z 2025-05-07T19:50:25.0205717Z NVCC_FLAGS: 2025-05-07T19:50:25.0205808Z --expt-relaxed-constexpr 2025-05-07T19:50:25.0205899Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:25.0205989Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:25.0206078Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:25.0206147Z 2025-05-07T19:50:25.0206219Z HIPCC_FLAGS: 2025-05-07T19:50:25.0206224Z 2025-05-07T19:50:25.0206288Z 2025-05-07T19:50:25.0206367Z INCLUDE_DIRS: 2025-05-07T19:50:25.0206459Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:25.0206542Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:25.0206646Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:25.0206738Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:25.0206996Z 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include 2025-05-07T19:50:25.0207356Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:25.0207495Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:25.0207642Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:25.0207784Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:25.0207976Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:25.0208162Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:25.0208295Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:25.0208584Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include 2025-05-07T19:50:25.0208655Z 2025-05-07T19:50:25.0208739Z Selected Source Files: 2025-05-07T19:50:25.0208830Z src/memory_utils/memory_utils.cpp 2025-05-07T19:50:25.0208931Z src/memory_utils/memory_utils_ops.cpp 2025-05-07T19:50:25.0209112Z src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp 2025-05-07T19:50:25.0209305Z src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp 2025-05-07T19:50:25.0209505Z src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp 2025-05-07T19:50:25.0209710Z src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp 2025-05-07T19:50:25.0209907Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp 2025-05-07T19:50:25.0210134Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp 2025-05-07T19:50:25.0210269Z src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp 2025-05-07T19:50:25.0210393Z src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp 2025-05-07T19:50:25.0210511Z src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp 2025-05-07T19:50:25.0210628Z src/input_combine_ops/input_combine_cpu.cpp 
2025-05-07T19:50:25.0210764Z src/layout_transform_ops/layout_transform_ops_cpu.cpp 2025-05-07T19:50:25.0210865Z src/quantize_ops/quantize_ops_cpu.cpp 2025-05-07T19:50:25.0210967Z src/quantize_ops/quantize_ops_meta.cpp 2025-05-07T19:50:25.0211078Z src/sparse_ops/sparse_async_batched_cumsum.cpp 2025-05-07T19:50:25.0211211Z src/sparse_ops/sparse_ops_cpu.cpp 2025-05-07T19:50:25.0211313Z src/sparse_ops/sparse_ops_meta.cpp 2025-05-07T19:50:25.0211397Z src/tbe/eeg/eeg_models.cpp 2025-05-07T19:50:25.0211479Z src/tbe/eeg/eeg_utils.cpp 2025-05-07T19:50:25.0211571Z src/tbe/eeg/indices_estimator_ops.cpp 2025-05-07T19:50:25.0211677Z src/tbe/eeg/indices_estimator.cpp 2025-05-07T19:50:25.0211769Z src/tbe/eeg/indices_generator_ops.cpp 2025-05-07T19:50:25.0211861Z src/tbe/eeg/indices_generator.cpp 2025-05-07T19:50:25.0212087Z src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp 2025-05-07T19:50:25.0212223Z src/layout_transform_ops/layout_transform_ops_gpu.cpp 2025-05-07T19:50:25.0212461Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp 2025-05-07T19:50:25.0212679Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp 2025-05-07T19:50:25.0212783Z src/quantize_ops/quantize_ops_gpu.cpp 2025-05-07T19:50:25.0212873Z src/sparse_ops/sparse_ops_gpu.cpp 2025-05-07T19:50:25.0212967Z src/metric_ops/metric_ops_host.cpp 2025-05-07T19:50:25.0213078Z src/input_combine_ops/input_combine_gpu.cpp 2025-05-07T19:50:25.0213258Z src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp 2025-05-07T19:50:25.0213337Z src/topology_utils.cpp 2025-05-07T19:50:25.0213447Z src/histogram_binning_calibration_ops.cu 2025-05-07T19:50:25.0213546Z src/input_combine_ops/input_combine.cu 2025-05-07T19:50:25.0213740Z src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu 2025-05-07T19:50:25.0213828Z src/memory_utils/memory_utils.cu 2025-05-07T19:50:25.0213927Z src/memory_utils/memory_utils_ops.cu 2025-05-07T19:50:25.0214109Z 
src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu 2025-05-07T19:50:25.0214278Z src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu 2025-05-07T19:50:25.0214399Z src/jagged_tensor_ops/dense_to_jagged_forward.cu 2025-05-07T19:50:25.0214519Z src/jagged_tensor_ops/jagged_dense_bmm_forward.cu 2025-05-07T19:50:25.0214764Z src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu 2025-05-07T19:50:25.0214929Z src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu 2025-05-07T19:50:25.0215101Z src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu 2025-05-07T19:50:25.0215231Z src/jagged_tensor_ops/jagged_index_add_2d_forward.cu 2025-05-07T19:50:25.0215369Z src/jagged_tensor_ops/jagged_index_select_2d_forward.cu 2025-05-07T19:50:25.0215502Z src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu 2025-05-07T19:50:25.0215624Z src/jagged_tensor_ops/jagged_softmax_backward.cu 2025-05-07T19:50:25.0215740Z src/jagged_tensor_ops/jagged_softmax_forward.cu 2025-05-07T19:50:25.0215857Z src/jagged_tensor_ops/jagged_tensor_ops.cu 2025-05-07T19:50:25.0216003Z src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu 2025-05-07T19:50:25.0216143Z src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu 2025-05-07T19:50:25.0216255Z src/jagged_tensor_ops/jagged_unique_indices.cu 2025-05-07T19:50:25.0216400Z src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu 2025-05-07T19:50:25.0216521Z src/layout_transform_ops/layout_transform_ops.cu 2025-05-07T19:50:25.0216607Z src/metric_ops/metric_ops.cu 2025-05-07T19:50:25.0216828Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu 2025-05-07T19:50:25.0217009Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu 2025-05-07T19:50:25.0217176Z src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu 2025-05-07T19:50:25.0217282Z src/quantize_ops/quantize_bfloat16.cu 2025-05-07T19:50:25.0217380Z src/quantize_ops/quantize_fp8_rowwise.cu 
2025-05-07T19:50:25.0217492Z src/quantize_ops/quantize_fused_8bit_rowwise.cu 2025-05-07T19:50:25.0217612Z src/quantize_ops/quantize_fused_nbit_rowwise.cu 2025-05-07T19:50:25.0217709Z src/quantize_ops/quantize_hfp8.cu 2025-05-07T19:50:25.0217800Z src/quantize_ops/quantize_msfp.cu 2025-05-07T19:50:25.0217916Z src/quantize_ops/quantize_padded_fp8_rowwise.cu 2025-05-07T19:50:25.0218014Z src/quantize_ops/quantize_mx.cu 2025-05-07T19:50:25.0218176Z src/sparse_ops/sparse_async_batched_cumsum.cu 2025-05-07T19:50:25.0218300Z src/sparse_ops/sparse_block_bucketize_features.cu 2025-05-07T19:50:25.0218406Z src/sparse_ops/sparse_bucketize_features.cu 2025-05-07T19:50:25.0218537Z src/sparse_ops/sparse_batched_unary_embeddings.cu 2025-05-07T19:50:25.0218663Z src/sparse_ops/sparse_compute_frequency_sequence.cu 2025-05-07T19:50:25.0218792Z src/sparse_ops/sparse_expand_into_jagged_permute.cu 2025-05-07T19:50:25.0218891Z src/sparse_ops/sparse_group_index.cu 2025-05-07T19:50:25.0218981Z src/sparse_ops/sparse_index_add.cu 2025-05-07T19:50:25.0219073Z src/sparse_ops/sparse_index_select.cu 2025-05-07T19:50:25.0219224Z src/sparse_ops/sparse_invert_permute.cu 2025-05-07T19:50:25.0219345Z src/sparse_ops/sparse_pack_segments_backward.cu 2025-05-07T19:50:25.0219463Z src/sparse_ops/sparse_pack_segments_forward.cu 2025-05-07T19:50:25.0219555Z src/sparse_ops/sparse_permute_1d.cu 2025-05-07T19:50:25.0219735Z src/sparse_ops/sparse_permute_2d.cu 2025-05-07T19:50:25.0219833Z src/sparse_ops/sparse_permute102.cu 2025-05-07T19:50:25.0219945Z src/sparse_ops/sparse_permute_embeddings.cu 2025-05-07T19:50:25.0220219Z src/sparse_ops/sparse_range.cu 2025-05-07T19:50:25.0220332Z src/sparse_ops/sparse_reorder_batched_ad.cu 2025-05-07T19:50:25.0220438Z src/sparse_ops/sparse_segment_sum_csr.cu 2025-05-07T19:50:25.0220530Z src/sparse_ops/sparse_zipf.cu 2025-05-07T19:50:25.0220609Z 2025-05-07T19:50:25.0220766Z HIPified Source Files: 2025-05-07T19:50:25.0220770Z 2025-05-07T19:50:25.0220841Z 
2025-05-07T19:50:25.0220941Z Library Dependencies: 2025-05-07T19:50:25.0221011Z torch 2025-05-07T19:50:25.0221087Z torch_library 2025-05-07T19:50:25.0221396Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so 2025-05-07T19:50:25.0221662Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:25.0221986Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:25.0222334Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:25.0222616Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:25.0222686Z fbgemm 2025-05-07T19:50:25.0222781Z fbgemm_gpu_sparse_async_cumsum 2025-05-07T19:50:25.0222893Z fbgemm_gpu_embedding_inplace_ops 2025-05-07T19:50:25.0222984Z fbgemm_gpu_tbe_index_select 2025-05-07T19:50:25.0223066Z fbgemm_gpu_tbe_cache 2025-05-07T19:50:25.0223154Z fbgemm_gpu_tbe_optimizers 2025-05-07T19:50:25.0223250Z fbgemm_gpu_tbe_utils 2025-05-07T19:50:25.0223463Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:25.0223534Z 2025-05-07T19:50:25.0223621Z Output Library: 2025-05-07T19:50:25.0223695Z fbgemm_gpu_py 2025-05-07T19:50:25.0223767Z 2025-05-07T19:50:25.0223852Z Destination Directory: 2025-05-07T19:50:25.0223936Z fbgemm_gpu 2025-05-07T19:50:25.0224042Z ================================================================================ 2025-05-07T19:50:25.0224046Z 2025-05-07T19:50:25.0224154Z -- Configuring done (8.4s) 2025-05-07T19:50:25.1393656Z -- Generating done (0.1s) 2025-05-07T19:50:25.1408142Z -- Build files have been written to: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build 2025-05-07T19:50:25.1677618Z Change Dir: '/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build' 2025-05-07T19:50:25.1678408Z 2025-05-07T19:50:25.1678958Z Run Build Command(s): 
/github/home/miniconda/envs/build_binary/bin/ninja -v -j 48 install 2025-05-07T19:50:25.3307819Z [1/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/errorhandler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/errorhandler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/errorhandler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/errorhandler.cpp 2025-05-07T19:50:25.3453566Z [2/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/support.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/support.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/support.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/support.cpp 2025-05-07T19:50:25.3542047Z [3/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonelist.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonelist.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonelist.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonelist.cpp 2025-05-07T19:50:25.3733496Z [4/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonetree.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonetree.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonetree.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonetree.cpp 2025-05-07T19:50:25.3754101Z [5/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/globals.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/globals.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/globals.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/globals.cpp 2025-05-07T19:50:25.3774290Z [6/608] 
/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonestack.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonestack.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonestack.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonestack.cpp 2025-05-07T19:50:25.3898333Z [7/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/osutils.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/osutils.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/osutils.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/osutils.cpp 2025-05-07T19:50:25.3921613Z [8/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonehash.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonehash.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonehash.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonehash.cpp 2025-05-07T19:50:25.3941695Z [9/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/environment.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/environment.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/environment.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/environment.cpp 2025-05-07T19:50:25.3971185Z [10/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/operand.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/operand.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/operand.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/operand.cpp 2025-05-07T19:50:25.4207453Z [11/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE 
-Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zone.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zone.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zone.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zone.cpp 2025-05-07T19:50:25.4527487Z [12/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/type.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/type.cpp.o.d -o 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/type.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/type.cpp 2025-05-07T19:50:25.4571325Z [13/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/target.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/target.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/target.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/target.cpp 2025-05-07T19:50:25.4638657Z [14/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/inst.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/inst.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/inst.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/inst.cpp 2025-05-07T19:50:25.4696291Z [15/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/string.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/string.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/string.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/string.cpp 2025-05-07T19:50:25.4803803Z [16/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/logger.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/logger.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/logger.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/logger.cpp 2025-05-07T19:50:25.4823864Z [17/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64operand.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64operand.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64operand.cpp.o -c 
/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64operand.cpp 2025-05-07T19:50:25.4834350Z In file included from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64operand.cpp:10: 2025-05-07T19:50:25.4836316Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB8() const': 2025-05-07T19:50:25.4839764Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:132:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.4843676Z 132 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:25.4845782Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.4847404Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH4() const': 2025-05-07T19:50:25.4851166Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:133:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.4855055Z 133 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:25.4857080Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.4858660Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS2() const': 2025-05-07T19:50:25.4862163Z 
/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:134:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.4866027Z 134 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:25.4868030Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.4869607Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD1() const': 2025-05-07T19:50:25.4873037Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:135:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.4876748Z 135 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD1() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature); } 2025-05-07T19:50:25.4878674Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.4880294Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB16() const': 2025-05-07T19:50:25.4883967Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:137:112: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.4887930Z 137 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB16() const noexcept { return _signature.subset(kBaseSignatureMask | 
kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:25.4890018Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.4891620Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH8() const': 2025-05-07T19:50:25.4894919Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:138:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.4898497Z 138 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:25.4900923Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.4902620Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS4() const': 2025-05-07T19:50:25.4905998Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:139:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.4910097Z 139 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:25.4912042Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.4913619Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD2() const': 2025-05-07T19:50:25.4916930Z 
/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:140:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.4920818Z 140 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementD); } 2025-05-07T19:50:25.4922927Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.4924570Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB4x4() const': 2025-05-07T19:50:25.4928012Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:141:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.4932036Z 141 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB4x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB4); } 2025-05-07T19:50:25.4934130Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.4935674Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH2x4() const': 2025-05-07T19:50:25.4938897Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:142:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.4942614Z 142 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH2x4() const noexcept { return 
_signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH2); } 2025-05-07T19:50:25.4944696Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.4945388Z At global scope: 2025-05-07T19:50:25.4946815Z cc1plus: note: unrecognized command-line option '-Wno-deprecated-anon-enum-enum-conversion' may have been intended to silence earlier diagnostics 2025-05-07T19:50:25.4957818Z [18/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/instdb.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/instdb.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/instdb.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/instdb.cpp 2025-05-07T19:50:25.4994919Z [19/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitruntime.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitruntime.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitruntime.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitruntime.cpp 2025-05-07T19:50:25.5079914Z [20/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/constpool.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/constpool.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/constpool.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/constpool.cpp 2025-05-07T19:50:25.5310736Z [21/608] 
/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonevector.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonevector.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonevector.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonevector.cpp 2025-05-07T19:50:25.5411442Z [22/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codewriter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codewriter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codewriter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codewriter.cpp 2025-05-07T19:50:25.5444553Z [23/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitterutils.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitterutils.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitterutils.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitterutils.cpp 2025-05-07T19:50:25.5465058Z [24/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64func.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64func.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64func.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64func.cpp 2025-05-07T19:50:25.5476138Z In file included from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64func.cpp:10: 2025-05-07T19:50:25.5478075Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB8() const': 2025-05-07T19:50:25.5481384Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:132:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.5486077Z 132 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:25.5488237Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.5489889Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH4() const': 2025-05-07T19:50:25.5493390Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:133:111: warning: bitwise operation between different 
enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.5497283Z 133 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:25.5499304Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.5500849Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS2() const': 2025-05-07T19:50:25.5504108Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:134:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.5507900Z 134 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:25.5510079Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.5511770Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD1() const': 2025-05-07T19:50:25.5515335Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:135:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.5519195Z 135 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD1() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature); } 2025-05-07T19:50:25.5521202Z | 
~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.5522889Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB16() const': 2025-05-07T19:50:25.5526823Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:137:112: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.5530887Z 137 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB16() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:25.5533008Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.5534789Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH8() const': 2025-05-07T19:50:25.5538182Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:138:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.5542346Z 138 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:25.5544544Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.5546245Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS4() const': 2025-05-07T19:50:25.5549908Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:139:111: warning: bitwise operation between different enumeration types 
'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.5554044Z 139 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:25.5556090Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.5557613Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD2() const': 2025-05-07T19:50:25.5560862Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:140:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.5564779Z 140 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementD); } 2025-05-07T19:50:25.5566942Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.5568634Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB4x4() const': 2025-05-07T19:50:25.5572222Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:141:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.5576036Z 141 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB4x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB4); } 2025-05-07T19:50:25.5578220Z | 
~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.5579982Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH2x4() const': 2025-05-07T19:50:25.5583560Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:142:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.5587595Z 142 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH2x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH2); } 2025-05-07T19:50:25.5589659Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.5590322Z At global scope: 2025-05-07T19:50:25.5591574Z cc1plus: note: unrecognized command-line option '-Wno-deprecated-anon-enum-enum-conversion' may have been intended to silence earlier diagnostics 2025-05-07T19:50:25.5602379Z [25/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/func.cpp.o -MF 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/func.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/func.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/func.cpp 2025-05-07T19:50:25.5623139Z [26/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rastack.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rastack.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rastack.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rastack.cpp 2025-05-07T19:50:25.5644345Z [27/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA 
-DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instdb.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instdb.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instdb.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instdb.cpp 2025-05-07T19:50:25.5655256Z In file included from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64instdb_p.h:12, 2025-05-07T19:50:25.5656522Z from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instdb.cpp:11: 2025-05-07T19:50:25.5658386Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB8() const': 2025-05-07T19:50:25.5662078Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:132:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.5666063Z 132 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:25.5668212Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.5669815Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH4() const': 2025-05-07T19:50:25.5673378Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:133:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 
'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.5677138Z 133 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:25.5679124Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.5680815Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS2() const': 2025-05-07T19:50:25.5684653Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:134:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.5688751Z 134 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:25.5691000Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.5692749Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD1() const': 2025-05-07T19:50:25.5696658Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:135:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.5700698Z 135 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD1() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature); } 2025-05-07T19:50:25.5702488Z | 
~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.5704219Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB16() const': 2025-05-07T19:50:25.5707614Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:137:112: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.5711640Z 137 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB16() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:25.5713570Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.5714971Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH8() const': 2025-05-07T19:50:25.5718277Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:138:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.5721843Z 138 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:25.5723773Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.5725441Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS4() const': 2025-05-07T19:50:25.5728876Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:139:111: warning: bitwise 
operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.5732715Z 139 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:25.5734738Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.5736345Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD2() const': 2025-05-07T19:50:25.5739770Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:140:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.5743698Z 140 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementD); } 2025-05-07T19:50:25.5745622Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.5747231Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB4x4() const': 2025-05-07T19:50:25.5750557Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:141:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.5754509Z 141 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB4x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == 
(RegTraits::kSignature | kSignatureElementB4); } 2025-05-07T19:50:25.5756403Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.5757964Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH2x4() const': 2025-05-07T19:50:25.5761270Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:142:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.5764983Z 142 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH2x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH2); } 2025-05-07T19:50:25.5766953Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.5767569Z At global scope: 2025-05-07T19:50:25.5768799Z cc1plus: note: unrecognized command-line option '-Wno-deprecated-anon-enum-enum-conversion' may have been intended to silence earlier diagnostics 2025-05-07T19:50:25.5778964Z [28/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instapi.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instapi.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instapi.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instapi.cpp 2025-05-07T19:50:25.5790112Z In file included from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64instdb_p.h:12, 2025-05-07T19:50:25.5791446Z from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instapi.cpp:13: 2025-05-07T19:50:25.5793215Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB8() const': 2025-05-07T19:50:25.5800561Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:132:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.5804797Z 132 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:25.5806695Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.5808495Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH4() const': 2025-05-07T19:50:25.5811602Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:133:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.5815152Z 133 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH4() const noexcept { return 
_signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:25.5817092Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.5818782Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS2() const': 2025-05-07T19:50:25.5822215Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:134:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.5825982Z 134 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:25.5827954Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.5829576Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD1() const': 2025-05-07T19:50:25.5833136Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:135:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.5836699Z 135 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD1() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature); } 2025-05-07T19:50:25.5838511Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.5840043Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB16() const': 
2025-05-07T19:50:25.5843572Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:137:112: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.5847561Z 137 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB16() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:25.5849666Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.5851255Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH8() const': 2025-05-07T19:50:25.5854642Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:138:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.5858564Z 138 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:25.5860756Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.5862456Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS4() const': 2025-05-07T19:50:25.5865859Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:139:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.5869715Z 139 | ASMJIT_INLINE_NODEBUG constexpr 
bool isVecS4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:25.5871768Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.5873398Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD2() const': 2025-05-07T19:50:25.5876690Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:140:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.5880649Z 140 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementD); } 2025-05-07T19:50:25.5882855Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.5884475Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB4x4() const': 2025-05-07T19:50:25.5888085Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:141:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.5892096Z 141 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB4x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB4); } 2025-05-07T19:50:25.5894290Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.5895844Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr 
bool asmjit::_abi_1_13::a64::Vec::isVecH2x4() const': 2025-05-07T19:50:25.5899309Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:142:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.5903219Z 142 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH2x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH2); } 2025-05-07T19:50:25.5905340Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.5906044Z At global scope: 2025-05-07T19:50:25.5907402Z cc1plus: note: unrecognized command-line option '-Wno-deprecated-anon-enum-enum-conversion' may have been intended to silence earlier diagnostics 2025-05-07T19:50:25.5918544Z [29/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitter.cpp.o -c 
/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitter.cpp 2025-05-07T19:50:25.6064557Z [30/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/funcargscontext.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/funcargscontext.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/funcargscontext.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/funcargscontext.cpp 2025-05-07T19:50:25.6119647Z [31/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64formatter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64formatter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64formatter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64formatter.cpp 2025-05-07T19:50:25.6130377Z In file included from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64instdb_p.h:12, 2025-05-07T19:50:25.6131625Z from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64formatter.cpp:13: 2025-05-07T19:50:25.6133427Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB8() const': 2025-05-07T19:50:25.6136632Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:132:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.6140417Z 132 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:25.6142358Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.6143915Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH4() const': 2025-05-07T19:50:25.6147220Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:133:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.6151022Z 
133 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:25.6153101Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.6154813Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS2() const': 2025-05-07T19:50:25.6158240Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:134:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.6162191Z 134 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:25.6164344Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.6166109Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD1() const': 2025-05-07T19:50:25.6170083Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:135:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.6174029Z 135 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD1() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature); } 2025-05-07T19:50:25.6175899Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.6177533Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member 
function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB16() const': 2025-05-07T19:50:25.6181190Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:137:112: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.6185134Z 137 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB16() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:25.6187129Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.6188804Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH8() const': 2025-05-07T19:50:25.6192219Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:138:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.6195945Z 138 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:25.6197926Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.6199585Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS4() const': 2025-05-07T19:50:25.6202911Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:139:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated 
[-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.6206789Z 139 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:25.6208953Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.6210685Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD2() const': 2025-05-07T19:50:25.6214385Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:140:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.6218717Z 140 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementD); } 2025-05-07T19:50:25.6220927Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.6222612Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB4x4() const': 2025-05-07T19:50:25.6226092Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:141:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.6230266Z 141 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB4x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB4); } 2025-05-07T19:50:25.6232317Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.6233888Z 
/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH2x4() const': 2025-05-07T19:50:25.6237407Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:142:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.6241253Z 142 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH2x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH2); } 2025-05-07T19:50:25.6243225Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.6243865Z At global scope: 2025-05-07T19:50:25.6245058Z cc1plus: note: unrecognized command-line option '-Wno-deprecated-anon-enum-enum-conversion' may have been intended to silence earlier diagnostics 2025-05-07T19:50:25.6255915Z [32/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/archtraits.cpp.o -MF 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/archtraits.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/archtraits.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/archtraits.cpp 2025-05-07T19:50:25.6266699Z In file included from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/a64archtraits_p.h:13, 2025-05-07T19:50:25.6268022Z from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/archtraits.cpp:16: 2025-05-07T19:50:25.6269829Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB8() const': 2025-05-07T19:50:25.6273269Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h:132:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.6277202Z 132 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:25.6279042Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.6280830Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH4() const': 2025-05-07T19:50:25.6284331Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h:133:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.6288010Z 133 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == 
(RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:25.6290048Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.6291645Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS2() const': 2025-05-07T19:50:25.6294979Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h:134:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.6298701Z 134 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:25.6300639Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.6302170Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD1() const': 2025-05-07T19:50:25.6305413Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h:135:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.6308832Z 135 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD1() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature); } 2025-05-07T19:50:25.6310552Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.6312055Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB16() const': 2025-05-07T19:50:25.6315389Z 
/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h:137:112: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.6319318Z 137 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB16() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:25.6321268Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.6322858Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH8() const': 2025-05-07T19:50:25.6326150Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h:138:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.6330460Z 138 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:25.6332589Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.6334254Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS4() const': 2025-05-07T19:50:25.6337423Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h:139:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.6340986Z 139 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS4() const 
noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:25.6342813Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.6344364Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD2() const': 2025-05-07T19:50:25.6347823Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h:140:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.6351956Z 140 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementD); } 2025-05-07T19:50:25.6354086Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.6355836Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB4x4() const': 2025-05-07T19:50:25.6359383Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h:141:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.6363155Z 141 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB4x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB4); } 2025-05-07T19:50:25.6365419Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.6367369Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h: In member function 'constexpr bool 
asmjit::_abi_1_13::a64::Vec::isVecH2x4() const': 2025-05-07T19:50:25.6371150Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h:142:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.6375335Z 142 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH2x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH2); } 2025-05-07T19:50:25.6377631Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.6378327Z At global scope: 2025-05-07T19:50:25.6379756Z cc1plus: note: unrecognized command-line option '-Wno-deprecated-anon-enum-enum-conversion' may have been intended to silence earlier diagnostics 2025-05-07T19:50:25.6391057Z [33/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emithelper.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emithelper.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emithelper.cpp.o -c 
/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emithelper.cpp 2025-05-07T19:50:25.6594123Z [34/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/armformatter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/armformatter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/armformatter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/armformatter.cpp 2025-05-07T19:50:25.6605905Z In file included from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/armformatter.cpp:12: 2025-05-07T19:50:25.6607943Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB8() const': 2025-05-07T19:50:25.6611560Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:132:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.6615934Z 132 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB8() const noexcept { return _signature.subset(kBaseSignatureMask | 
kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:25.6618092Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.6619830Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH4() const': 2025-05-07T19:50:25.6623606Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:133:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.6627720Z 133 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:25.6629890Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.6631539Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS2() const': 2025-05-07T19:50:25.6635108Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:134:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.6639205Z 134 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:25.6641374Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.6643024Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD1() const': 2025-05-07T19:50:25.6646648Z 
/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:135:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.6650612Z 135 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD1() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature); } 2025-05-07T19:50:25.6652618Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.6654311Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB16() const': 2025-05-07T19:50:25.6658006Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:137:112: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.6662327Z 137 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB16() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:25.6664666Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.6666363Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH8() const': 2025-05-07T19:50:25.6669769Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:138:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.6673845Z 138 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH8() const noexcept { return _signature.subset(kBaseSignatureMask | 
kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:25.6676188Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.6677902Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS4() const': 2025-05-07T19:50:25.6681482Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:139:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.6685887Z 139 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:25.6688103Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.6689784Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD2() const': 2025-05-07T19:50:25.6693374Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:140:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.6697250Z 140 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementD); } 2025-05-07T19:50:25.6699377Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.6701072Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB4x4() const': 2025-05-07T19:50:25.6704692Z 
/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:141:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.6708780Z 141 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB4x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB4); } 2025-05-07T19:50:25.6710949Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.6712653Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH2x4() const': 2025-05-07T19:50:25.6716515Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:142:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.6720676Z 142 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH2x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH2); } 2025-05-07T19:50:25.6722887Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.6723736Z At global scope: 2025-05-07T19:50:25.6725026Z cc1plus: note: unrecognized command-line option '-Wno-deprecated-anon-enum-enum-conversion' may have been intended to silence earlier diagnostics 2025-05-07T19:50:25.6736337Z [35/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include 
-isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86operand.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86operand.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86operand.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86operand.cpp 2025-05-07T19:50:25.6757838Z [36/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/assembler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/assembler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/assembler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/assembler.cpp 2025-05-07T19:50:25.6857785Z [37/608] 
/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/formatter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/formatter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/formatter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/formatter.cpp 2025-05-07T19:50:25.7029048Z [38/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64builder.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64builder.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64builder.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64builder.cpp 2025-05-07T19:50:25.7043743Z In file included from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64emitter.h:12, 2025-05-07T19:50:25.7045264Z from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64assembler.h:10, 2025-05-07T19:50:25.7046557Z from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64builder.cpp:9: 2025-05-07T19:50:25.7048553Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB8() const': 2025-05-07T19:50:25.7052280Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:132:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.7056304Z 132 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:25.7058456Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.7060322Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH4() const': 2025-05-07T19:50:25.7064149Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:133:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated 
[-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.7068221Z 133 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:25.7070191Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.7071807Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS2() const': 2025-05-07T19:50:25.7075386Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:134:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.7079537Z 134 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:25.7081549Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.7083277Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD1() const': 2025-05-07T19:50:25.7086736Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:135:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.7090183Z 135 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD1() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature); } 2025-05-07T19:50:25.7091929Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.7093430Z 
/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB16() const': 2025-05-07T19:50:25.7096610Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:137:112: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.7100276Z 137 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB16() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:25.7102136Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.7103673Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH8() const': 2025-05-07T19:50:25.7106866Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:138:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.7110421Z 138 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:25.7112320Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.7113849Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS4() const': 2025-05-07T19:50:25.7116985Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:139:111: warning: bitwise operation between different enumeration types 
'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.7120811Z 139 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:25.7122740Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.7124296Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD2() const': 2025-05-07T19:50:25.7127418Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:140:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.7131106Z 140 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementD); } 2025-05-07T19:50:25.7132972Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.7134491Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB4x4() const': 2025-05-07T19:50:25.7137722Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:141:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.7141741Z 141 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB4x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | 
kSignatureElementB4); } 2025-05-07T19:50:25.7143774Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.7145495Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH2x4() const': 2025-05-07T19:50:25.7149180Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:142:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.7153302Z 142 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH2x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH2); } 2025-05-07T19:50:25.7155403Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.7156053Z At global scope: 2025-05-07T19:50:25.7157402Z cc1plus: note: unrecognized command-line option '-Wno-deprecated-anon-enum-enum-conversion' may have been intended to silence earlier diagnostics 2025-05-07T19:50:25.7503782Z [39/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64compiler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64compiler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64compiler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64compiler.cpp 2025-05-07T19:50:25.7514932Z In file included from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64emitter.h:12, 2025-05-07T19:50:25.7516366Z from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64assembler.h:10, 2025-05-07T19:50:25.7517787Z from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64compiler.cpp:9: 2025-05-07T19:50:25.7519651Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB8() const': 2025-05-07T19:50:25.7523202Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:132:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.7527186Z 132 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:25.7529225Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.7530928Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH4() const': 2025-05-07T19:50:25.7534478Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:133:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated 
[-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.7538348Z 133 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:25.7540520Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.7542222Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS2() const': 2025-05-07T19:50:25.7545787Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:134:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.7549593Z 134 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:25.7551643Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.7553326Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD1() const': 2025-05-07T19:50:25.7556997Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:135:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.7560793Z 135 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD1() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature); } 2025-05-07T19:50:25.7562621Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.7564229Z 
/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB16() const': 2025-05-07T19:50:25.7567880Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:137:112: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.7571795Z 137 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB16() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:25.7573708Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.7575417Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH8() const': 2025-05-07T19:50:25.7579051Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:138:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.7583459Z 138 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:25.7585590Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.7587318Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS4() const': 2025-05-07T19:50:25.7590830Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:139:111: warning: bitwise operation between different enumeration types 
'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.7594621Z 139 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:25.7596573Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.7598216Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD2() const': 2025-05-07T19:50:25.7601692Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:140:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.7605983Z 140 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementD); } 2025-05-07T19:50:25.7608041Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.7609646Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB4x4() const': 2025-05-07T19:50:25.7613187Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:141:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.7617534Z 141 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB4x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | 
kSignatureElementB4); } 2025-05-07T19:50:25.7619492Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.7621328Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH2x4() const': 2025-05-07T19:50:25.7624746Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:142:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.7628683Z 142 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH2x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH2); } 2025-05-07T19:50:25.7630677Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.7631353Z At global scope: 2025-05-07T19:50:25.7632617Z cc1plus: note: unrecognized command-line option '-Wno-deprecated-anon-enum-enum-conversion' may have been intended to silence earlier diagnostics 2025-05-07T19:50:25.7765182Z [40/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitallocator.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitallocator.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitallocator.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitallocator.cpp 2025-05-07T19:50:25.7983590Z [41/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64emithelper.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64emithelper.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64emithelper.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64emithelper.cpp 2025-05-07T19:50:25.7994762Z In file included from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64emitter.h:12, 2025-05-07T19:50:25.7996314Z from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64emithelper_p.h:13, 2025-05-07T19:50:25.7997499Z from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64emithelper.cpp:14: 2025-05-07T19:50:25.7999371Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member 
function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB8() const': 2025-05-07T19:50:25.8002945Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:132:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.8006736Z 132 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:25.8008789Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.8010441Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH4() const': 2025-05-07T19:50:25.8013853Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:133:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.8017740Z 133 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:25.8019898Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.8021546Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS2() const': 2025-05-07T19:50:25.8024974Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:134:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is 
deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.8028943Z 134 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:25.8030955Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.8032550Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD1() const': 2025-05-07T19:50:25.8036241Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:135:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.8040076Z 135 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD1() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature); } 2025-05-07T19:50:25.8042166Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.8043826Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB16() const': 2025-05-07T19:50:25.8047352Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:137:112: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.8051330Z 137 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB16() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:25.8053392Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
2025-05-07T19:50:25.8054992Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH8() const': 2025-05-07T19:50:25.8058486Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:138:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.8062534Z 138 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:25.8064597Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.8066324Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS4() const': 2025-05-07T19:50:25.8069785Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:139:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.8073726Z 139 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:25.8075705Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.8077348Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD2() const': 2025-05-07T19:50:25.8081049Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:140:111: warning: bitwise operation between 
different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.8084901Z 140 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementD); } 2025-05-07T19:50:25.8086876Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.8088487Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB4x4() const': 2025-05-07T19:50:25.8092244Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:141:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.8096152Z 141 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB4x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB4); } 2025-05-07T19:50:25.8098149Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.8099879Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH2x4() const': 2025-05-07T19:50:25.8103332Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:142:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.8107201Z 142 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH2x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == 
(RegTraits::kSignature | kSignatureElementH2); } 2025-05-07T19:50:25.8109309Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.8109988Z At global scope: 2025-05-07T19:50:25.8111257Z cc1plus: note: unrecognized command-line option '-Wno-deprecated-anon-enum-enum-conversion' may have been intended to silence earlier diagnostics 2025-05-07T19:50:25.8121952Z [42/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/virtmem.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/virtmem.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/virtmem.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/virtmem.cpp 2025-05-07T19:50:25.8141733Z [43/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/compiler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/compiler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/compiler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/compiler.cpp 2025-05-07T19:50:25.8454328Z [44/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/builder.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/builder.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/builder.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/builder.cpp 2025-05-07T19:50:25.8557089Z [45/608] 
/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/cpuinfo.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/cpuinfo.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/cpuinfo.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/cpuinfo.cpp 2025-05-07T19:50:25.9138061Z [46/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86formatter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86formatter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86formatter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86formatter.cpp 2025-05-07T19:50:26.0272588Z [47/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instdb.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instdb.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instdb.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instdb.cpp 2025-05-07T19:50:26.0448147Z [48/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/ralocal.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/ralocal.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/ralocal.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/ralocal.cpp 2025-05-07T19:50:26.1901005Z [49/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instapi.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instapi.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instapi.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instapi.cpp 2025-05-07T19:50:26.2808148Z [50/608] 
/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codeholder.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codeholder.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codeholder.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codeholder.cpp 2025-05-07T19:50:26.3021104Z [51/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64rapass.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64rapass.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64rapass.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64rapass.cpp 2025-05-07T19:50:26.3032012Z In file included from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64emitter.h:12, 2025-05-07T19:50:26.3033396Z from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64assembler.h:10, 2025-05-07T19:50:26.3034552Z from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64rapass.cpp:12: 2025-05-07T19:50:26.3036378Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB8() const': 2025-05-07T19:50:26.3039767Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:132:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:26.3043709Z 132 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:26.3045748Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:26.3047382Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH4() const': 2025-05-07T19:50:26.3050836Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:133:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated 
[-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:26.3055158Z 133 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:26.3057584Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:26.3059270Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS2() const': 2025-05-07T19:50:26.3062916Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:134:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:26.3067104Z 134 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:26.3069131Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:26.3070727Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD1() const': 2025-05-07T19:50:26.3073956Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:135:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:26.3077720Z 135 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD1() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature); } 2025-05-07T19:50:26.3079608Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:26.3081276Z 
/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB16() const': 2025-05-07T19:50:26.3085094Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:137:112: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:26.3089058Z 137 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB16() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:26.3091106Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:26.3092771Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH8() const': 2025-05-07T19:50:26.3096252Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:138:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:26.3100122Z 138 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:26.3102038Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:26.3104052Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS4() const': 2025-05-07T19:50:26.3107357Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:139:111: warning: bitwise operation between different enumeration types 
'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:26.3111207Z 139 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:26.3114870Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:26.3116478Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD2() const': 2025-05-07T19:50:26.3119958Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:140:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:26.3123871Z 140 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementD); } 2025-05-07T19:50:26.3125933Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:26.3127632Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB4x4() const': 2025-05-07T19:50:26.3131212Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:141:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:26.3135143Z 141 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB4x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | 
kSignatureElementB4); } 2025-05-07T19:50:26.3137252Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:26.3138895Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH2x4() const': 2025-05-07T19:50:26.3142458Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:142:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:26.3146157Z 142 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH2x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH2); } 2025-05-07T19:50:26.3148108Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:26.3148760Z At global scope: 2025-05-07T19:50:26.3149968Z cc1plus: note: unrecognized command-line option '-Wno-deprecated-anon-enum-enum-conversion' may have been intended to silence earlier diagnostics 2025-05-07T19:50:26.3684615Z [52/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx2.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx2.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx2.cc.o -c /__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx2.cc 2025-05-07T19:50:26.6014383Z [53/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -mavx512f -mavx512bw -mavx512dq -mavx512vl -fopenmp -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx512.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx512.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx512.cc.o -c /__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx512.cc 2025-05-07T19:50:26.6346886Z [54/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rapass.cpp.o -MF 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rapass.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rapass.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rapass.cpp 2025-05-07T19:50:26.7404602Z [55/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86func.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86func.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86func.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86func.cpp 2025-05-07T19:50:26.8354180Z [56/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA 
-DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86builder.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86builder.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86builder.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86builder.cpp 2025-05-07T19:50:26.9045773Z [57/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtils.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtils.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtils.cc.o -c /__w/FBGEMM/FBGEMM/src/QuantUtils.cc 2025-05-07T19:50:26.9182918Z [58/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86emithelper.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86emithelper.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86emithelper.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86emithelper.cpp 2025-05-07T19:50:26.9405569Z [59/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86compiler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86compiler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86compiler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86compiler.cpp 2025-05-07T19:50:27.4134094Z [60/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64assembler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64assembler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64assembler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64assembler.cpp 2025-05-07T19:50:27.4145836Z In file included from 
/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64emitter.h:12, 2025-05-07T19:50:27.4147303Z from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64assembler.h:10, 2025-05-07T19:50:27.4148548Z from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64assembler.cpp:18: 2025-05-07T19:50:27.4150511Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB8() const': 2025-05-07T19:50:27.4154394Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:132:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:27.4158430Z 132 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:27.4160475Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:27.4162269Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH4() const': 2025-05-07T19:50:27.4165851Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:133:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:27.4169963Z 133 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:27.4172093Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:27.4173808Z 
/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS2() const': 2025-05-07T19:50:27.4177409Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:134:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:27.4181629Z 134 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:27.4183900Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:27.4185638Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD1() const': 2025-05-07T19:50:27.4189209Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:135:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:27.4193399Z 135 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD1() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature); } 2025-05-07T19:50:27.4195364Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:27.4197064Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB16() const': 2025-05-07T19:50:27.4200645Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:137:112: warning: bitwise operation between different enumeration types 
'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:27.4204878Z 137 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB16() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:27.4206947Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:27.4208618Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH8() const': 2025-05-07T19:50:27.4212210Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:138:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:27.4216181Z 138 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:27.4218347Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:27.4220284Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS4() const': 2025-05-07T19:50:27.4223970Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:139:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:27.4227802Z 139 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | 
kSignatureElementS); } 2025-05-07T19:50:27.4229885Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:27.4231560Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD2() const': 2025-05-07T19:50:27.4235120Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:140:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:27.4239164Z 140 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementD); } 2025-05-07T19:50:27.4241218Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:27.4243179Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB4x4() const': 2025-05-07T19:50:27.4246771Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:141:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:27.4250779Z 141 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB4x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB4); } 2025-05-07T19:50:27.4253082Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:27.4254843Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH2x4() const': 2025-05-07T19:50:27.4258449Z 
/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:142:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:27.4262696Z 142 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH2x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH2); } 2025-05-07T19:50:27.4264878Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:27.4265576Z At global scope: 2025-05-07T19:50:27.4266820Z cc1plus: note: unrecognized command-line option '-Wno-deprecated-anon-enum-enum-conversion' may have been intended to silence earlier diagnostics 2025-05-07T19:50:27.7505855Z [61/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86rapass.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86rapass.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86rapass.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86rapass.cpp 
2025-05-07T19:50:28.1737298Z [62/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/Utils.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/Utils.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/Utils.cc.o -c /__w/FBGEMM/FBGEMM/src/Utils.cc 2025-05-07T19:50:28.8375220Z [63/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE 
-Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86assembler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86assembler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86assembler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86assembler.cpp 2025-05-07T19:50:29.3395667Z [64/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/SparseAdagrad.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/SparseAdagrad.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/SparseAdagrad.cc.o -c /__w/FBGEMM/FBGEMM/src/SparseAdagrad.cc 2025-05-07T19:50:29.4457263Z [65/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,asmjit.so -o asmjit.so CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64assembler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64builder.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64compiler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64emithelper.cpp.o 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64formatter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64func.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instapi.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instdb.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64operand.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64rapass.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/armformatter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/archtraits.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/assembler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/builder.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codeholder.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codewriter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/compiler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/constpool.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/cpuinfo.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emithelper.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitterutils.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/environment.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/errorhandler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/formatter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/func.cpp.o 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/funcargscontext.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/globals.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/inst.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/instdb.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitallocator.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitruntime.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/logger.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/operand.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/osutils.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/ralocal.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rapass.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rastack.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/string.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/support.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/target.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/type.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/virtmem.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zone.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonehash.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonelist.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonestack.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonetree.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonevector.cpp.o 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86assembler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86builder.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86compiler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86emithelper.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86formatter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86func.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instapi.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instdb.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86operand.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86rapass.cpp.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so" -Wl,--as-needed && : 2025-05-07T19:50:31.7148186Z [66/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT 
CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RefImplementations.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RefImplementations.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RefImplementations.cc.o -c /__w/FBGEMM/FBGEMM/src/RefImplementations.cc 2025-05-07T19:50:33.7987605Z [67/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RowWiseSparseAdagradFused.cc.o -MF 
CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RowWiseSparseAdagradFused.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RowWiseSparseAdagradFused.cc.o -c /__w/FBGEMM/FBGEMM/src/RowWiseSparseAdagradFused.cc 2025-05-07T19:50:34.0161261Z [68/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cpp.o -MF 
CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cpp 2025-05-07T19:50:34.0379371Z [69/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cpp 2025-05-07T19:50:34.0534705Z [70/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c 
-mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cpp 2025-05-07T19:50:34.0680929Z [71/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cpp 2025-05-07T19:50:35.3695981Z [72/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT 
CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAutovec.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAutovec.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAutovec.cc.o -c /__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAutovec.cc 2025-05-07T19:50:35.6420967Z [73/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cpp 2025-05-07T19:50:36.2689736Z [74/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion 
-Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_weighted_codegen_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_weighted_codegen_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_weighted_codegen_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_weighted_codegen_cpu.cpp 2025-05-07T19:50:36.3910335Z [75/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/inference/embedding_forward_quantized_host.cpp 2025-05-07T19:50:38.1973464Z [76/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include 
-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp 2025-05-07T19:50:38.8838418Z [77/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_config_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_config.dir/src/config/feature_gates.cpp.o -MF CMakeFiles/fbgemm_gpu_config.dir/src/config/feature_gates.cpp.o.d -o CMakeFiles/fbgemm_gpu_config.dir/src/config/feature_gates.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/config/feature_gates.cpp 2025-05-07T19:50:39.4300932Z [78/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_config.so -o fbgemm_gpu_config.so CMakeFiles/fbgemm_gpu_config.dir/src/config/feature_gates.cpp.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so" -Wl,--as-needed && : 2025-05-07T19:50:41.8169321Z [79/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/split_embeddings_utils_cpu.cpp 2025-05-07T19:50:42.4149784Z [80/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtilsAvx2.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtilsAvx2.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtilsAvx2.cc.o -c /__w/FBGEMM/FBGEMM/src/QuantUtilsAvx2.cc 2025-05-07T19:50:43.4980264Z [81/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/split_embeddings_utils_meta.cpp 2025-05-07T19:50:43.8167878Z [82/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/split_embeddings_utils.cpp 2025-05-07T19:50:46.0148675Z [83/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/inference/embedding_forward_quantized_host_cpu.cpp 2025-05-07T19:50:47.9339623Z [84/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_sparse_async_cumsum_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cpp.o -MF CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cpp.o.d -o CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_cumsum.cpp 2025-05-07T19:50:48.1869951Z [85/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/pt2/pt2_autograd_utils.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/pt2/pt2_autograd_utils.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/pt2/pt2_autograd_utils.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/pt2/pt2_autograd_utils.cpp 2025-05-07T19:50:50.9418324Z [86/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/utils/embedding_bounds_check_host_cpu.cpp 2025-05-07T19:50:52.2341569Z [87/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/utils/embedding_bounds_check_host.cpp 2025-05-07T19:50:53.5691727Z [88/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMNBit.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMNBit.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMNBit.cc.o -c /__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMNBit.cc 2025-05-07T19:50:57.8198232Z [89/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_optimizers_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_optimizer_rowwise_adagrad_split.cpp 2025-05-07T19:50:58.0307877Z [90/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/forward/embedding_forward_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/forward/embedding_forward_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/forward/embedding_forward_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/forward/embedding_forward_split_cpu.cpp 2025-05-07T19:51:01.1288343Z [91/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_codegen_meta.cpp 2025-05-07T19:51:02.5430580Z [92/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_codegen_meta.cpp 2025-05-07T19:51:03.2771707Z [93/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_dense_weighted_codegen_meta.cpp 2025-05-07T19:51:04.3212983Z [94/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO 
-DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_pt2_cpu_wrapper.cpp 2025-05-07T19:51:07.8073277Z [95/608] 
/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_meta.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_dense_unweighted_codegen_meta.cpp 2025-05-07T19:51:08.1001985Z [96/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_meta.cpp.o.d -o 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_ssd_weighted_codegen_meta.cpp 2025-05-07T19:51:11.3365628Z [97/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_meta.cpp.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_ssd_unweighted_codegen_meta.cpp 2025-05-07T19:51:12.7296279Z [98/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp 2025-05-07T19:51:13.4918055Z [99/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG 
-std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp 2025-05-07T19:51:14.5376627Z [100/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA 
-DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:51:18.0144507Z [101/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:51:27.9177764Z [102/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_pt2_cuda_wrapper.cpp 2025-05-07T19:51:28.6543040Z [103/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp 2025-05-07T19:51:32.6096048Z [104/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_split_lookup.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_split_lookup.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/inference/embedding_forward_quantized_split_lookup.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_split_lookup.cu.o 2025-05-07T19:51:34.4596268Z [105/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cu.o 
-MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cu.o 2025-05-07T19:51:34.7815319Z [106/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cu.o 2025-05-07T19:51:34.9561958Z [107/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDM.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDM.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDM.cc.o -c /__w/FBGEMM/FBGEMM/src/EmbeddingSpMDM.cc 2025-05-07T19:51:35.1189740Z [108/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate.cu.o 2025-05-07T19:51:35.3025210Z [109/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cu.o 2025-05-07T19:51:35.6555280Z [110/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler 
-DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 
-D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cu.o 2025-05-07T19:51:35.6808842Z [111/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm.so -o fbgemm.so CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDM.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAutovec.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMNBit.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtils.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RefImplementations.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RowWiseSparseAdagradFused.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/SparseAdagrad.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/Utils.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx2.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtilsAvx2.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx512.cc.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so asmjit.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so && : 2025-05-07T19:51:36.0489812Z [112/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate.cu.o 2025-05-07T19:51:36.3746594Z [113/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE 
-Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/reset_weight_momentum.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/reset_weight_momentum.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/reset_weight_momentum.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/reset_weight_momentum.cu.o 2025-05-07T19:51:36.5313954Z [114/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/generate_vbe_metadata.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/generate_vbe_metadata.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/generate_vbe_metadata.cu -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/generate_vbe_metadata.cu.o 2025-05-07T19:51:36.8481399Z [115/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/get_infos_metadata.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/get_infos_metadata.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/get_infos_metadata.cu -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/get_infos_metadata.cu.o 2025-05-07T19:51:45.7472354Z [116/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v1.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v1.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/utils/embedding_bounds_check_v1.cu -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v1.cu.o 2025-05-07T19:51:48.2575916Z [117/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v2.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v2.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/utils/embedding_bounds_check_v2.cu -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v2.cu.o 2025-05-07T19:51:48.8657614Z [118/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_common.so -o fbgemm_gpu_tbe_common.so CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/forward/embedding_forward_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/pt2/pt2_autograd_utils.cpp.o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host.cpp.o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v1.cu.o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v2.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm.so fbgemm_gpu_config.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -lrt -lpthread -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build && : 2025-05-07T19:51:51.8196476Z [119/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_optimizers_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu.o 2025-05-07T19:51:55.5760466Z [120/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_find.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_find.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_find.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_find.cu.o 2025-05-07T19:52:02.1497311Z [121/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_find.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_find.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_find.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_find.cu.o 2025-05-07T19:52:03.0220031Z [122/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_sparse_async_cumsum_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cu.o -MF CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_cumsum.cu -o CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cu.o 2025-05-07T19:52:20.2533321Z [123/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu -o 
CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu.o 2025-05-07T19:52:25.2982543Z [124/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl 
--expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu.o 2025-05-07T19:52:28.2108811Z [125/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu.o 2025-05-07T19:52:34.1909678Z [126/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cu -o 
CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cu.o 2025-05-07T19:52:34.6555923Z [127/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_optimizers_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 
-DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu.o 2025-05-07T19:52:34.8824014Z [128/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_cache.so -o fbgemm_gpu_tbe_cache.so CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cpp.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cpp.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cpp.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cpp.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cpp.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_find.cu.o 
CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_find.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/reset_weight_momentum.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so" -Wl,--as-needed -lcudadevrt -lcudart_static -lrt -lpthread -ldl && : 2025-05-07T19:52:35.2460708Z [129/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_optimizers.so -o fbgemm_gpu_tbe_optimizers.so CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split.cpp.o CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so" -Wl,--as-needed -lcudadevrt -lcudart_static -lrt -lpthread -ldl && : 2025-05-07T19:52:39.6053802Z [130/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_dense_unweighted_nobag_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel.cu.o 2025-05-07T19:52:47.2083102Z [131/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/transpose_embedding_input.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/transpose_embedding_input.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/transpose_embedding_input.cu 
-o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/transpose_embedding_input.cu.o 2025-05-07T19:52:52.6406618Z [132/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr 
--expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_nobag_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel.cu.o 2025-05-07T19:52:54.2808119Z [133/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_dense_weighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_cuda.cu.o 2025-05-07T19:52:56.9755409Z [134/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_dense_unweighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_cuda.cu.o 
2025-05-07T19:52:59.6939388Z [135/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion 
-Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_ssd_unweighted_nobag_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel.cu.o 2025-05-07T19:53:05.2003519Z [136/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include 
-DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu.o 2025-05-07T19:53:05.2027404Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:05.2029675Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:05.2030211Z ^ 2025-05-07T19:53:05.2030506Z 2025-05-07T19:53:05.2030927Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:05.2031615Z 2025-05-07T19:53:05.2033256Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:05.2035268Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:05.2035859Z ^ 2025-05-07T19:53:05.2036157Z 2025-05-07T19:53:05.2037750Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:05.2039852Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:05.2040457Z ^ 2025-05-07T19:53:05.2040769Z 2025-05-07T19:53:05.2042433Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:05.2044554Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:05.2045123Z ^ 2025-05-07T19:53:05.2045457Z 2025-05-07T19:53:05.2045903Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:05.2046540Z 2025-05-07T19:53:05.2048065Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:05.2050134Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:05.2050723Z ^ 2025-05-07T19:53:05.2051009Z 2025-05-07T19:53:05.2052868Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:05.2054808Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:05.2055346Z ^ 
2025-05-07T19:53:05.2055648Z 2025-05-07T19:53:05.2057284Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:05.2059347Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:05.2060322Z ^ 2025-05-07T19:53:05.2060637Z 2025-05-07T19:53:05.2061112Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:05.2061788Z 2025-05-07T19:53:05.2063515Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:05.2065534Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:05.2066079Z ^ 2025-05-07T19:53:05.2066369Z 2025-05-07T19:53:05.2068003Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:05.2070060Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:05.2070653Z ^ 2025-05-07T19:53:05.2070967Z 2025-05-07T19:53:05.2072609Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:05.2074694Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:05.2075302Z ^ 2025-05-07T19:53:05.2075616Z 2025-05-07T19:53:05.2076075Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:05.2076801Z 2025-05-07T19:53:05.2078447Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(233): 
warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:05.2080569Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:05.2081128Z ^ 2025-05-07T19:53:05.2081447Z 2025-05-07T19:53:05.2083174Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:05.2085198Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:05.2085772Z ^ 2025-05-07T19:53:05.2086089Z 2025-05-07T19:53:05.2087730Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:05.2089603Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:05.2090188Z ^ 2025-05-07T19:53:05.2090490Z 2025-05-07T19:53:05.2090932Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:05.2091650Z 2025-05-07T19:53:05.2093329Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:05.2095400Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:05.2095961Z ^ 2025-05-07T19:53:05.2096289Z 2025-05-07T19:53:05.2098264Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:05.2100558Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:05.2101155Z ^ 2025-05-07T19:53:05.2101472Z 2025-05-07T19:53:14.4488474Z [137/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO 
-DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD 
-MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu.o 2025-05-07T19:53:14.4513170Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:14.4515453Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:14.4516231Z ^ 2025-05-07T19:53:14.4516559Z 2025-05-07T19:53:14.4517012Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:14.4517706Z 2025-05-07T19:53:14.4519194Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:14.4521175Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:14.4521743Z ^ 2025-05-07T19:53:14.4522037Z 2025-05-07T19:53:14.4523977Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:14.4525908Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:14.4526473Z ^ 2025-05-07T19:53:14.4526743Z 2025-05-07T19:53:14.4528311Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:14.4530422Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:14.4530966Z ^ 2025-05-07T19:53:14.4531228Z 2025-05-07T19:53:14.4532814Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:14.4534836Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:14.4535591Z ^ 2025-05-07T19:53:14.4535899Z 2025-05-07T19:53:14.4536351Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:14.4537016Z 2025-05-07T19:53:14.4538599Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:14.4540757Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:14.4541343Z ^ 2025-05-07T19:53:14.4541629Z 2025-05-07T19:53:14.4543266Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:14.4545275Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:14.4545855Z ^ 2025-05-07T19:53:14.4546136Z 2025-05-07T19:53:14.4547648Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:14.4549654Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:14.4550231Z ^ 
2025-05-07T19:53:14.4550533Z 2025-05-07T19:53:14.4552051Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:14.4554302Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:14.4555065Z ^ 2025-05-07T19:53:14.4555392Z 2025-05-07T19:53:14.4555833Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:14.4556482Z 2025-05-07T19:53:14.4558080Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:14.4560005Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:14.4560544Z ^ 2025-05-07T19:53:14.4560850Z 2025-05-07T19:53:14.4562374Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:14.4564396Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:14.4564998Z ^ 2025-05-07T19:53:14.4565280Z 2025-05-07T19:53:14.4567145Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:14.4569195Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:14.4569760Z ^ 2025-05-07T19:53:14.4570076Z 2025-05-07T19:53:14.4571670Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:14.4574101Z if (output_d >= 
0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:14.4574882Z ^ 2025-05-07T19:53:14.4575199Z 2025-05-07T19:53:14.4575576Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:14.4576172Z 2025-05-07T19:53:14.4577566Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:14.4579367Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:14.4580045Z ^ 2025-05-07T19:53:14.4580296Z 2025-05-07T19:53:14.4581792Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:14.4584058Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:14.4584656Z ^ 2025-05-07T19:53:14.4584952Z 2025-05-07T19:53:14.4586579Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:14.4588668Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:14.4589221Z ^ 2025-05-07T19:53:14.4589527Z 2025-05-07T19:53:14.4591168Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:14.4593417Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:14.4594046Z ^ 2025-05-07T19:53:14.4594357Z 2025-05-07T19:53:14.4594771Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:14.4595432Z 2025-05-07T19:53:14.4596995Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:14.4598938Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:14.4599548Z ^ 2025-05-07T19:53:14.4599847Z 2025-05-07T19:53:14.4601473Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:14.4603580Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:14.4604093Z ^ 2025-05-07T19:53:14.4604357Z 2025-05-07T19:53:14.4605916Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:14.4607953Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:14.4608477Z ^ 2025-05-07T19:53:14.4608794Z 2025-05-07T19:53:14.5291742Z [138/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_cuda.cu.o 2025-05-07T19:53:15.1793181Z [139/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu.o 2025-05-07T19:53:15.1815870Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:15.1818046Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:15.1818787Z ^ 2025-05-07T19:53:15.1819051Z 2025-05-07T19:53:15.1819537Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:15.1820303Z 2025-05-07T19:53:15.1821787Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:15.1823757Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:15.1824293Z ^ 2025-05-07T19:53:15.1824580Z 2025-05-07T19:53:15.1826148Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:15.1828165Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:15.1828695Z ^ 2025-05-07T19:53:15.1828989Z 2025-05-07T19:53:15.1830591Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:15.1832592Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:15.1833140Z ^ 2025-05-07T19:53:15.1833415Z 2025-05-07T19:53:15.1835101Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:15.1837240Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:15.1838025Z ^ 2025-05-07T19:53:15.1838312Z 2025-05-07T19:53:15.1838737Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:15.1839380Z 2025-05-07T19:53:15.1841006Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:15.1843011Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:15.1843586Z ^ 2025-05-07T19:53:15.1843870Z 2025-05-07T19:53:15.1845697Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:15.1847717Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:15.1848287Z ^ 2025-05-07T19:53:15.1848568Z 2025-05-07T19:53:15.1850116Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:15.1852090Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:15.1852826Z ^ 
2025-05-07T19:53:15.1853140Z 2025-05-07T19:53:15.1854753Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:15.1856894Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:15.1857635Z ^ 2025-05-07T19:53:15.1857933Z 2025-05-07T19:53:15.1858379Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:15.1859073Z 2025-05-07T19:53:15.1860730Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:15.1862798Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:15.1863375Z ^ 2025-05-07T19:53:15.1863664Z 2025-05-07T19:53:15.1865208Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:15.1867251Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:15.1867807Z ^ 2025-05-07T19:53:15.1868104Z 2025-05-07T19:53:15.1869704Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:15.1871671Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:15.1872235Z ^ 2025-05-07T19:53:15.1872541Z 2025-05-07T19:53:15.1874152Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:15.1876285Z if (output_d >= 0 && 
output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:15.1877040Z ^ 2025-05-07T19:53:15.1877333Z 2025-05-07T19:53:15.1877797Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:15.1878427Z 2025-05-07T19:53:15.1879963Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:15.1882255Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:15.1882784Z ^ 2025-05-07T19:53:15.1883049Z 2025-05-07T19:53:15.1884596Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:15.1886578Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:15.1887091Z ^ 2025-05-07T19:53:15.1887382Z 2025-05-07T19:53:15.1889278Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:15.1891222Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:15.1891732Z ^ 2025-05-07T19:53:15.1892004Z 2025-05-07T19:53:15.1893550Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:15.1895651Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:15.1896604Z ^ 2025-05-07T19:53:15.1896865Z 2025-05-07T19:53:15.1897315Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:15.1897965Z 2025-05-07T19:53:15.1899449Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:15.1901444Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:15.1901998Z ^ 2025-05-07T19:53:15.1902287Z 2025-05-07T19:53:15.1903747Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:15.1905654Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:15.1906188Z ^ 2025-05-07T19:53:15.1906489Z 2025-05-07T19:53:15.1908023Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:15.1909958Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:15.1910479Z ^ 2025-05-07T19:53:15.1910792Z 2025-05-07T19:53:16.7917510Z [140/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/radix_sort_pairs.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/radix_sort_pairs.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/radix_sort_pairs.cu -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/radix_sort_pairs.cu.o 2025-05-07T19:53:17.4665537Z [141/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined 
-Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_utils.so -o fbgemm_gpu_tbe_utils.so CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils.cpp.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/generate_vbe_metadata.cu.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/get_infos_metadata.cu.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/radix_sort_pairs.cu.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/transpose_embedding_input.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed 
-Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so" -Wl,--as-needed -lcudadevrt -lcudart_static -lrt -lpthread -ldl && : 2025-05-07T19:53:18.0629626Z [142/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_sparse_async_cumsum.so -o fbgemm_gpu_sparse_async_cumsum.so CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cpp.o CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -lrt -lpthread -ldl && : 2025-05-07T19:53:20.8417287Z [143/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_cuda.cu.o 2025-05-07T19:53:24.5195082Z [144/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu.o 2025-05-07T19:53:34.2268703Z [145/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu.o 2025-05-07T19:53:39.5307582Z [146/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_ssd_weighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_cuda.cu.o 2025-05-07T19:53:43.8162331Z [147/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu.o 2025-05-07T19:53:44.5120130Z [148/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_ssd_unweighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_cuda.cu.o 2025-05-07T19:53:48.0710067Z [149/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adagrad_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adagrad_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adagrad_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_split_adagrad_pt2_autograd.cpp 2025-05-07T19:54:06.4140357Z [150/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_sgd_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_sgd_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_sgd_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_split_sgd_pt2_autograd.cpp 2025-05-07T19:54:10.0896432Z [151/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:54:10.3325541Z [152/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lamb_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lamb_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lamb_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_split_lamb_pt2_autograd.cpp 2025-05-07T19:54:10.7821042Z [153/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu.o 2025-05-07T19:54:11.4954470Z [154/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lars_sgd_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lars_sgd_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lars_sgd_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_split_lars_sgd_pt2_autograd.cpp 2025-05-07T19:54:17.0342967Z [155/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adam_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adam_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adam_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_split_adam_pt2_autograd.cpp 2025-05-07T19:54:19.1204363Z [156/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode 
arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu.o 2025-05-07T19:54:22.5860895Z [157/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_v2_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_v2_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_v2_kernel.cu.o 2025-05-07T19:54:22.5883870Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.5885910Z 
reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.5886769Z ^ 2025-05-07T19:54:22.5890067Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 955 2025-05-07T19:54:22.5892999Z 2025-05-07T19:54:22.5893381Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:22.5893939Z 2025-05-07T19:54:22.5895074Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.5896875Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.5897705Z ^ 2025-05-07T19:54:22.5901044Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1007 2025-05-07T19:54:22.5904264Z 2025-05-07T19:54:22.5905401Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.5907048Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.5907914Z ^ 2025-05-07T19:54:22.5911513Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const 
emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1059 2025-05-07T19:54:22.5914777Z 2025-05-07T19:54:22.5916080Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.5918426Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.5919319Z ^ 2025-05-07T19:54:22.5922773Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1111 2025-05-07T19:54:22.5926250Z 2025-05-07T19:54:22.5927523Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.5929510Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.5930399Z ^ 2025-05-07T19:54:22.5933826Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 
1163 2025-05-07T19:54:22.5937029Z 2025-05-07T19:54:22.5938312Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.5940429Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.5941309Z ^ 2025-05-07T19:54:22.5944754Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1215 2025-05-07T19:54:22.5947970Z 2025-05-07T19:54:22.5949251Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.5951211Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.5952085Z ^ 2025-05-07T19:54:22.5955335Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1267 2025-05-07T19:54:22.5958727Z 2025-05-07T19:54:22.5960045Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.5961982Z reinterpret_cast(&lxu_cache_weights[cache_idx * 
max_D_cache]); 2025-05-07T19:54:22.5963168Z ^ 2025-05-07T19:54:22.5966853Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1319 2025-05-07T19:54:22.5970258Z 2025-05-07T19:54:22.5971619Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.5973519Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.5974423Z ^ 2025-05-07T19:54:22.5977691Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1371 2025-05-07T19:54:22.5981092Z 2025-05-07T19:54:22.5982539Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.5984531Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.5985459Z ^ 2025-05-07T19:54:22.5988644Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const 
index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1423 2025-05-07T19:54:22.5991569Z 2025-05-07T19:54:22.5992714Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.5994406Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.5995304Z ^ 2025-05-07T19:54:22.5998727Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1475 2025-05-07T19:54:22.6001825Z 2025-05-07T19:54:22.6003128Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6005117Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6006009Z ^ 2025-05-07T19:54:22.6009718Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1527 2025-05-07T19:54:22.6012621Z 2025-05-07T19:54:22.6013693Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6019933Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6020905Z ^ 2025-05-07T19:54:22.6024638Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1579 2025-05-07T19:54:22.6027885Z 2025-05-07T19:54:22.6029099Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6031028Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6031891Z ^ 2025-05-07T19:54:22.6035310Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1631 2025-05-07T19:54:22.6038561Z 2025-05-07T19:54:22.6039813Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6041773Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6042639Z ^ 
2025-05-07T19:54:22.6046031Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1683 2025-05-07T19:54:22.6048998Z 2025-05-07T19:54:22.6050280Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6052326Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6053240Z ^ 2025-05-07T19:54:22.6056937Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1735 2025-05-07T19:54:22.6060303Z 2025-05-07T19:54:22.6061616Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6063741Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6064650Z ^ 2025-05-07T19:54:22.6068117Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const 
int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1787 2025-05-07T19:54:22.6071349Z 2025-05-07T19:54:22.6072655Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6074632Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6075525Z ^ 2025-05-07T19:54:22.6079036Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1839 2025-05-07T19:54:22.6082502Z 2025-05-07T19:54:22.6083779Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6085779Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6086669Z ^ 2025-05-07T19:54:22.6090138Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1891 2025-05-07T19:54:22.6093401Z 2025-05-07T19:54:22.6094671Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6096481Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6097362Z ^ 2025-05-07T19:54:22.6101481Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1943 2025-05-07T19:54:22.6104827Z 2025-05-07T19:54:22.6106026Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6108122Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6108970Z ^ 2025-05-07T19:54:22.6112636Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1995 2025-05-07T19:54:22.6115899Z 2025-05-07T19:54:22.6117160Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6119116Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6120015Z ^ 
2025-05-07T19:54:22.6123742Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2047 2025-05-07T19:54:22.6127082Z 2025-05-07T19:54:22.6128368Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6130344Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6131244Z ^ 2025-05-07T19:54:22.6134835Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2099 2025-05-07T19:54:22.6138324Z 2025-05-07T19:54:22.6139748Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6142007Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6142973Z ^ 2025-05-07T19:54:22.6146963Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const 
int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2151 2025-05-07T19:54:22.6150403Z 2025-05-07T19:54:22.6151731Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6153772Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6154842Z ^ 2025-05-07T19:54:22.6158480Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 955 2025-05-07T19:54:22.6161863Z 2025-05-07T19:54:22.6162348Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:22.6163049Z 2025-05-07T19:54:22.6164391Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6166428Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6167326Z ^ 2025-05-07T19:54:22.6170987Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1007 2025-05-07T19:54:22.6174308Z 
2025-05-07T19:54:22.6175634Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6177666Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6178579Z ^ 2025-05-07T19:54:22.6182450Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1059 2025-05-07T19:54:22.6185862Z 2025-05-07T19:54:22.6187181Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6189219Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6190133Z ^ 2025-05-07T19:54:22.6194022Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1111 2025-05-07T19:54:22.6197405Z 2025-05-07T19:54:22.6198764Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6200910Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 
2025-05-07T19:54:22.6202038Z ^ 2025-05-07T19:54:22.6206045Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1163 2025-05-07T19:54:22.6209493Z 2025-05-07T19:54:22.6210857Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6212886Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6213848Z ^ 2025-05-07T19:54:22.6217522Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1215 2025-05-07T19:54:22.6221005Z 2025-05-07T19:54:22.6222359Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6224457Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6225440Z ^ 2025-05-07T19:54:22.6228479Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, 
const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1267 2025-05-07T19:54:22.6231981Z 2025-05-07T19:54:22.6233352Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6235393Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6236348Z ^ 2025-05-07T19:54:22.6240110Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1319 2025-05-07T19:54:22.6243424Z 2025-05-07T19:54:22.6244688Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6246646Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6247696Z ^ 2025-05-07T19:54:22.6251070Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1371 2025-05-07T19:54:22.6254230Z 2025-05-07T19:54:22.6255206Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6256738Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6257460Z ^ 2025-05-07T19:54:22.6260299Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1423 2025-05-07T19:54:22.6262944Z 2025-05-07T19:54:22.6263902Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6265427Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6266114Z ^ 2025-05-07T19:54:22.6268782Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1475 2025-05-07T19:54:22.6271235Z 2025-05-07T19:54:22.6272195Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6273703Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6274390Z ^ 
2025-05-07T19:54:22.6277295Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1527 2025-05-07T19:54:22.6279783Z 2025-05-07T19:54:22.6280752Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6282493Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6283196Z ^ 2025-05-07T19:54:22.6286078Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1579 2025-05-07T19:54:22.6288493Z 2025-05-07T19:54:22.6289477Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6290947Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6291664Z ^ 2025-05-07T19:54:22.6294471Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, 
const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1631 2025-05-07T19:54:22.6296933Z 2025-05-07T19:54:22.6297927Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6299599Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6300425Z ^ 2025-05-07T19:54:22.6303319Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1683 2025-05-07T19:54:22.6306040Z 2025-05-07T19:54:22.6307098Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6308781Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6309539Z ^ 2025-05-07T19:54:22.6313216Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1735 2025-05-07T19:54:22.6316114Z 2025-05-07T19:54:22.6317251Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6319019Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6319783Z ^ 2025-05-07T19:54:22.6322841Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1787 2025-05-07T19:54:22.6325794Z 2025-05-07T19:54:22.6326906Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6328793Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6329613Z ^ 2025-05-07T19:54:22.6332901Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1839 2025-05-07T19:54:22.6335827Z 2025-05-07T19:54:22.6337004Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6338835Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6339832Z ^ 
2025-05-07T19:54:22.6342932Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1891 2025-05-07T19:54:22.6346304Z 2025-05-07T19:54:22.6347715Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6349854Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6350857Z ^ 2025-05-07T19:54:22.6355042Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1943 2025-05-07T19:54:22.6358533Z 2025-05-07T19:54:22.6359923Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6362045Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6362997Z ^ 2025-05-07T19:54:22.6366970Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const 
uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1995 2025-05-07T19:54:22.6370676Z 2025-05-07T19:54:22.6372070Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6374183Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6375156Z ^ 2025-05-07T19:54:22.6379004Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2047 2025-05-07T19:54:22.6382926Z 2025-05-07T19:54:22.6384308Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6386455Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6387436Z ^ 2025-05-07T19:54:22.6391280Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2099 2025-05-07T19:54:22.6394891Z 2025-05-07T19:54:22.6396266Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6398414Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6399306Z ^ 2025-05-07T19:54:22.6403097Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2151 2025-05-07T19:54:22.6406609Z 2025-05-07T19:54:22.6408248Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6410358Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6411309Z ^ 2025-05-07T19:54:22.6415220Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 955 2025-05-07T19:54:22.6418894Z 2025-05-07T19:54:22.6419382Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:22.6420216Z 2025-05-07T19:54:22.6421606Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 
2025-05-07T19:54:22.6423702Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6424666Z ^ 2025-05-07T19:54:22.6428328Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1007 2025-05-07T19:54:22.6431831Z 2025-05-07T19:54:22.6433235Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6435333Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6436318Z ^ 2025-05-07T19:54:22.6440112Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1059 2025-05-07T19:54:22.6443823Z 2025-05-07T19:54:22.6445153Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6447202Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6448170Z ^ 2025-05-07T19:54:22.6452101Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, 
__nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1111 2025-05-07T19:54:22.6455659Z 2025-05-07T19:54:22.6457179Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6459290Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6460414Z ^ 2025-05-07T19:54:22.6464109Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1163 2025-05-07T19:54:22.6467792Z 2025-05-07T19:54:22.6469135Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6471405Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6472327Z ^ 2025-05-07T19:54:22.6475959Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1215 2025-05-07T19:54:22.6479498Z 2025-05-07T19:54:22.6480849Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6483394Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6484350Z ^ 2025-05-07T19:54:22.6488108Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1267 2025-05-07T19:54:22.6491644Z 2025-05-07T19:54:22.6493022Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6495037Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6495986Z ^ 2025-05-07T19:54:22.6499922Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1319 2025-05-07T19:54:22.6503494Z 2025-05-07T19:54:22.6505191Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6507289Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6508276Z ^ 
2025-05-07T19:54:22.6512052Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1371 2025-05-07T19:54:22.6515888Z 2025-05-07T19:54:22.6517291Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6519405Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6520254Z ^ 2025-05-07T19:54:22.6523379Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1423 2025-05-07T19:54:22.6526359Z 2025-05-07T19:54:22.6527555Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6529415Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6530262Z ^ 2025-05-07T19:54:22.6533741Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, 
const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1475 2025-05-07T19:54:22.6536978Z 2025-05-07T19:54:22.6538224Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6540303Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6541201Z ^ 2025-05-07T19:54:22.6544625Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1527 2025-05-07T19:54:22.6547873Z 2025-05-07T19:54:22.6549356Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6551370Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6552496Z ^ 2025-05-07T19:54:22.6556104Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1579 2025-05-07T19:54:22.6559648Z 2025-05-07T19:54:22.6560982Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6563060Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6563984Z ^ 2025-05-07T19:54:22.6567646Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1631 2025-05-07T19:54:22.6571078Z 2025-05-07T19:54:22.6572425Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6574670Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6575600Z ^ 2025-05-07T19:54:22.6579195Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1683 2025-05-07T19:54:22.6582890Z 2025-05-07T19:54:22.6584234Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6586292Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6587215Z ^ 
2025-05-07T19:54:22.6590886Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1735 2025-05-07T19:54:22.6594561Z 2025-05-07T19:54:22.6597432Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6599478Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6600414Z ^ 2025-05-07T19:54:22.6604170Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1787 2025-05-07T19:54:22.6607842Z 2025-05-07T19:54:22.6609201Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6611228Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6612167Z ^ 2025-05-07T19:54:22.6616011Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t 
*, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1839 2025-05-07T19:54:22.6619392Z 2025-05-07T19:54:22.6620914Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6622920Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6623843Z ^ 2025-05-07T19:54:22.6627561Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1891 2025-05-07T19:54:22.6630884Z 2025-05-07T19:54:22.6632087Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6634007Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6634881Z ^ 2025-05-07T19:54:22.6638186Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1943 2025-05-07T19:54:22.6641176Z 2025-05-07T19:54:22.6642327Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6644416Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6645354Z ^ 2025-05-07T19:54:22.6648692Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1995 2025-05-07T19:54:22.6651925Z 2025-05-07T19:54:22.6653062Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6654819Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6655611Z ^ 2025-05-07T19:54:22.6658725Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2047 2025-05-07T19:54:22.6662466Z 2025-05-07T19:54:22.6663854Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6666026Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6666989Z ^ 
2025-05-07T19:54:22.6670742Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2099 2025-05-07T19:54:22.6674267Z 2025-05-07T19:54:22.6675637Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6677760Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6678719Z ^ 2025-05-07T19:54:22.6682819Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2151 2025-05-07T19:54:22.6686423Z 2025-05-07T19:54:22.6687804Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6690260Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6691234Z ^ 2025-05-07T19:54:22.6694988Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const 
uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 955 2025-05-07T19:54:22.6698711Z 2025-05-07T19:54:22.6699199Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:22.6700037Z 2025-05-07T19:54:22.6701433Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6703583Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6704562Z ^ 2025-05-07T19:54:22.6708368Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1007 2025-05-07T19:54:22.6711923Z 2025-05-07T19:54:22.6713283Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6715461Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6716434Z ^ 2025-05-07T19:54:22.6720129Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1059 2025-05-07T19:54:22.6723646Z 
2025-05-07T19:54:22.6724987Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6727103Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6728061Z ^ 2025-05-07T19:54:22.6731848Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1111 2025-05-07T19:54:22.6735395Z 2025-05-07T19:54:22.6736743Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6739116Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6740219Z ^ 2025-05-07T19:54:22.6743959Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1163 2025-05-07T19:54:22.6747584Z 2025-05-07T19:54:22.6748952Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6751092Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 
2025-05-07T19:54:22.6752049Z ^ 2025-05-07T19:54:22.6755823Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1215 2025-05-07T19:54:22.6759374Z 2025-05-07T19:54:22.6760753Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6762890Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6763845Z ^ 2025-05-07T19:54:22.6767626Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1267 2025-05-07T19:54:22.6771138Z 2025-05-07T19:54:22.6772495Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6774435Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6775367Z ^ 2025-05-07T19:54:22.6779030Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const 
index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1319 2025-05-07T19:54:22.6782899Z 2025-05-07T19:54:22.6784277Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6786377Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6787664Z ^ 2025-05-07T19:54:22.6791485Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1371 2025-05-07T19:54:22.6795156Z 2025-05-07T19:54:22.6796565Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6798677Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6799665Z ^ 2025-05-07T19:54:22.6803471Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1423 2025-05-07T19:54:22.6806940Z 2025-05-07T19:54:22.6808193Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6810305Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6811283Z ^ 2025-05-07T19:54:22.6814959Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1475 2025-05-07T19:54:22.6818436Z 2025-05-07T19:54:22.6819938Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6822075Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6823064Z ^ 2025-05-07T19:54:22.6826844Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1527 2025-05-07T19:54:22.6829963Z 2025-05-07T19:54:22.6831181Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6833098Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6834186Z ^ 
2025-05-07T19:54:22.6837512Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1579 2025-05-07T19:54:22.6840805Z 2025-05-07T19:54:22.6842235Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6844203Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6845092Z ^ 2025-05-07T19:54:22.6848543Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1631 2025-05-07T19:54:22.6851854Z 2025-05-07T19:54:22.6853083Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6855116Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6855947Z ^ 2025-05-07T19:54:22.6859265Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, 
const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1683 2025-05-07T19:54:22.6862563Z 2025-05-07T19:54:22.6863793Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6865817Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6866788Z ^ 2025-05-07T19:54:22.6870404Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1735 2025-05-07T19:54:22.6873620Z 2025-05-07T19:54:22.6874990Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6877064Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6878041Z ^ 2025-05-07T19:54:22.6882334Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1787 2025-05-07T19:54:22.6885832Z 2025-05-07T19:54:22.6887190Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6889412Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6890369Z ^ 2025-05-07T19:54:22.6894136Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1839 2025-05-07T19:54:22.6897601Z 2025-05-07T19:54:22.6898942Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6901151Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6902094Z ^ 2025-05-07T19:54:22.6905845Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1891 2025-05-07T19:54:22.6909334Z 2025-05-07T19:54:22.6910681Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6912789Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6913734Z ^ 
2025-05-07T19:54:22.6917452Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1943 2025-05-07T19:54:22.6920933Z 2025-05-07T19:54:22.6922240Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6924269Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6925185Z ^ 2025-05-07T19:54:22.6929128Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1995 2025-05-07T19:54:22.6932492Z 2025-05-07T19:54:22.6933788Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6935925Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6936811Z ^ 2025-05-07T19:54:22.6940565Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const 
uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2047 2025-05-07T19:54:22.6943946Z 2025-05-07T19:54:22.6945244Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6947291Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6948205Z ^ 2025-05-07T19:54:22.6951816Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2099 2025-05-07T19:54:22.6954363Z 2025-05-07T19:54:22.6955355Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6957081Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6957825Z ^ 2025-05-07T19:54:22.6960858Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2151 2025-05-07T19:54:22.6963831Z 2025-05-07T19:54:22.6964949Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6966751Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6967597Z ^ 2025-05-07T19:54:22.6971325Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 955 2025-05-07T19:54:22.6974434Z 2025-05-07T19:54:22.6974966Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:22.6975694Z 2025-05-07T19:54:22.6976946Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.6979169Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6980234Z ^ 2025-05-07T19:54:22.6984060Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1007 2025-05-07T19:54:22.6987400Z 2025-05-07T19:54:22.6988670Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 
2025-05-07T19:54:22.6990658Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.6991557Z ^ 2025-05-07T19:54:22.6994930Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1059 2025-05-07T19:54:22.6997868Z 2025-05-07T19:54:22.6999026Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.7000857Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.7001679Z ^ 2025-05-07T19:54:22.7004973Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1111 2025-05-07T19:54:22.7007975Z 2025-05-07T19:54:22.7009120Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.7010958Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.7011748Z ^ 2025-05-07T19:54:22.7015694Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, 
uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1163 2025-05-07T19:54:22.7019255Z 2025-05-07T19:54:22.7020817Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.7023144Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.7024143Z ^ 2025-05-07T19:54:22.7027985Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1215 2025-05-07T19:54:22.7031531Z 2025-05-07T19:54:22.7032939Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.7034914Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.7035872Z ^ 2025-05-07T19:54:22.7039489Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1267 2025-05-07T19:54:22.7042745Z 
2025-05-07T19:54:22.7044059Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.7046092Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.7047040Z ^ 2025-05-07T19:54:22.7050810Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1319 2025-05-07T19:54:22.7054317Z 2025-05-07T19:54:22.7055682Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.7057993Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.7058961Z ^ 2025-05-07T19:54:22.7062923Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1371 2025-05-07T19:54:22.7066359Z 2025-05-07T19:54:22.7067679Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.7069878Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 
2025-05-07T19:54:22.7070823Z ^ 2025-05-07T19:54:22.7074478Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1423 2025-05-07T19:54:22.7077919Z 2025-05-07T19:54:22.7079227Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.7081239Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.7082423Z ^ 2025-05-07T19:54:22.7086091Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1475 2025-05-07T19:54:22.7089512Z 2025-05-07T19:54:22.7090834Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.7093069Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.7094001Z ^ 2025-05-07T19:54:22.7097708Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, 
const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1527 2025-05-07T19:54:22.7101261Z 2025-05-07T19:54:22.7102585Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.7104652Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.7122069Z ^ 2025-05-07T19:54:22.7126288Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1579 2025-05-07T19:54:22.7129464Z 2025-05-07T19:54:22.7130709Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.7132430Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.7133599Z ^ 2025-05-07T19:54:22.7136857Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1631 2025-05-07T19:54:22.7139815Z 2025-05-07T19:54:22.7140828Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.7142407Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.7143313Z ^ 2025-05-07T19:54:22.7146882Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1683 2025-05-07T19:54:22.7150190Z 2025-05-07T19:54:22.7151483Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.7153428Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.7154351Z ^ 2025-05-07T19:54:22.7157671Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1735 2025-05-07T19:54:22.7161059Z 2025-05-07T19:54:22.7162417Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.7164364Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.7165301Z ^ 
2025-05-07T19:54:22.7169094Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1787 2025-05-07T19:54:22.7172363Z 2025-05-07T19:54:22.7173635Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.7175521Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.7177756Z ^ 2025-05-07T19:54:22.7181466Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1839 2025-05-07T19:54:22.7184922Z 2025-05-07T19:54:22.7186198Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.7188184Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.7189079Z ^ 2025-05-07T19:54:22.7192577Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t 
*, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1891 2025-05-07T19:54:22.7195888Z 2025-05-07T19:54:22.7197142Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.7199098Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.7199993Z ^ 2025-05-07T19:54:22.7203295Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1943 2025-05-07T19:54:22.7206389Z 2025-05-07T19:54:22.7207683Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.7209686Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.7210629Z ^ 2025-05-07T19:54:22.7214771Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1995 2025-05-07T19:54:22.7218210Z 2025-05-07T19:54:22.7219727Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.7221777Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.7222671Z ^ 2025-05-07T19:54:22.7226394Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2047 2025-05-07T19:54:22.7229927Z 2025-05-07T19:54:22.7231303Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.7233440Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.7234421Z ^ 2025-05-07T19:54:22.7238258Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2099 2025-05-07T19:54:22.7241623Z 2025-05-07T19:54:22.7242959Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:22.7244997Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:22.7245958Z ^ 
2025-05-07T19:54:22.7249688Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2151 2025-05-07T19:54:22.7253140Z 2025-05-07T19:54:24.3526185Z [158/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_sgd_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_sgd_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_sgd_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_split_approx_sgd_pt2_autograd.cpp 2025-05-07T19:54:27.2766131Z [159/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_dense_weighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_kernel.cu.o 2025-05-07T19:54:30.0426269Z [160/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:54:30.5311386Z [161/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu.o 2025-05-07T19:54:32.1068792Z [162/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr 
--expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu.o 2025-05-07T19:54:32.2440729Z [163/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:54:32.3799280Z [164/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp 2025-05-07T19:54:32.3956324Z [165/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:54:32.4125365Z [166/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:54:32.4278914Z [167/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:54:32.4437735Z [168/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:54:32.4608207Z [169/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ 
-DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:54:32.4773931Z [170/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp 2025-05-07T19:54:32.4930828Z [171/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma 
-fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:54:32.5098147Z [172/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA 
-DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:54:32.5258505Z [173/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:54:32.5428567Z [174/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:54:32.5595236Z [175/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:54:32.5761737Z [176/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:54:32.5920699Z [177/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ 
-DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:54:32.9407397Z [178/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_none_pt2_autograd.cpp.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_none_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_none_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_split_none_pt2_autograd.cpp 2025-05-07T19:54:35.5725544Z [179/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:54:36.1787421Z [180/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp 2025-05-07T19:54:37.2071768Z [181/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:54:37.7669774Z [182/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu.o 2025-05-07T19:54:40.3720844Z [183/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_kernel.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_dense_unweighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_kernel.cu.o 2025-05-07T19:54:40.4549732Z [184/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp 2025-05-07T19:54:41.8855562Z [185/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 
-gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel_small.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel_small.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_nobag_kernel_small.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel_small.cu.o 2025-05-07T19:54:42.5033777Z [186/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu.o 2025-05-07T19:54:43.8307112Z [187/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE 
-Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp 
2025-05-07T19:54:43.8462491Z [188/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp.o.d -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:54:43.8616946Z [189/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:54:43.8766956Z [190/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:54:43.8934640Z [191/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:54:43.9105734Z [192/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:54:43.9267792Z [193/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:54:44.6268295Z [194/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu.o 2025-05-07T19:54:44.6292336Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:44.6294401Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:44.6294982Z ^ 2025-05-07T19:54:44.6295269Z 2025-05-07T19:54:44.6295738Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:44.6296389Z 2025-05-07T19:54:44.6297962Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:44.6299978Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:44.6300517Z ^ 2025-05-07T19:54:44.6300823Z 2025-05-07T19:54:44.6302235Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:44.6304152Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:44.6304739Z ^ 2025-05-07T19:54:44.6305080Z 2025-05-07T19:54:44.6306643Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:44.6308651Z if 
(output_d >= 0 && output_d < D) { 2025-05-07T19:54:44.6309206Z ^ 2025-05-07T19:54:44.6309479Z 2025-05-07T19:54:44.6309965Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:44.6310622Z 2025-05-07T19:54:44.6312101Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:44.6314083Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:44.6314699Z ^ 2025-05-07T19:54:44.6314977Z 2025-05-07T19:54:44.6316473Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:44.6318424Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:44.6319375Z ^ 2025-05-07T19:54:44.6319718Z 2025-05-07T19:54:44.6321142Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:44.6323000Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:44.6323516Z ^ 2025-05-07T19:54:44.6323789Z 2025-05-07T19:54:44.6324231Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:44.6325134Z 2025-05-07T19:54:44.6326599Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:44.6328554Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:44.6329140Z ^ 2025-05-07T19:54:44.6329435Z 2025-05-07T19:54:44.6330912Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:44.6332875Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:44.6333381Z ^ 2025-05-07T19:54:44.6333700Z 2025-05-07T19:54:44.6335165Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:44.6337075Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:44.6337600Z ^ 2025-05-07T19:54:44.6337923Z 2025-05-07T19:54:44.6338340Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:44.6338978Z 2025-05-07T19:54:44.6340723Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:44.6342666Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:44.6343219Z ^ 2025-05-07T19:54:44.6343505Z 2025-05-07T19:54:44.6345050Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:44.6347059Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:44.6347625Z ^ 2025-05-07T19:54:44.6347902Z 2025-05-07T19:54:44.6349415Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:44.6351347Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:44.6351864Z ^ 
2025-05-07T19:54:44.6352148Z 2025-05-07T19:54:44.6352538Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:44.6353104Z 2025-05-07T19:54:44.6354568Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:44.6356377Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:44.6356896Z ^ 2025-05-07T19:54:44.6357171Z 2025-05-07T19:54:44.6358904Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:44.6360855Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:44.6361382Z ^ 2025-05-07T19:54:44.6361649Z 2025-05-07T19:54:45.6617504Z [195/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:54:46.7988720Z [196/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu.o 2025-05-07T19:54:47.3905753Z [197/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE 
-Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:54:48.4520247Z [198/608] 
/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:54:50.9359301Z [199/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:54:55.0668227Z [200/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c 
-mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:54:55.9925734Z [201/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:54:56.7347148Z [202/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:54:56.9050986Z [203/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:54:58.6448482Z [204/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp 2025-05-07T19:54:59.1819058Z [205/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:55:00.0258896Z [206/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:55:00.1781692Z [207/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_dense_split_cpu.cpp 2025-05-07T19:55:00.5835826Z [208/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:55:01.9445028Z [209/608] 
/github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr 
-D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu.o 2025-05-07T19:55:01.9457978Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:01.9459140Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:01.9459657Z ^ 2025-05-07T19:55:01.9459826Z 2025-05-07T19:55:01.9460096Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:01.9460459Z 2025-05-07T19:55:01.9461305Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:01.9462373Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:01.9462675Z ^ 2025-05-07T19:55:01.9462862Z 2025-05-07T19:55:01.9463709Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:01.9464813Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:01.9465132Z ^ 2025-05-07T19:55:01.9465325Z 2025-05-07T19:55:01.9466172Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:01.9467240Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:01.9467571Z ^ 2025-05-07T19:55:01.9467738Z 2025-05-07T19:55:01.9468006Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:01.9468370Z 2025-05-07T19:55:01.9469216Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:01.9470318Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:01.9470632Z ^ 2025-05-07T19:55:01.9470823Z 2025-05-07T19:55:01.9471670Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:01.9472760Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:01.9473071Z ^ 2025-05-07T19:55:01.9473260Z 2025-05-07T19:55:01.9474100Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:01.9475198Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:01.9475508Z ^ 2025-05-07T19:55:01.9475675Z 2025-05-07T19:55:01.9475940Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:01.9476302Z 2025-05-07T19:55:01.9480008Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with 
zero 2025-05-07T19:55:01.9481223Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:01.9481570Z ^ 2025-05-07T19:55:01.9481742Z 2025-05-07T19:55:01.9482858Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:01.9483959Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:01.9484472Z ^ 2025-05-07T19:55:01.9484669Z 2025-05-07T19:55:01.9485526Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:01.9486625Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:01.9486943Z ^ 2025-05-07T19:55:01.9487138Z 2025-05-07T19:55:01.9487389Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:01.9487751Z 2025-05-07T19:55:01.9488616Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:01.9489677Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:01.9490023Z ^ 2025-05-07T19:55:01.9490191Z 2025-05-07T19:55:01.9491034Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:01.9492121Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:01.9492471Z ^ 2025-05-07T19:55:01.9492644Z 2025-05-07T19:55:01.9493491Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(216): warning #186-D: 
pointless comparison of unsigned integer with zero 2025-05-07T19:55:01.9494583Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:01.9494898Z ^ 2025-05-07T19:55:01.9495092Z 2025-05-07T19:55:01.9495340Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:01.9495716Z 2025-05-07T19:55:01.9496580Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:01.9497644Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:01.9497985Z ^ 2025-05-07T19:55:01.9498154Z 2025-05-07T19:55:01.9498994Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:01.9500205Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:01.9500551Z ^ 2025-05-07T19:55:01.9500721Z 2025-05-07T19:55:02.3155401Z [210/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:55:02.5118601Z [211/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/codegen/training/backward/embedding_backward_dense_host_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/codegen/training/backward/embedding_backward_dense_host_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/codegen/training/backward/embedding_backward_dense_host_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/backward/embedding_backward_dense_host_cpu.cpp 2025-05-07T19:55:03.6548459Z [212/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu -o 
CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu.o 2025-05-07T19:55:03.6570127Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.6572163Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:03.6572864Z ^ 2025-05-07T19:55:03.6573169Z 2025-05-07T19:55:03.6573577Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:03.6574176Z 2025-05-07T19:55:03.6575625Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.6577354Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:03.6577911Z ^ 2025-05-07T19:55:03.6578168Z 2025-05-07T19:55:03.6579829Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.6581540Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:03.6582287Z ^ 2025-05-07T19:55:03.6582556Z 2025-05-07T19:55:03.6583961Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.6585654Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:03.6586507Z ^ 2025-05-07T19:55:03.6586785Z 2025-05-07T19:55:03.6588153Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(250): warning #186-D: 
pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.6590005Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:03.6590718Z ^ 2025-05-07T19:55:03.6590980Z 2025-05-07T19:55:03.6591389Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:03.6592204Z 2025-05-07T19:55:03.6593600Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.6595336Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:03.6595859Z ^ 2025-05-07T19:55:03.6596130Z 2025-05-07T19:55:03.6597519Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.6599203Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:03.6599696Z ^ 2025-05-07T19:55:03.6599958Z 2025-05-07T19:55:03.6601351Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.6603094Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:03.6603598Z ^ 2025-05-07T19:55:03.6603859Z 2025-05-07T19:55:03.6605217Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.6607074Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:03.6607765Z ^ 2025-05-07T19:55:03.6608017Z 2025-05-07T19:55:03.6608405Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T19:55:03.6609006Z 2025-05-07T19:55:03.6610358Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.6612047Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:03.6612589Z ^ 2025-05-07T19:55:03.6612827Z 2025-05-07T19:55:03.6614214Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.6615845Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:03.6616354Z ^ 2025-05-07T19:55:03.6616607Z 2025-05-07T19:55:03.6617928Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.6619805Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:03.6620366Z ^ 2025-05-07T19:55:03.6620608Z 2025-05-07T19:55:03.6621947Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.6623740Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:03.6624622Z ^ 2025-05-07T19:55:03.6624919Z 2025-05-07T19:55:03.6625314Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:03.6625871Z 2025-05-07T19:55:03.6627264Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.6628906Z if (output_d >= 
0 && output_d < D) { 2025-05-07T19:55:03.6629403Z ^ 2025-05-07T19:55:03.6629794Z 2025-05-07T19:55:03.6631134Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.6632829Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:03.6633346Z ^ 2025-05-07T19:55:03.6633604Z 2025-05-07T19:55:03.6634929Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.6636580Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:03.6637098Z ^ 2025-05-07T19:55:03.6637353Z 2025-05-07T19:55:03.6638702Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.6640526Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:03.6641159Z ^ 2025-05-07T19:55:03.6641448Z 2025-05-07T19:55:03.6641829Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:03.6642371Z 2025-05-07T19:55:03.6643696Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.6645312Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:03.6645787Z ^ 2025-05-07T19:55:03.6646003Z 2025-05-07T19:55:03.6647317Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer 
with zero 2025-05-07T19:55:03.6648948Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:03.6649455Z ^ 2025-05-07T19:55:03.6649703Z 2025-05-07T19:55:03.6651031Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.6652679Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:03.6653164Z ^ 2025-05-07T19:55:03.6653424Z 2025-05-07T19:55:03.8229652Z [213/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode 
arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu.o 2025-05-07T19:55:03.8253964Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.8256066Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:03.8256807Z ^ 2025-05-07T19:55:03.8257093Z 2025-05-07T19:55:03.8257558Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:03.8258231Z 2025-05-07T19:55:03.8259995Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 
2025-05-07T19:55:03.8262028Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:03.8262572Z ^ 2025-05-07T19:55:03.8262888Z 2025-05-07T19:55:03.8264362Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.8266276Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:03.8266810Z ^ 2025-05-07T19:55:03.8267086Z 2025-05-07T19:55:03.8268651Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.8270477Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:03.8271045Z ^ 2025-05-07T19:55:03.8271331Z 2025-05-07T19:55:03.8272854Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.8274967Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:03.8276034Z ^ 2025-05-07T19:55:03.8276339Z 2025-05-07T19:55:03.8276777Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:03.8277482Z 2025-05-07T19:55:03.8279022Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.8280939Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:03.8281478Z ^ 2025-05-07T19:55:03.8282272Z 2025-05-07T19:55:03.8283783Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(288): warning #186-D: 
pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.8285648Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:03.8286150Z ^ 2025-05-07T19:55:03.8286417Z 2025-05-07T19:55:03.8287914Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.8289790Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:03.8290356Z ^ 2025-05-07T19:55:03.8290635Z 2025-05-07T19:55:03.8292140Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.8294225Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:03.8294978Z ^ 2025-05-07T19:55:03.8295263Z 2025-05-07T19:55:03.8295702Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:03.8296392Z 2025-05-07T19:55:03.8297982Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.8300119Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:03.8300671Z ^ 2025-05-07T19:55:03.8300976Z 2025-05-07T19:55:03.8302470Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.8304429Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:03.8304990Z ^ 2025-05-07T19:55:03.8305259Z 2025-05-07T19:55:03.8306793Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.8308620Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:03.8309175Z ^ 2025-05-07T19:55:03.8309445Z 2025-05-07T19:55:03.8310976Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.8313013Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:03.8313767Z ^ 2025-05-07T19:55:03.8314066Z 2025-05-07T19:55:03.8314496Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:03.8315150Z 2025-05-07T19:55:03.8316659Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.8318949Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:03.8319518Z ^ 2025-05-07T19:55:03.8319812Z 2025-05-07T19:55:03.8321330Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.8323212Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:03.8323833Z ^ 2025-05-07T19:55:03.8324100Z 2025-05-07T19:55:03.8325883Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.8327768Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:03.8328348Z ^ 
2025-05-07T19:55:03.8328621Z 2025-05-07T19:55:03.8330114Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.8332188Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:03.8332947Z ^ 2025-05-07T19:55:03.8333232Z 2025-05-07T19:55:03.8333668Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:03.8334325Z 2025-05-07T19:55:03.8335807Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.8337750Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:03.8338296Z ^ 2025-05-07T19:55:03.8338582Z 2025-05-07T19:55:03.8340305Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.8342174Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:03.8342728Z ^ 2025-05-07T19:55:03.8342997Z 2025-05-07T19:55:03.8344502Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.8346345Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:03.8346898Z ^ 2025-05-07T19:55:03.8347167Z 2025-05-07T19:55:05.0869430Z [214/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adam_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adam_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adam_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_adam_cpu.cpp 2025-05-07T19:55:06.4519526Z [215/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE 
-Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:55:06.5993202Z [216/608] 
/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adagrad_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adagrad_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adagrad_cpu.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_adagrad_cpu.cpp 2025-05-07T19:55:07.2804915Z [217/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_cpu.cpp.o.d -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_rowwise_adagrad_cpu.cpp 2025-05-07T19:55:07.8835778Z [218/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_sgd_cpu.cpp.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_sgd_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_sgd_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_sgd_cpu.cpp 2025-05-07T19:55:09.1811414Z [219/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lamb_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lamb_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lamb_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_lamb_cpu.cpp 2025-05-07T19:55:09.5478296Z [220/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu.o 2025-05-07T19:55:10.1087989Z [221/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lars_sgd_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lars_sgd_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lars_sgd_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_lars_sgd_cpu.cpp 2025-05-07T19:55:10.8183812Z [222/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp 2025-05-07T19:55:10.9869466Z [223/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp 2025-05-07T19:55:12.6834030Z [224/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_none_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_none_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_none_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_none_cpu.cpp 2025-05-07T19:55:13.3815684Z [225/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:55:14.0311900Z [226/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_sgd_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_sgd_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_sgd_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_approx_sgd_cpu.cpp 2025-05-07T19:55:14.1521370Z [227/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE 
-Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp 2025-05-07T19:55:14.3168909Z [228/608] 
/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o.d -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:55:14.7760545Z [229/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:55:15.0362473Z [230/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward_pt2.so -o fbgemm_gpu_tbe_training_backward_pt2.so CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adagrad_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_sgd_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adam_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lamb_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lars_sgd_pt2_autograd.cpp.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_none_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_sgd_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_tbe_cache.so fbgemm_gpu_tbe_common.so fbgemm_gpu_sparse_async_cumsum.so 
/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so" -Wl,--as-needed fbgemm.so fbgemm_gpu_config.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build && : 2025-05-07T19:55:16.1157049Z [231/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:55:16.2878132Z [232/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include 
-isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:55:16.4290122Z [233/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp 2025-05-07T19:55:18.1744860Z [234/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu.o 2025-05-07T19:55:18.1768225Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:18.1770396Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:18.1771159Z ^ 2025-05-07T19:55:18.1771433Z 2025-05-07T19:55:18.1771865Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:18.1772531Z 2025-05-07T19:55:18.1774031Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:18.1775921Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:18.1776451Z ^ 2025-05-07T19:55:18.1776725Z 2025-05-07T19:55:18.1778232Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:18.1780233Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:18.1780778Z ^ 2025-05-07T19:55:18.1781058Z 2025-05-07T19:55:18.1782840Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:18.1784308Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:18.1784755Z ^ 2025-05-07T19:55:18.1784984Z 2025-05-07T19:55:18.1786073Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:18.1787624Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:18.1788252Z ^ 2025-05-07T19:55:18.1788492Z 2025-05-07T19:55:18.1788875Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:18.1789409Z 2025-05-07T19:55:18.1790498Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:18.1792069Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:18.1792575Z ^ 2025-05-07T19:55:18.1792854Z 2025-05-07T19:55:18.1794763Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:18.1796502Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:18.1796944Z ^ 2025-05-07T19:55:18.1797161Z 2025-05-07T19:55:18.1798463Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:18.1800322Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:18.1800857Z ^ 2025-05-07T19:55:18.1801087Z 2025-05-07T19:55:18.1802491Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:18.1804460Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 
2025-05-07T19:55:18.1805227Z ^ 2025-05-07T19:55:18.1805523Z 2025-05-07T19:55:18.1805977Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:18.1806669Z 2025-05-07T19:55:18.1808189Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:18.1810178Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:18.1810712Z ^ 2025-05-07T19:55:18.1811001Z 2025-05-07T19:55:18.1812600Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:18.1814529Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:18.1815086Z ^ 2025-05-07T19:55:18.1815365Z 2025-05-07T19:55:18.1816902Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:18.1818375Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:18.1818881Z ^ 2025-05-07T19:55:18.1819141Z 2025-05-07T19:55:18.1820671Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:18.1822751Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:18.1823496Z ^ 2025-05-07T19:55:18.1823776Z 2025-05-07T19:55:18.1824208Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:18.1824822Z 2025-05-07T19:55:18.1825973Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:18.1827647Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:18.1828176Z ^ 2025-05-07T19:55:18.1828440Z 2025-05-07T19:55:18.1829938Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:18.1831791Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:18.1832270Z ^ 2025-05-07T19:55:18.1832500Z 2025-05-07T19:55:18.1834038Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:18.1835966Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:18.1836483Z ^ 2025-05-07T19:55:18.1836753Z 2025-05-07T19:55:18.1838212Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:18.1840443Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:18.1841211Z ^ 2025-05-07T19:55:18.1841497Z 2025-05-07T19:55:18.1841949Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:18.1842634Z 2025-05-07T19:55:18.1844128Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:18.1846107Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:18.1846690Z ^ 
2025-05-07T19:55:18.1846982Z 2025-05-07T19:55:18.1848609Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:18.1850556Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:18.1851124Z ^ 2025-05-07T19:55:18.1851403Z 2025-05-07T19:55:18.1852969Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:18.1854871Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:18.1855403Z ^ 2025-05-07T19:55:18.1855668Z 2025-05-07T19:55:18.6994633Z [235/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_adagrad_split_cpu.cpp 2025-05-07T19:55:18.9621220Z [236/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu.o 2025-05-07T19:55:19.6764172Z [237/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_split_cpu.cpp 2025-05-07T19:55:20.2569515Z [238/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_sgd_split_cpu.cpp 2025-05-07T19:55:23.5598209Z [239/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_kernel.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_dense_weighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_kernel.cu.o 2025-05-07T19:55:27.8992795Z [240/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu.o 2025-05-07T19:55:27.9018053Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:27.9020164Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:27.9020790Z ^ 2025-05-07T19:55:27.9021017Z 2025-05-07T19:55:27.9021338Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:27.9021855Z 2025-05-07T19:55:27.9023068Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:27.9024954Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:27.9025415Z ^ 
2025-05-07T19:55:27.9025651Z 2025-05-07T19:55:27.9026947Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:27.9028396Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:27.9028828Z ^ 2025-05-07T19:55:27.9029044Z 2025-05-07T19:55:27.9030351Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:27.9032180Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:27.9032693Z ^ 2025-05-07T19:55:27.9032934Z 2025-05-07T19:55:27.9034337Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:27.9036296Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:27.9036986Z ^ 2025-05-07T19:55:27.9037243Z 2025-05-07T19:55:27.9037644Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:27.9038243Z 2025-05-07T19:55:27.9039583Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:27.9041283Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:27.9041773Z ^ 2025-05-07T19:55:27.9042022Z 2025-05-07T19:55:27.9043444Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:27.9045344Z if (output_d >= 
0 && output_d < D) { 2025-05-07T19:55:27.9045903Z ^ 2025-05-07T19:55:27.9046163Z 2025-05-07T19:55:27.9047732Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:27.9049735Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:27.9050268Z ^ 2025-05-07T19:55:27.9050539Z 2025-05-07T19:55:27.9052102Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:27.9054180Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:27.9054954Z ^ 2025-05-07T19:55:27.9055242Z 2025-05-07T19:55:27.9055871Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:27.9056496Z 2025-05-07T19:55:27.9057979Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:27.9060141Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:27.9060750Z ^ 2025-05-07T19:55:27.9061046Z 2025-05-07T19:55:27.9062569Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:27.9064705Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:27.9065234Z ^ 2025-05-07T19:55:27.9065470Z 2025-05-07T19:55:27.9066857Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer 
with zero 2025-05-07T19:55:27.9068858Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:27.9069437Z ^ 2025-05-07T19:55:27.9069728Z 2025-05-07T19:55:27.9071348Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:27.9073600Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:27.9074387Z ^ 2025-05-07T19:55:27.9074712Z 2025-05-07T19:55:27.9075181Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:27.9075879Z 2025-05-07T19:55:27.9077570Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:27.9079621Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:27.9080219Z ^ 2025-05-07T19:55:27.9080518Z 2025-05-07T19:55:27.9082390Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:27.9084425Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:27.9085028Z ^ 2025-05-07T19:55:27.9085314Z 2025-05-07T19:55:27.9086920Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:27.9088969Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:27.9089563Z ^ 2025-05-07T19:55:27.9089796Z 2025-05-07T19:55:27.9091162Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(254): 
warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:27.9093384Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:27.9094179Z ^ 2025-05-07T19:55:27.9094499Z 2025-05-07T19:55:27.9094964Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:27.9095681Z 2025-05-07T19:55:27.9097290Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:27.9099622Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:27.9100589Z ^ 2025-05-07T19:55:27.9100886Z 2025-05-07T19:55:27.9102522Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:27.9104555Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:27.9105159Z ^ 2025-05-07T19:55:27.9105445Z 2025-05-07T19:55:27.9106967Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:27.9109326Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:27.9109907Z ^ 2025-05-07T19:55:27.9110178Z 2025-05-07T19:55:38.3725192Z [241/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu.o 2025-05-07T19:55:38.3748020Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(219): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:38.3749646Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:38.3750154Z ^ 2025-05-07T19:55:38.3750424Z 2025-05-07T19:55:38.3751305Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:38.3751954Z 2025-05-07T19:55:38.3753424Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(236): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:38.3755068Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:38.3755524Z ^ 2025-05-07T19:55:38.3755761Z 2025-05-07T19:55:38.3757233Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(246): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:38.3759263Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:38.3759764Z ^ 2025-05-07T19:55:38.3760025Z 2025-05-07T19:55:38.3761464Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(219): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:38.3763356Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:38.3763890Z ^ 2025-05-07T19:55:38.3764195Z 2025-05-07T19:55:38.3764610Z Remark: The warnings can be 
suppressed with "-diag-suppress " 2025-05-07T19:55:38.3765231Z 2025-05-07T19:55:38.3766740Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(236): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:38.3768654Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:38.3769170Z ^ 2025-05-07T19:55:38.3769446Z 2025-05-07T19:55:38.3770908Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(246): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:38.3772800Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:38.3773313Z ^ 2025-05-07T19:55:38.3773589Z 2025-05-07T19:55:38.3775008Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(219): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:38.3776888Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:38.3777417Z ^ 2025-05-07T19:55:38.3777711Z 2025-05-07T19:55:38.3778132Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:38.3778768Z 2025-05-07T19:55:38.3780420Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(236): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:38.3782542Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:38.3783088Z ^ 2025-05-07T19:55:38.3783344Z 2025-05-07T19:55:38.3784823Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(246): warning #186-D: pointless comparison of unsigned integer with zero 
2025-05-07T19:55:38.3786640Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:38.3787220Z ^ 2025-05-07T19:55:38.3787505Z 2025-05-07T19:55:38.3788952Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(219): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:38.3790820Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:38.3791711Z ^ 2025-05-07T19:55:38.3792047Z 2025-05-07T19:55:38.3792520Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:38.3793241Z 2025-05-07T19:55:38.3794889Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(236): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:38.3797006Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:38.3797615Z ^ 2025-05-07T19:55:38.3798097Z 2025-05-07T19:55:38.3799435Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(246): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:38.3801306Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:38.3801948Z ^ 2025-05-07T19:55:38.3802285Z 2025-05-07T19:55:38.3803960Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(219): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:38.3806052Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:38.3806680Z ^ 2025-05-07T19:55:38.3807002Z 2025-05-07T19:55:38.3807468Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:38.3808191Z 2025-05-07T19:55:38.3809638Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(236): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:38.3811715Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:38.3812322Z ^ 2025-05-07T19:55:38.3812667Z 2025-05-07T19:55:38.3814332Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(246): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:38.3816183Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:38.3816824Z ^ 2025-05-07T19:55:38.3817148Z 2025-05-07T19:55:43.5894353Z [242/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_kernel.cu.o 2025-05-07T19:55:53.1440551Z [243/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_kernel.cu.o 
2025-05-07T19:56:01.0672400Z [244/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion 
-Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_v2_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_v2_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_v2_kernel.cu.o 2025-05-07T19:56:01.0697506Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.0699769Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.0700749Z ^ 2025-05-07T19:56:01.0704348Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 946 2025-05-07T19:56:01.0707843Z 2025-05-07T19:56:01.0708344Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:01.0709096Z 2025-05-07T19:56:01.0710497Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.0712678Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.0713649Z ^ 2025-05-07T19:56:01.0717365Z detected during instantiation of "void 
split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 996 2025-05-07T19:56:01.0721212Z 2025-05-07T19:56:01.0722626Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.0724781Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.0725767Z ^ 2025-05-07T19:56:01.0729466Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1046 2025-05-07T19:56:01.0733155Z 2025-05-07T19:56:01.0734588Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.0736742Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.0737734Z ^ 2025-05-07T19:56:01.0741361Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, 
USE_LXU_CACHE=false]" at line 1096 2025-05-07T19:56:01.0744530Z 2025-05-07T19:56:01.0745980Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.0748130Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.0749119Z ^ 2025-05-07T19:56:01.0752817Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1146 2025-05-07T19:56:01.0756408Z 2025-05-07T19:56:01.0757832Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.0759999Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.0760973Z ^ 2025-05-07T19:56:01.0764009Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1196 2025-05-07T19:56:01.0782162Z 2025-05-07T19:56:01.0783882Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.0785919Z reinterpret_cast(&lxu_cache_weights[cache_idx * 
max_D_cache]); 2025-05-07T19:56:01.0786869Z ^ 2025-05-07T19:56:01.0790355Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1246 2025-05-07T19:56:01.0793691Z 2025-05-07T19:56:01.0795044Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.0797046Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.0797971Z ^ 2025-05-07T19:56:01.0801463Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1296 2025-05-07T19:56:01.0804565Z 2025-05-07T19:56:01.0805900Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.0807881Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.0808818Z ^ 2025-05-07T19:56:01.0812221Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const 
int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1346 2025-05-07T19:56:01.0815378Z 2025-05-07T19:56:01.0816541Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.0818414Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.0819504Z ^ 2025-05-07T19:56:01.0822993Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1396 2025-05-07T19:56:01.0826301Z 2025-05-07T19:56:01.0827647Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.0829956Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.0830915Z ^ 2025-05-07T19:56:01.0834359Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1446 2025-05-07T19:56:01.0837860Z 2025-05-07T19:56:01.0839170Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: 
integer conversion resulted in a change of sign 2025-05-07T19:56:01.0841241Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.0842142Z ^ 2025-05-07T19:56:01.0845633Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1496 2025-05-07T19:56:01.0848877Z 2025-05-07T19:56:01.0850242Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.0852222Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.0853065Z ^ 2025-05-07T19:56:01.0856265Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1546 2025-05-07T19:56:01.0859572Z 2025-05-07T19:56:01.0860799Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.0862781Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.0863715Z ^ 2025-05-07T19:56:01.0867051Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const 
int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1596 2025-05-07T19:56:01.0870264Z 2025-05-07T19:56:01.0871665Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.0873609Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.0877609Z ^ 2025-05-07T19:56:01.0881121Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1646 2025-05-07T19:56:01.0884384Z 2025-05-07T19:56:01.0886042Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.0887871Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.0888824Z ^ 2025-05-07T19:56:01.0892519Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1696 2025-05-07T19:56:01.0895998Z 2025-05-07T19:56:01.0897404Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.0899675Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.0900639Z ^ 2025-05-07T19:56:01.0904328Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1746 2025-05-07T19:56:01.0907652Z 2025-05-07T19:56:01.0909014Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.0910943Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.0911755Z ^ 2025-05-07T19:56:01.0915079Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1796 2025-05-07T19:56:01.0918297Z 2025-05-07T19:56:01.0919570Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.0921525Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.0922401Z ^ 2025-05-07T19:56:01.0925840Z 
detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1846 2025-05-07T19:56:01.0928943Z 2025-05-07T19:56:01.0930242Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.0932395Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.0933323Z ^ 2025-05-07T19:56:01.0936889Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1896 2025-05-07T19:56:01.0940500Z 2025-05-07T19:56:01.0941941Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.0944112Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.0945100Z ^ 2025-05-07T19:56:01.0948970Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, 
cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1946 2025-05-07T19:56:01.0952425Z 2025-05-07T19:56:01.0953833Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.0955755Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.0956689Z ^ 2025-05-07T19:56:01.0960265Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1996 2025-05-07T19:56:01.0963570Z 2025-05-07T19:56:01.0964926Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.0966960Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.0967906Z ^ 2025-05-07T19:56:01.0971697Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2046 2025-05-07T19:56:01.0975014Z 2025-05-07T19:56:01.0976239Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 
2025-05-07T19:56:01.0978280Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.0979535Z ^ 2025-05-07T19:56:01.0982641Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2096 2025-05-07T19:56:01.0985450Z 2025-05-07T19:56:01.0986694Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.0988556Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.0989463Z ^ 2025-05-07T19:56:01.0992694Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 946 2025-05-07T19:56:01.0995675Z 2025-05-07T19:56:01.0996133Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:01.0996850Z 2025-05-07T19:56:01.0998263Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1000406Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1001383Z ^ 2025-05-07T19:56:01.1005083Z detected during instantiation of "void 
split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 996 2025-05-07T19:56:01.1008478Z 2025-05-07T19:56:01.1009891Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1012033Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1013004Z ^ 2025-05-07T19:56:01.1016825Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1046 2025-05-07T19:56:01.1020255Z 2025-05-07T19:56:01.1021607Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1023708Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1024834Z ^ 2025-05-07T19:56:01.1028361Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, 
index_t=int32_t, USE_LXU_CACHE=false]" at line 1096 2025-05-07T19:56:01.1031474Z 2025-05-07T19:56:01.1032666Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1034634Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1035526Z ^ 2025-05-07T19:56:01.1038610Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1146 2025-05-07T19:56:01.1041998Z 2025-05-07T19:56:01.1043395Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1045555Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1046526Z ^ 2025-05-07T19:56:01.1050404Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1196 2025-05-07T19:56:01.1053869Z 2025-05-07T19:56:01.1055259Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1057217Z 
reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1058119Z ^ 2025-05-07T19:56:01.1062053Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1246 2025-05-07T19:56:01.1065436Z 2025-05-07T19:56:01.1066799Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1068926Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1069858Z ^ 2025-05-07T19:56:01.1073000Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1296 2025-05-07T19:56:01.1075900Z 2025-05-07T19:56:01.1077172Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1079103Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1080035Z ^ 2025-05-07T19:56:01.1083825Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, 
fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1346 2025-05-07T19:56:01.1087192Z 2025-05-07T19:56:01.1088599Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1090687Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1091590Z ^ 2025-05-07T19:56:01.1094830Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1396 2025-05-07T19:56:01.1098012Z 2025-05-07T19:56:01.1099524Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1101656Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1102621Z ^ 2025-05-07T19:56:01.1106587Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1446 2025-05-07T19:56:01.1109943Z 2025-05-07T19:56:01.1111320Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1113466Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1114440Z ^ 2025-05-07T19:56:01.1118042Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1496 2025-05-07T19:56:01.1121687Z 2025-05-07T19:56:01.1123052Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1125152Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1126095Z ^ 2025-05-07T19:56:01.1129716Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1546 2025-05-07T19:56:01.1133022Z 2025-05-07T19:56:01.1134325Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1136373Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1137260Z ^ 
2025-05-07T19:56:01.1140683Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1596 2025-05-07T19:56:01.1143830Z 2025-05-07T19:56:01.1144896Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1146720Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1147563Z ^ 2025-05-07T19:56:01.1150977Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1646 2025-05-07T19:56:01.1154269Z 2025-05-07T19:56:01.1155819Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1158006Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1158905Z ^ 2025-05-07T19:56:01.1162444Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, 
output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1696 2025-05-07T19:56:01.1165821Z 2025-05-07T19:56:01.1167124Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1169068Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1170024Z ^ 2025-05-07T19:56:01.1173501Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1746 2025-05-07T19:56:01.1176722Z 2025-05-07T19:56:01.1178030Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1180194Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1181080Z ^ 2025-05-07T19:56:01.1184497Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1796 2025-05-07T19:56:01.1187881Z 2025-05-07T19:56:01.1189261Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer 
conversion resulted in a change of sign 2025-05-07T19:56:01.1191355Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1192312Z ^ 2025-05-07T19:56:01.1195947Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1846 2025-05-07T19:56:01.1199284Z 2025-05-07T19:56:01.1201039Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1203172Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1204124Z ^ 2025-05-07T19:56:01.1207702Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1896 2025-05-07T19:56:01.1211328Z 2025-05-07T19:56:01.1212732Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1214850Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1215786Z ^ 2025-05-07T19:56:01.1219534Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t 
*, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1946 2025-05-07T19:56:01.1222949Z 2025-05-07T19:56:01.1224311Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1226357Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1227265Z ^ 2025-05-07T19:56:01.1230901Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1996 2025-05-07T19:56:01.1234122Z 2025-05-07T19:56:01.1235415Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1237510Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1238440Z ^ 2025-05-07T19:56:01.1241913Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2046 2025-05-07T19:56:01.1245026Z 
2025-05-07T19:56:01.1246399Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1248641Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1249538Z ^ 2025-05-07T19:56:01.1252938Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2096 2025-05-07T19:56:01.1256597Z 2025-05-07T19:56:01.1257930Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1260081Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1260961Z ^ 2025-05-07T19:56:01.1264365Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 946 2025-05-07T19:56:01.1267709Z 2025-05-07T19:56:01.1268200Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:01.1268780Z 2025-05-07T19:56:01.1269860Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 
2025-05-07T19:56:01.1271510Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1272238Z ^ 2025-05-07T19:56:01.1274973Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 996 2025-05-07T19:56:01.1277652Z 2025-05-07T19:56:01.1278685Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1280389Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1281173Z ^ 2025-05-07T19:56:01.1284273Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1046 2025-05-07T19:56:01.1287359Z 2025-05-07T19:56:01.1288558Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1290792Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1291635Z ^ 2025-05-07T19:56:01.1294827Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, 
fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1096 2025-05-07T19:56:01.1298013Z 2025-05-07T19:56:01.1299109Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1300950Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1301713Z ^ 2025-05-07T19:56:01.1304636Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1146 2025-05-07T19:56:01.1307479Z 2025-05-07T19:56:01.1308723Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1310687Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1311580Z ^ 2025-05-07T19:56:01.1314898Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1196 2025-05-07T19:56:01.1318042Z 2025-05-07T19:56:01.1319241Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1321109Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1321967Z ^ 2025-05-07T19:56:01.1325371Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1246 2025-05-07T19:56:01.1328444Z 2025-05-07T19:56:01.1329582Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1331445Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1332578Z ^ 2025-05-07T19:56:01.1336229Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1296 2025-05-07T19:56:01.1339568Z 2025-05-07T19:56:01.1340907Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1342785Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1343516Z ^ 2025-05-07T19:56:01.1346465Z 
detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1346 2025-05-07T19:56:01.1349797Z 2025-05-07T19:56:01.1351151Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1353201Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1354340Z ^ 2025-05-07T19:56:01.1357939Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1396 2025-05-07T19:56:01.1361257Z 2025-05-07T19:56:01.1362658Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1364687Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1365624Z ^ 2025-05-07T19:56:01.1369209Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, 
cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1446 2025-05-07T19:56:01.1372479Z 2025-05-07T19:56:01.1373844Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1375907Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1376839Z ^ 2025-05-07T19:56:01.1380381Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1496 2025-05-07T19:56:01.1383905Z 2025-05-07T19:56:01.1385299Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1387693Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1388700Z ^ 2025-05-07T19:56:01.1392394Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1546 2025-05-07T19:56:01.1395698Z 2025-05-07T19:56:01.1397042Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 
2025-05-07T19:56:01.1399206Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1400152Z ^ 2025-05-07T19:56:01.1403676Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1596 2025-05-07T19:56:01.1406873Z 2025-05-07T19:56:01.1408192Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1410353Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1411257Z ^ 2025-05-07T19:56:01.1414705Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1646 2025-05-07T19:56:01.1417945Z 2025-05-07T19:56:01.1419430Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1421478Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1422419Z ^ 2025-05-07T19:56:01.1426377Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, 
uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1696 2025-05-07T19:56:01.1429598Z 2025-05-07T19:56:01.1430970Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1434797Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1435726Z ^ 2025-05-07T19:56:01.1439175Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1746 2025-05-07T19:56:01.1442634Z 2025-05-07T19:56:01.1444027Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1446134Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1447090Z ^ 2025-05-07T19:56:01.1450706Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1796 2025-05-07T19:56:01.1454060Z 2025-05-07T19:56:01.1455348Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1457337Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1458213Z ^ 2025-05-07T19:56:01.1461936Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1846 2025-05-07T19:56:01.1465357Z 2025-05-07T19:56:01.1466751Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1468838Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1469776Z ^ 2025-05-07T19:56:01.1473515Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1896 2025-05-07T19:56:01.1476834Z 2025-05-07T19:56:01.1478144Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1480115Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1481247Z ^ 
2025-05-07T19:56:01.1485147Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1946 2025-05-07T19:56:01.1488622Z 2025-05-07T19:56:01.1489925Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1491852Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1492782Z ^ 2025-05-07T19:56:01.1496368Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1996 2025-05-07T19:56:01.1499777Z 2025-05-07T19:56:01.1501119Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1503106Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1503995Z ^ 2025-05-07T19:56:01.1507588Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t 
*, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2046 2025-05-07T19:56:01.1510824Z 2025-05-07T19:56:01.1512145Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1514187Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1515092Z ^ 2025-05-07T19:56:01.1519092Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2096 2025-05-07T19:56:01.1522396Z 2025-05-07T19:56:01.1523780Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1525868Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1526803Z ^ 2025-05-07T19:56:01.1530667Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 946 2025-05-07T19:56:01.1533813Z 2025-05-07T19:56:01.1534308Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:01.1534976Z 2025-05-07T19:56:01.1536282Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1538378Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1539485Z ^ 2025-05-07T19:56:01.1542971Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 996 2025-05-07T19:56:01.1546315Z 2025-05-07T19:56:01.1547643Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1549786Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1550774Z ^ 2025-05-07T19:56:01.1554277Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1046 2025-05-07T19:56:01.1557545Z 2025-05-07T19:56:01.1558856Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1560877Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1561797Z ^ 2025-05-07T19:56:01.1565423Z 
detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1096 2025-05-07T19:56:01.1568822Z 2025-05-07T19:56:01.1570153Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1572241Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1573167Z ^ 2025-05-07T19:56:01.1576913Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1146 2025-05-07T19:56:01.1580331Z 2025-05-07T19:56:01.1581637Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1583928Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1584822Z ^ 2025-05-07T19:56:01.1588154Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, 
cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1196 2025-05-07T19:56:01.1591349Z 2025-05-07T19:56:01.1592676Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1594665Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1595449Z ^ 2025-05-07T19:56:01.1598761Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1246 2025-05-07T19:56:01.1601973Z 2025-05-07T19:56:01.1603371Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1605470Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1606434Z ^ 2025-05-07T19:56:01.1610325Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1296 2025-05-07T19:56:01.1613664Z 2025-05-07T19:56:01.1614983Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 
2025-05-07T19:56:01.1617062Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1618192Z ^ 2025-05-07T19:56:01.1621887Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1346 2025-05-07T19:56:01.1625312Z 2025-05-07T19:56:01.1626724Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1628646Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1629524Z ^ 2025-05-07T19:56:01.1633049Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1396 2025-05-07T19:56:01.1636465Z 2025-05-07T19:56:01.1637825Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1639946Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1640762Z ^ 2025-05-07T19:56:01.1644359Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, 
uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1446 2025-05-07T19:56:01.1647564Z 2025-05-07T19:56:01.1648965Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1651049Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1651994Z ^ 2025-05-07T19:56:01.1655437Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1496 2025-05-07T19:56:01.1658769Z 2025-05-07T19:56:01.1660381Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1662414Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1663313Z ^ 2025-05-07T19:56:01.1666763Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1546 2025-05-07T19:56:01.1670124Z 2025-05-07T19:56:01.1671428Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1673403Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1674325Z ^ 2025-05-07T19:56:01.1677703Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1596 2025-05-07T19:56:01.1680861Z 2025-05-07T19:56:01.1682310Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1684335Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1685281Z ^ 2025-05-07T19:56:01.1688822Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1646 2025-05-07T19:56:01.1692199Z 2025-05-07T19:56:01.1693598Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1695674Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1696630Z ^ 
2025-05-07T19:56:01.1700280Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1696 2025-05-07T19:56:01.1703417Z 2025-05-07T19:56:01.1705062Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1707100Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1707986Z ^ 2025-05-07T19:56:01.1711255Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1746 2025-05-07T19:56:01.1714794Z 2025-05-07T19:56:01.1716305Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1718321Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1719266Z ^ 2025-05-07T19:56:01.1722628Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t 
*, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1796 2025-05-07T19:56:01.1725933Z 2025-05-07T19:56:01.1727307Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1729442Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1730372Z ^ 2025-05-07T19:56:01.1733966Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1846 2025-05-07T19:56:01.1737339Z 2025-05-07T19:56:01.1738673Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1740942Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1741906Z ^ 2025-05-07T19:56:01.1745440Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1896 2025-05-07T19:56:01.1748645Z 2025-05-07T19:56:01.1750043Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: 
integer conversion resulted in a change of sign 2025-05-07T19:56:01.1752393Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1753321Z ^ 2025-05-07T19:56:01.1756859Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1946 2025-05-07T19:56:01.1760443Z 2025-05-07T19:56:01.1761761Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1763845Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1764797Z ^ 2025-05-07T19:56:01.1768410Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1996 2025-05-07T19:56:01.1771765Z 2025-05-07T19:56:01.1773125Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1775231Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1776151Z ^ 2025-05-07T19:56:01.1780009Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t 
*, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2046 2025-05-07T19:56:01.1783626Z 2025-05-07T19:56:01.1785033Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1787053Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1787986Z ^ 2025-05-07T19:56:01.1791268Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2096 2025-05-07T19:56:01.1794497Z 2025-05-07T19:56:01.1795787Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1797701Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1798945Z ^ 2025-05-07T19:56:01.1802191Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 946 2025-05-07T19:56:01.1805430Z 
2025-05-07T19:56:01.1805880Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:01.1806533Z 2025-05-07T19:56:01.1807771Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1809718Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1810555Z ^ 2025-05-07T19:56:01.1813977Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 996 2025-05-07T19:56:01.1817355Z 2025-05-07T19:56:01.1818707Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1820790Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1821648Z ^ 2025-05-07T19:56:01.1824969Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1046 2025-05-07T19:56:01.1828054Z 2025-05-07T19:56:01.1829352Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 
2025-05-07T19:56:01.1831326Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1832242Z ^ 2025-05-07T19:56:01.1835656Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1096 2025-05-07T19:56:01.1838934Z 2025-05-07T19:56:01.1840310Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1842212Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1843315Z ^ 2025-05-07T19:56:01.1846734Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1146 2025-05-07T19:56:01.1850046Z 2025-05-07T19:56:01.1851382Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1853476Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1854471Z ^ 2025-05-07T19:56:01.1858109Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, 
uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1196 2025-05-07T19:56:01.1861602Z 2025-05-07T19:56:01.1862972Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1865105Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1866010Z ^ 2025-05-07T19:56:01.1869478Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1246 2025-05-07T19:56:01.1872832Z 2025-05-07T19:56:01.1874213Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1876299Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1877184Z ^ 2025-05-07T19:56:01.1880741Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1296 2025-05-07T19:56:01.1884355Z 2025-05-07T19:56:01.1885696Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1887780Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1888641Z ^ 2025-05-07T19:56:01.1892477Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1346 2025-05-07T19:56:01.1895921Z 2025-05-07T19:56:01.1897310Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1899752Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1900692Z ^ 2025-05-07T19:56:01.1904382Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1396 2025-05-07T19:56:01.1907839Z 2025-05-07T19:56:01.1909225Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1911286Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1912228Z ^ 2025-05-07T19:56:01.1915628Z 
detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1446 2025-05-07T19:56:01.1918700Z 2025-05-07T19:56:01.1919932Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1922077Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1923047Z ^ 2025-05-07T19:56:01.1926782Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1496 2025-05-07T19:56:01.1929836Z 2025-05-07T19:56:01.1931006Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1933009Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1933901Z ^ 2025-05-07T19:56:01.1937692Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with 
emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1546 2025-05-07T19:56:01.1941106Z 2025-05-07T19:56:01.1942489Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1944690Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1945635Z ^ 2025-05-07T19:56:01.1949271Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1596 2025-05-07T19:56:01.1952528Z 2025-05-07T19:56:01.1953884Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1955971Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1956937Z ^ 2025-05-07T19:56:01.1960287Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1646 2025-05-07T19:56:01.1963689Z 2025-05-07T19:56:01.1965066Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change 
of sign 2025-05-07T19:56:01.1967162Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1968136Z ^ 2025-05-07T19:56:01.1971736Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1696 2025-05-07T19:56:01.1975129Z 2025-05-07T19:56:01.1976512Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1978643Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1979761Z ^ 2025-05-07T19:56:01.1984031Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1746 2025-05-07T19:56:01.1987402Z 2025-05-07T19:56:01.1988737Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.1990783Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.1991918Z ^ 2025-05-07T19:56:01.1995359Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, 
__nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1796 2025-05-07T19:56:01.1998760Z 2025-05-07T19:56:01.2000113Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.2002198Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.2003131Z ^ 2025-05-07T19:56:01.2006753Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1846 2025-05-07T19:56:01.2010219Z 2025-05-07T19:56:01.2011645Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.2013719Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.2014684Z ^ 2025-05-07T19:56:01.2018247Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1896 2025-05-07T19:56:01.2021794Z 2025-05-07T19:56:01.2023216Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.2025257Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.2026205Z ^ 2025-05-07T19:56:01.2032586Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1946 2025-05-07T19:56:01.2035971Z 2025-05-07T19:56:01.2037416Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.2039574Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.2040574Z ^ 2025-05-07T19:56:01.2044536Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1996 2025-05-07T19:56:01.2048042Z 2025-05-07T19:56:01.2049443Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.2051627Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.2052596Z ^ 
2025-05-07T19:56:01.2056371Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2046 2025-05-07T19:56:01.2060055Z 2025-05-07T19:56:01.2061454Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:01.2063622Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:01.2064610Z ^ 2025-05-07T19:56:01.2068376Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2096 2025-05-07T19:56:01.2071895Z 2025-05-07T19:56:05.5253530Z [245/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu -o 
CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu.o 2025-05-07T19:56:05.5278114Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:05.5280205Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:05.5280770Z ^ 2025-05-07T19:56:05.5281069Z 2025-05-07T19:56:05.5281536Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:05.5282532Z 2025-05-07T19:56:05.5284096Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:05.5286061Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:05.5286639Z ^ 2025-05-07T19:56:05.5286930Z 2025-05-07T19:56:05.5288409Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:05.5290387Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:05.5290958Z ^ 2025-05-07T19:56:05.5291281Z 2025-05-07T19:56:05.5292806Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:05.5294665Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:05.5295234Z ^ 2025-05-07T19:56:05.5295552Z 2025-05-07T19:56:05.5296001Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:05.5296680Z 2025-05-07T19:56:05.5298285Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:05.5300466Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:05.5301413Z ^ 2025-05-07T19:56:05.5301698Z 2025-05-07T19:56:05.5303304Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:05.5310288Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:05.5310884Z ^ 2025-05-07T19:56:05.5311222Z 2025-05-07T19:56:05.5312842Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:05.5314961Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:05.5315468Z ^ 2025-05-07T19:56:05.5315743Z 2025-05-07T19:56:05.5316225Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:05.5316928Z 2025-05-07T19:56:05.5318401Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:05.5320278Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:05.5320879Z ^ 2025-05-07T19:56:05.5321180Z 2025-05-07T19:56:05.5322677Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:05.5324565Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:05.5325045Z ^ 
2025-05-07T19:56:05.5325362Z 2025-05-07T19:56:05.5326939Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:05.5328933Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:05.5329471Z ^ 2025-05-07T19:56:05.5329774Z 2025-05-07T19:56:05.5330209Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:05.5330883Z 2025-05-07T19:56:05.5332477Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:05.5334523Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:05.5335087Z ^ 2025-05-07T19:56:05.5335395Z 2025-05-07T19:56:05.5336937Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:05.5338849Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:05.5339576Z ^ 2025-05-07T19:56:05.5339912Z 2025-05-07T19:56:05.5341481Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:05.5343473Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:05.5343978Z ^ 2025-05-07T19:56:05.5344269Z 2025-05-07T19:56:05.5344700Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:05.5345329Z 2025-05-07T19:56:05.5347218Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(237): 
warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:05.5349290Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:05.5349898Z ^ 2025-05-07T19:56:05.5350200Z 2025-05-07T19:56:05.5351821Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:05.5354056Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:05.5354774Z ^ 2025-05-07T19:56:05.5355087Z 2025-05-07T19:56:07.8328293Z [246/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include 
-DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_dense_unweighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_kernel.cu.o 2025-05-07T19:56:11.1437905Z [247/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_grad_embedding_ops.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_grad_embedding_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_grad_embedding_ops.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_grad_embedding_ops.cu.o 2025-05-07T19:56:25.8432683Z [248/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO 
-DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD 
-MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu.o 2025-05-07T19:56:25.8457508Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:25.8460277Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:25.8461032Z ^ 2025-05-07T19:56:25.8461331Z 2025-05-07T19:56:25.8461770Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:25.8462422Z 2025-05-07T19:56:25.8463952Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:25.8465975Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:25.8466542Z ^ 2025-05-07T19:56:25.8466825Z 2025-05-07T19:56:25.8468448Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:25.8470323Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:25.8470821Z ^ 2025-05-07T19:56:25.8471098Z 2025-05-07T19:56:25.8472545Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:25.8474598Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:25.8475204Z ^ 2025-05-07T19:56:25.8475492Z 2025-05-07T19:56:25.8477293Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:25.8479568Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:25.8480377Z ^ 2025-05-07T19:56:25.8480678Z 2025-05-07T19:56:25.8481152Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:25.8481882Z 2025-05-07T19:56:25.8483781Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:25.8485728Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:25.8486279Z ^ 2025-05-07T19:56:25.8486572Z 2025-05-07T19:56:25.8488088Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:25.8490082Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:25.8490625Z ^ 2025-05-07T19:56:25.8490899Z 2025-05-07T19:56:25.8492434Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:25.8494409Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:25.8494989Z ^ 
2025-05-07T19:56:25.8495255Z 2025-05-07T19:56:25.8497054Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:25.8499199Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:25.8500064Z ^ 2025-05-07T19:56:25.8500349Z 2025-05-07T19:56:25.8500959Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:25.8501674Z 2025-05-07T19:56:25.8503014Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:25.8505014Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:25.8505531Z ^ 2025-05-07T19:56:25.8505827Z 2025-05-07T19:56:25.8507349Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:25.8509236Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:25.8509757Z ^ 2025-05-07T19:56:25.8510021Z 2025-05-07T19:56:25.8511510Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:25.8513427Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:25.8513982Z ^ 2025-05-07T19:56:25.8514239Z 2025-05-07T19:56:25.8515840Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:25.8517989Z if (output_d >= 
0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:25.8518743Z ^ 2025-05-07T19:56:25.8519046Z 2025-05-07T19:56:25.8519512Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:25.8520174Z 2025-05-07T19:56:25.8521628Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:25.8523534Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:25.8524082Z ^ 2025-05-07T19:56:25.8524384Z 2025-05-07T19:56:25.8525959Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:25.8527990Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:25.8528573Z ^ 2025-05-07T19:56:25.8528851Z 2025-05-07T19:56:25.8530371Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:25.8532268Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:25.8532814Z ^ 2025-05-07T19:56:25.8533087Z 2025-05-07T19:56:25.8534534Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:25.8536569Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:25.8537338Z ^ 2025-05-07T19:56:25.8537619Z 2025-05-07T19:56:25.8538271Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:25.8538995Z 2025-05-07T19:56:25.8540765Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:25.8542880Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:25.8543558Z ^ 2025-05-07T19:56:25.8543864Z 2025-05-07T19:56:25.8545533Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:25.8547683Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:25.8548289Z ^ 2025-05-07T19:56:25.8548580Z 2025-05-07T19:56:25.8550263Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:25.8552340Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:25.8552928Z ^ 2025-05-07T19:56:25.8553216Z 2025-05-07T19:56:26.3471562Z [249/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu.o 2025-05-07T19:56:26.3498508Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(269): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:26.3500963Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:26.3501760Z ^ 2025-05-07T19:56:26.3502032Z 2025-05-07T19:56:26.3502454Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:26.3503293Z 2025-05-07T19:56:26.3504760Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(274): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:26.3506810Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:26.3507335Z ^ 2025-05-07T19:56:26.3507594Z 2025-05-07T19:56:26.3509067Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(291): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:26.3510913Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:26.3511444Z ^ 2025-05-07T19:56:26.3511709Z 2025-05-07T19:56:26.3513313Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(303): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:26.3515172Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:26.3515730Z ^ 2025-05-07T19:56:26.3515994Z 2025-05-07T19:56:26.3517440Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(269): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:26.3519473Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 
2025-05-07T19:56:26.3520196Z ^ 2025-05-07T19:56:26.3520459Z 2025-05-07T19:56:26.3520870Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:26.3521510Z 2025-05-07T19:56:26.3522987Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(274): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:26.3524871Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:26.3525444Z ^ 2025-05-07T19:56:26.3525725Z 2025-05-07T19:56:26.3527152Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(291): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:26.3529035Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:26.3529664Z ^ 2025-05-07T19:56:26.3529914Z 2025-05-07T19:56:26.3531437Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(303): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:26.3533308Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:26.3533867Z ^ 2025-05-07T19:56:26.3534175Z 2025-05-07T19:56:26.3535587Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(269): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:26.3537583Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:26.3538316Z ^ 2025-05-07T19:56:26.3538633Z 2025-05-07T19:56:26.3539041Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:26.3540079Z 2025-05-07T19:56:26.3541522Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(274): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:26.3543446Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:26.3544145Z ^ 2025-05-07T19:56:26.3544412Z 2025-05-07T19:56:26.3545866Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(291): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:26.3547799Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:26.3548336Z ^ 2025-05-07T19:56:26.3548622Z 2025-05-07T19:56:26.3550113Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(303): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:26.3551961Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:26.3552489Z ^ 2025-05-07T19:56:26.3552739Z 2025-05-07T19:56:26.3554150Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(269): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:26.3556173Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:26.3556895Z ^ 2025-05-07T19:56:26.3557181Z 2025-05-07T19:56:26.3557629Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:26.3558263Z 2025-05-07T19:56:26.3559778Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(274): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:26.3561676Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:26.3562190Z ^ 
2025-05-07T19:56:26.3562441Z 2025-05-07T19:56:26.3563961Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(291): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:26.3565765Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:26.3566355Z ^ 2025-05-07T19:56:26.3566610Z 2025-05-07T19:56:26.3568069Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(303): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:26.3569900Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:26.3570479Z ^ 2025-05-07T19:56:26.3570769Z 2025-05-07T19:56:26.3572195Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(269): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:26.3574240Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:26.3575025Z ^ 2025-05-07T19:56:26.3575269Z 2025-05-07T19:56:26.3575674Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:26.3576290Z 2025-05-07T19:56:26.3577775Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(274): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:26.3579757Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:26.3580298Z ^ 2025-05-07T19:56:26.3580562Z 2025-05-07T19:56:26.3582613Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(291): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:26.3584496Z if (output_d >= 0 && output_d 
< D) { 2025-05-07T19:56:26.3585024Z ^ 2025-05-07T19:56:26.3585390Z 2025-05-07T19:56:26.3586869Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(303): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:26.3588818Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:26.3589343Z ^ 2025-05-07T19:56:26.3589603Z 2025-05-07T19:56:28.0172486Z [250/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode 
arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu.o 2025-05-07T19:56:28.0193907Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:28.0195831Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:28.0196523Z ^ 2025-05-07T19:56:28.0196784Z 2025-05-07T19:56:28.0197179Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:28.0197789Z 2025-05-07T19:56:28.0199536Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:28.0201465Z if (output_d >= 0 && output_d < D) { 
2025-05-07T19:56:28.0201989Z ^ 2025-05-07T19:56:28.0202390Z 2025-05-07T19:56:28.0203896Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:28.0205963Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:28.0206461Z ^ 2025-05-07T19:56:28.0206711Z 2025-05-07T19:56:28.0208262Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:28.0209945Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:28.0210455Z ^ 2025-05-07T19:56:28.0210701Z 2025-05-07T19:56:28.0212178Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:28.0214068Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:28.0214735Z ^ 2025-05-07T19:56:28.0214994Z 2025-05-07T19:56:28.0215421Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:28.0216013Z 2025-05-07T19:56:28.0217366Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:28.0219114Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:28.0219789Z ^ 2025-05-07T19:56:28.0220044Z 2025-05-07T19:56:28.0221330Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 
2025-05-07T19:56:28.0223046Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:28.0223538Z ^ 2025-05-07T19:56:28.0223811Z 2025-05-07T19:56:28.0225155Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:28.0226858Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:28.0227351Z ^ 2025-05-07T19:56:28.0227624Z 2025-05-07T19:56:28.0228965Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:28.0230854Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:28.0231511Z ^ 2025-05-07T19:56:28.0231780Z 2025-05-07T19:56:28.0232230Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:28.0232878Z 2025-05-07T19:56:28.0234402Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:28.0236316Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:28.0236884Z ^ 2025-05-07T19:56:28.0237156Z 2025-05-07T19:56:28.0238887Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:28.0240789Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:28.0241326Z ^ 2025-05-07T19:56:28.0241608Z 2025-05-07T19:56:28.0243201Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(304): warning #186-D: 
pointless comparison of unsigned integer with zero 2025-05-07T19:56:28.0245184Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:28.0245679Z ^ 2025-05-07T19:56:28.0245913Z 2025-05-07T19:56:28.0247440Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:28.0249284Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:28.0249969Z ^ 2025-05-07T19:56:28.0250211Z 2025-05-07T19:56:28.0250626Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:28.0251208Z 2025-05-07T19:56:28.0252551Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:28.0254300Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:28.0254787Z ^ 2025-05-07T19:56:28.0255063Z 2025-05-07T19:56:28.0256429Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:28.0258172Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:28.0258651Z ^ 2025-05-07T19:56:28.0258921Z 2025-05-07T19:56:28.0260414Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:28.0262157Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:28.0262641Z ^ 2025-05-07T19:56:28.0262887Z 2025-05-07T19:56:28.0264259Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:28.0266131Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:28.0266818Z ^ 2025-05-07T19:56:28.0267078Z 2025-05-07T19:56:28.0267513Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:28.0268095Z 2025-05-07T19:56:28.0269442Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:28.0271196Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:28.0271731Z ^ 2025-05-07T19:56:28.0272027Z 2025-05-07T19:56:28.0273549Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:28.0293339Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:28.0293992Z ^ 2025-05-07T19:56:28.0294333Z 2025-05-07T19:56:28.0296080Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:28.0297984Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:28.0298515Z ^ 2025-05-07T19:56:28.0298786Z 2025-05-07T19:56:32.9893011Z [251/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu.o 2025-05-07T19:56:32.9918761Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(253): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.9920937Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:32.9921695Z ^ 2025-05-07T19:56:32.9922003Z 2025-05-07T19:56:32.9922484Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:32.9923161Z 2025-05-07T19:56:32.9924730Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(258): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.9926733Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:32.9927294Z ^ 2025-05-07T19:56:32.9927616Z 2025-05-07T19:56:32.9929633Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.9931621Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:32.9932156Z ^ 2025-05-07T19:56:32.9932452Z 2025-05-07T19:56:32.9934011Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(287): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.9936164Z if (output_d >= 0 
&& output_d < D) { 2025-05-07T19:56:32.9936705Z ^ 2025-05-07T19:56:32.9936984Z 2025-05-07T19:56:32.9938505Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(253): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.9940812Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:32.9941479Z ^ 2025-05-07T19:56:32.9941721Z 2025-05-07T19:56:32.9942190Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:32.9942868Z 2025-05-07T19:56:32.9944515Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(258): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.9946549Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:32.9947118Z ^ 2025-05-07T19:56:32.9947440Z 2025-05-07T19:56:32.9949054Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.9951140Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:32.9951711Z ^ 2025-05-07T19:56:32.9952024Z 2025-05-07T19:56:32.9953644Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(287): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.9955716Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:32.9956273Z ^ 2025-05-07T19:56:32.9956561Z 2025-05-07T19:56:32.9958195Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(253): warning #186-D: pointless comparison of unsigned integer with zero 
2025-05-07T19:56:32.9960378Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:32.9961085Z ^ 2025-05-07T19:56:32.9961376Z 2025-05-07T19:56:32.9961855Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:32.9962504Z 2025-05-07T19:56:32.9964067Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(258): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.9965994Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:32.9966563Z ^ 2025-05-07T19:56:32.9966839Z 2025-05-07T19:56:32.9968276Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.9970288Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:32.9970875Z ^ 2025-05-07T19:56:32.9971156Z 2025-05-07T19:56:32.9973036Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(287): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.9975071Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:32.9975614Z ^ 2025-05-07T19:56:32.9975921Z 2025-05-07T19:56:32.9977481Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(253): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.9979897Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:32.9980721Z ^ 2025-05-07T19:56:32.9981034Z 2025-05-07T19:56:32.9981480Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:32.9982381Z 2025-05-07T19:56:32.9984060Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(258): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.9986122Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:32.9986714Z ^ 2025-05-07T19:56:32.9987012Z 2025-05-07T19:56:32.9988563Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.9990601Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:32.9991171Z ^ 2025-05-07T19:56:32.9991448Z 2025-05-07T19:56:32.9993023Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(287): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.9995075Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:32.9995655Z ^ 2025-05-07T19:56:32.9995968Z 2025-05-07T19:56:32.9997585Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(253): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.9999828Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:33.0000597Z ^ 2025-05-07T19:56:33.0000885Z 2025-05-07T19:56:33.0001344Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:33.0002032Z 2025-05-07T19:56:33.0003626Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(258): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:33.0005536Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:33.0006153Z ^ 
2025-05-07T19:56:33.0006453Z 2025-05-07T19:56:33.0008085Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:33.0010150Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:33.0010746Z ^ 2025-05-07T19:56:33.0011025Z 2025-05-07T19:56:33.0012586Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(287): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:33.0014715Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:33.0015287Z ^ 2025-05-07T19:56:33.0015603Z 2025-05-07T19:57:05.9698764Z [252/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include 
-isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu.o 2025-05-07T19:57:09.8875185Z [253/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu.o 2025-05-07T19:57:10.1507550Z [254/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr 
--expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:57:12.1499714Z [255/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu.o 2025-05-07T19:57:14.3153038Z [256/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_adagrad_split_weighted_cuda.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_cuda.cu.o 2025-05-07T19:57:16.6860246Z [257/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr 
--expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu.o 2025-05-07T19:57:16.7353715Z [258/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_adagrad_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_cuda.cu.o 2025-05-07T19:57:17.8573268Z [259/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu.o 2025-05-07T19:57:20.2142991Z [260/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr 
--expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:57:20.2919827Z [261/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu.o 2025-05-07T19:57:29.3845835Z [262/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu 
-o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu.o 2025-05-07T19:57:29.8707961Z [263/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl 
--expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu.o 2025-05-07T19:57:33.0438671Z [264/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:57:33.6189744Z [265/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:57:41.7793934Z [266/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_gwd_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_kernel.cu.o 2025-05-07T19:57:41.7817151Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_gwd_kernel.cu(240): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:57:41.7818958Z const auto offset_idx = idx * D_emb; 2025-05-07T19:57:41.7819610Z ^ 2025-05-07T19:57:41.7819896Z 2025-05-07T19:57:41.7820330Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:41.7821011Z 2025-05-07T19:57:41.7822351Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_gwd_kernel.cu(240): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:57:41.7823973Z const auto offset_idx = idx * D_emb; 2025-05-07T19:57:41.7824499Z ^ 2025-05-07T19:57:41.7824748Z 2025-05-07T19:57:41.7825191Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:41.7825815Z 2025-05-07T19:57:41.7827003Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_gwd_kernel.cu(240): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:57:41.7828799Z const auto offset_idx = idx * D_emb; 2025-05-07T19:57:41.7829305Z ^ 2025-05-07T19:57:41.7829595Z 2025-05-07T19:57:41.7830064Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:41.7831056Z 2025-05-07T19:57:41.7832461Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_gwd_kernel.cu(240): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:57:41.7834215Z const auto offset_idx = idx * D_emb; 2025-05-07T19:57:41.7834757Z ^ 2025-05-07T19:57:41.7835074Z 2025-05-07T19:57:41.7835489Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:41.7836207Z 2025-05-07T19:57:47.8784371Z [267/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu.o 2025-05-07T19:57:51.0521669Z [268/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:57:51.4480416Z [269/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu.o 2025-05-07T19:57:51.7350118Z [270/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu.o 2025-05-07T19:57:54.8724614Z [271/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu.o 2025-05-07T19:57:54.8751090Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(288): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:54.8752965Z int error_code = 0; 2025-05-07T19:57:54.8753390Z ^ 2025-05-07T19:57:54.8753626Z 2025-05-07T19:57:54.8754068Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:54.8754756Z 2025-05-07T19:57:54.8756204Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(289): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:54.8758010Z int64_t error_value; 2025-05-07T19:57:54.8758496Z ^ 2025-05-07T19:57:54.8758723Z 2025-05-07T19:57:54.8760132Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(136): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:54.8761957Z int error_code = 0; 2025-05-07T19:57:54.8762410Z ^ 2025-05-07T19:57:54.8762631Z 2025-05-07T19:57:54.8764065Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(137): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:54.8765894Z int64_t error_value; 2025-05-07T19:57:54.8766321Z ^ 2025-05-07T19:57:54.8766576Z 2025-05-07T19:57:54.8768105Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(774): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:54.8769917Z int error_code = 0; 2025-05-07T19:57:54.8770386Z ^ 2025-05-07T19:57:54.8770593Z 2025-05-07T19:57:54.8772025Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(775): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:54.8773818Z int64_t error_value; 2025-05-07T19:57:54.8774294Z ^ 2025-05-07T19:57:54.8774535Z 2025-05-07T19:57:54.8775910Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(627): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:54.8777724Z int error_code = 0; 2025-05-07T19:57:54.8778175Z ^ 2025-05-07T19:57:54.8778629Z 2025-05-07T19:57:54.8780180Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(628): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:54.8782197Z int64_t error_value; 2025-05-07T19:57:54.8782628Z ^ 2025-05-07T19:57:54.8783040Z 2025-05-07T19:57:54.8784420Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(288): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:54.8786342Z int error_code = 0; 2025-05-07T19:57:54.8786781Z ^ 2025-05-07T19:57:54.8786990Z 2025-05-07T19:57:54.8787467Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:54.8788117Z 2025-05-07T19:57:54.8789550Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(289): warning #177-D: variable 
"error_value" was declared but never referenced 2025-05-07T19:57:54.8791369Z int64_t error_value; 2025-05-07T19:57:54.8791837Z ^ 2025-05-07T19:57:54.8792066Z 2025-05-07T19:57:54.8793463Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(136): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:54.8795275Z int error_code = 0; 2025-05-07T19:57:54.8795705Z ^ 2025-05-07T19:57:54.8795937Z 2025-05-07T19:57:54.8797356Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(137): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:54.8799153Z int64_t error_value; 2025-05-07T19:57:54.8799589Z ^ 2025-05-07T19:57:54.8799826Z 2025-05-07T19:57:54.8801226Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(774): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:54.8803003Z int error_code = 0; 2025-05-07T19:57:54.8803461Z ^ 2025-05-07T19:57:54.8803655Z 2025-05-07T19:57:54.8805099Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(775): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:54.8806899Z int64_t error_value; 2025-05-07T19:57:54.8807355Z ^ 2025-05-07T19:57:54.8807588Z 2025-05-07T19:57:54.8809000Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(627): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:54.8810834Z int error_code = 0; 2025-05-07T19:57:54.8811288Z ^ 2025-05-07T19:57:54.8811502Z 2025-05-07T19:57:54.8812938Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(628): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:54.8814766Z int64_t error_value; 2025-05-07T19:57:54.8815208Z ^ 2025-05-07T19:57:54.8815470Z 2025-05-07T19:57:54.8816871Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(288): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:54.8818676Z int error_code = 0; 2025-05-07T19:57:54.8819128Z ^ 2025-05-07T19:57:54.8819329Z 2025-05-07T19:57:54.8819932Z Remark: The warnings can be suppressed with "-diag-suppress <warning-number>" 2025-05-07T19:57:54.8820608Z 2025-05-07T19:57:54.8822336Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(289): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:54.8824205Z int64_t error_value; 2025-05-07T19:57:54.8824676Z ^ 2025-05-07T19:57:54.8824898Z 2025-05-07T19:57:54.8826325Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(136): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:54.8828258Z int error_code = 0; 2025-05-07T19:57:54.8828681Z ^ 2025-05-07T19:57:54.8828912Z 2025-05-07T19:57:54.8830326Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(137): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:54.8832236Z int64_t error_value; 2025-05-07T19:57:54.8832671Z ^ 2025-05-07T19:57:54.8832895Z 2025-05-07T19:57:54.8834325Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(774): warning #177-D: variable 
"error_code" was declared but never referenced 2025-05-07T19:57:54.8836123Z int error_code = 0; 2025-05-07T19:57:54.8836568Z ^ 2025-05-07T19:57:54.8836767Z 2025-05-07T19:57:54.8838177Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(775): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:54.8839974Z int64_t error_value; 2025-05-07T19:57:54.8840408Z ^ 2025-05-07T19:57:54.8840638Z 2025-05-07T19:57:54.8842033Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(627): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:54.8843818Z int error_code = 0; 2025-05-07T19:57:54.8844246Z ^ 2025-05-07T19:57:54.8844473Z 2025-05-07T19:57:54.8845883Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(628): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:54.8847689Z int64_t error_value; 2025-05-07T19:57:54.8848097Z ^ 2025-05-07T19:57:54.8848324Z 2025-05-07T19:57:54.8849732Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(288): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:54.8851508Z int error_code = 0; 2025-05-07T19:57:54.8851929Z ^ 2025-05-07T19:57:54.8852129Z 2025-05-07T19:57:54.8852584Z Remark: The warnings can be suppressed with "-diag-suppress <warning-number>" 2025-05-07T19:57:54.8853268Z 2025-05-07T19:57:54.8854676Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(289): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:54.8856466Z int64_t error_value; 2025-05-07T19:57:54.8856888Z ^ 2025-05-07T19:57:54.8857129Z 
2025-05-07T19:57:54.8858529Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(136): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:54.8860457Z int error_code = 0; 2025-05-07T19:57:54.8860864Z ^ 2025-05-07T19:57:54.8861078Z 2025-05-07T19:57:54.8862473Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(137): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:54.8864270Z int64_t error_value; 2025-05-07T19:57:54.8864726Z ^ 2025-05-07T19:57:54.8864947Z 2025-05-07T19:57:54.8866593Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(774): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:54.8868354Z int error_code = 0; 2025-05-07T19:57:54.8868800Z ^ 2025-05-07T19:57:54.8869011Z 2025-05-07T19:57:54.8870396Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(775): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:54.8872296Z int64_t error_value; 2025-05-07T19:57:54.8872730Z ^ 2025-05-07T19:57:54.8872947Z 2025-05-07T19:57:54.8874381Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(627): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:54.8876139Z int error_code = 0; 2025-05-07T19:57:54.8876567Z ^ 2025-05-07T19:57:54.8876793Z 2025-05-07T19:57:54.8878208Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(628): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:54.8879979Z int64_t error_value; 
2025-05-07T19:57:54.8880424Z ^ 2025-05-07T19:57:54.8880658Z 2025-05-07T19:57:57.9689347Z [272/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 
-Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_indice_weights_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_indice_weights_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_dense_indice_weights_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_indice_weights_codegen_cuda.cu.o 2025-05-07T19:57:59.8003730Z [273/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu.o 2025-05-07T19:58:02.9041515Z [274/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu.o 2025-05-07T19:58:03.7214194Z [275/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl 
--expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu.o 2025-05-07T19:58:07.3978861Z [276/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:58:08.6734614Z [277/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_cuda.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_sgd_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_cuda.cu.o 2025-05-07T19:58:15.0996931Z [278/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_sgd_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_warp.cu.o 2025-05-07T19:58:15.2460824Z [279/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:58:18.3382550Z [280/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE 
-Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu.o 2025-05-07T19:58:19.1853260Z [281/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode 
arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:58:21.5166318Z [282/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_gwd_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_kernel.cu.o 2025-05-07T19:58:21.5191627Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_gwd_kernel.cu(231): warning #177-D: variable "offset_idx" was declared but never referenced 
2025-05-07T19:58:21.5193607Z const auto offset_idx = idx * D_emb; 2025-05-07T19:58:21.5194213Z ^ 2025-05-07T19:58:21.5194490Z 2025-05-07T19:58:21.5194972Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:21.5195665Z 2025-05-07T19:58:21.5197082Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_gwd_kernel.cu(231): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:58:21.5198973Z const auto offset_idx = idx * D_emb; 2025-05-07T19:58:21.5199580Z ^ 2025-05-07T19:58:21.5199844Z 2025-05-07T19:58:21.5200303Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:21.5201017Z 2025-05-07T19:58:21.5202455Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_gwd_kernel.cu(231): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:58:21.5204359Z const auto offset_idx = idx * D_emb; 2025-05-07T19:58:21.5204861Z ^ 2025-05-07T19:58:21.5205134Z 2025-05-07T19:58:21.5205604Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:21.5206285Z 2025-05-07T19:58:21.5207643Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_gwd_kernel.cu(231): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:58:21.5209438Z const auto offset_idx = idx * D_emb; 2025-05-07T19:58:21.5210035Z ^ 2025-05-07T19:58:21.5210305Z 2025-05-07T19:58:21.5210760Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:21.5211482Z 2025-05-07T19:58:26.3051351Z [283/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu.o 2025-05-07T19:58:33.6994504Z [284/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_indice_weights_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_indice_weights_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_indice_weights_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_indice_weights_codegen_cuda.cu.o 2025-05-07T19:58:36.4009125Z [285/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_adam_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_warp.cu.o 2025-05-07T19:58:39.7780721Z [286/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:58:40.3717946Z [287/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:58:43.7835120Z [288/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_adam_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_cta.cu.o 2025-05-07T19:58:45.0833913Z [289/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE 
-Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_sgd_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_cta.cu.o 2025-05-07T19:58:47.3296468Z [290/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 
-gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:58:49.0751345Z [291/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu.o 2025-05-07T19:58:49.0778049Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:49.0780539Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:49.0781195Z ^ 2025-05-07T19:58:49.0781566Z 2025-05-07T19:58:49.0782264Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:49.0783027Z 2025-05-07T19:58:49.0784587Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:49.0786833Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:49.0787492Z ^ 2025-05-07T19:58:49.0787829Z 2025-05-07T19:58:49.0789929Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:49.0792170Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:49.0792833Z ^ 2025-05-07T19:58:49.0793166Z 2025-05-07T19:58:49.0794730Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:49.0797094Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:49.0797849Z ^ 2025-05-07T19:58:49.0798218Z 2025-05-07T19:58:49.0798707Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:49.0799442Z 2025-05-07T19:58:49.0801236Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with 
zero 2025-05-07T19:58:49.0803353Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:49.0803981Z ^ 2025-05-07T19:58:49.0804317Z 2025-05-07T19:58:49.0806075Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:49.0808308Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:49.0808980Z ^ 2025-05-07T19:58:49.0809312Z 2025-05-07T19:58:49.0811052Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:49.0813158Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:49.0813845Z ^ 2025-05-07T19:58:49.0814183Z 2025-05-07T19:58:49.0814669Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:49.0815431Z 2025-05-07T19:58:49.0817178Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:49.0819443Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:49.0820237Z ^ 2025-05-07T19:58:49.0820589Z 2025-05-07T19:58:49.0822343Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:49.0824420Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:49.0825084Z ^ 2025-05-07T19:58:49.0825431Z 2025-05-07T19:58:49.0827198Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(220): warning #186-D: 
pointless comparison of unsigned integer with zero 2025-05-07T19:58:49.0829407Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:49.0830076Z ^ 2025-05-07T19:58:49.0830413Z 2025-05-07T19:58:49.0830900Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:49.0831546Z 2025-05-07T19:58:49.0833285Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:49.0835516Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:49.0836140Z ^ 2025-05-07T19:58:49.0836502Z 2025-05-07T19:58:49.0838492Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:49.0840506Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:49.0841168Z ^ 2025-05-07T19:58:49.0841610Z 2025-05-07T19:58:49.0843250Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:49.0845589Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:49.0846250Z ^ 2025-05-07T19:58:49.0846582Z 2025-05-07T19:58:49.0847070Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:49.0847826Z 2025-05-07T19:58:49.0849589Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:49.0851669Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:49.0852295Z ^ 2025-05-07T19:58:49.0852663Z 2025-05-07T19:58:49.0854379Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:49.0856610Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:49.0857233Z ^ 2025-05-07T19:58:49.0857564Z 2025-05-07T19:58:51.8713509Z [292/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:58:52.0506940Z [293/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_adam_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_warp.cu.o 2025-05-07T19:58:52.3378935Z [294/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_cuda.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_adam_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_cuda.cu.o 2025-05-07T19:58:55.0870750Z [295/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:58:56.4441346Z [296/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_adam_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_cta.cu.o 2025-05-07T19:58:58.3206885Z [297/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu.o 2025-05-07T19:58:58.6796022Z [298/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:58:58.8277313Z [299/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu.o 2025-05-07T19:59:03.3523604Z [300/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_cuda.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_sgd_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_cuda.cu.o 2025-05-07T19:59:05.1928882Z [301/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_adam_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_cuda.cu.o 2025-05-07T19:59:09.7318328Z [302/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_lamb_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_warp.cu.o 2025-05-07T19:59:10.2066915Z [303/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_cuda.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_lamb_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_cuda.cu.o 2025-05-07T19:59:11.9504510Z [304/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_lamb_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_cta.cu.o 2025-05-07T19:59:14.1634392Z [305/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:59:21.2512443Z [306/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu.o 2025-05-07T19:59:22.1530020Z [307/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode 
arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:59:23.2146084Z [308/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu.o 2025-05-07T19:59:24.2726249Z [309/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC 
-DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:59:27.2076891Z [310/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode 
arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu.o 2025-05-07T19:59:27.7726397Z [311/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu.o 2025-05-07T19:59:28.6660022Z [312/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE 
-Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu.o 2025-05-07T19:59:29.9076281Z [313/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode 
arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_lamb_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_cuda.cu.o 2025-05-07T19:59:31.4554183Z [314/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include 
-isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:59:36.9650872Z [315/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu.o 2025-05-07T19:59:37.9437218Z [316/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 
-gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_lars_sgd_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_cuda.cu.o 2025-05-07T19:59:38.2161450Z [317/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:59:39.8093898Z [318/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL 
-DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:59:42.3709440Z [319/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode 
arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu.o 2025-05-07T19:59:49.1856834Z [320/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu.o 2025-05-07T19:59:49.1881781Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:49.1884750Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:59:49.1885557Z ^ 2025-05-07T19:59:49.1886035Z 2025-05-07T19:59:49.1886503Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:49.1887045Z 2025-05-07T19:59:49.1888598Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:49.1890630Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:49.1891215Z ^ 2025-05-07T19:59:49.1891507Z 2025-05-07T19:59:49.1893140Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:49.1895212Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:49.1895796Z ^ 2025-05-07T19:59:49.1896081Z 2025-05-07T19:59:49.1897707Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:49.1899939Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:49.1900489Z ^ 2025-05-07T19:59:49.1900800Z 2025-05-07T19:59:49.1902483Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:49.1904790Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < 
num_packed_bags) { 2025-05-07T19:59:49.1905569Z ^ 2025-05-07T19:59:49.1905893Z 2025-05-07T19:59:49.1906359Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:49.1907069Z 2025-05-07T19:59:49.1908691Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:49.1910828Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:49.1911440Z ^ 2025-05-07T19:59:49.1911750Z 2025-05-07T19:59:49.1913427Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:49.1915408Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:49.1915974Z ^ 2025-05-07T19:59:49.1916294Z 2025-05-07T19:59:49.1917935Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:49.1919977Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:49.1920549Z ^ 2025-05-07T19:59:49.1920865Z 2025-05-07T19:59:49.1922486Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:49.1925175Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:59:49.1925976Z ^ 2025-05-07T19:59:49.1926288Z 2025-05-07T19:59:49.1926779Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:49.1927487Z 2025-05-07T19:59:49.1929276Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:49.1931328Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:49.1931926Z ^ 2025-05-07T19:59:49.1932219Z 2025-05-07T19:59:49.1933898Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:49.1936055Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:49.1936635Z ^ 2025-05-07T19:59:49.1936947Z 2025-05-07T19:59:49.1938630Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:49.1940896Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:49.1941464Z ^ 2025-05-07T19:59:49.1941779Z 2025-05-07T19:59:49.1943462Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:49.1945786Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:59:49.1946586Z ^ 2025-05-07T19:59:49.1946874Z 2025-05-07T19:59:49.1947387Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:49.1948093Z 2025-05-07T19:59:49.1949774Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:49.1951863Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:49.1952472Z ^ 
2025-05-07T19:59:49.1952768Z 2025-05-07T19:59:49.1954449Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:49.1956571Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:49.1957133Z ^ 2025-05-07T19:59:49.1957444Z 2025-05-07T19:59:49.1959120Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:49.1961209Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:49.1961773Z ^ 2025-05-07T19:59:49.1962076Z 2025-05-07T19:59:49.1963745Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:49.1966058Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:59:49.1966839Z ^ 2025-05-07T19:59:49.1967135Z 2025-05-07T19:59:49.1967634Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:49.1968339Z 2025-05-07T19:59:49.1970201Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:49.1972294Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:49.1972897Z ^ 2025-05-07T19:59:49.1973203Z 2025-05-07T19:59:49.1974851Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:49.1977068Z if (output_d >= 
0 && output_d < D) { 2025-05-07T19:59:49.1977642Z ^ 2025-05-07T19:59:49.1977953Z 2025-05-07T19:59:49.1979717Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:49.1981817Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:49.1982590Z ^ 2025-05-07T19:59:49.1982878Z 2025-05-07T19:59:51.0458337Z [321/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 
-gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu.o 2025-05-07T19:59:52.4524758Z [322/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu.o 2025-05-07T19:59:52.4537832Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer 
with zero 2025-05-07T19:59:52.4538994Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:59:52.4539623Z ^ 2025-05-07T19:59:52.4539794Z 2025-05-07T19:59:52.4540070Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:52.4540439Z 2025-05-07T19:59:52.4541267Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:52.4542333Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:52.4542651Z ^ 2025-05-07T19:59:52.4542832Z 2025-05-07T19:59:52.4543647Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:52.4544701Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:52.4545014Z ^ 2025-05-07T19:59:52.4545197Z 2025-05-07T19:59:52.4546014Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:52.4547074Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:52.4547506Z ^ 2025-05-07T19:59:52.4547667Z 2025-05-07T19:59:52.4548502Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:52.4549637Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:59:52.4550132Z ^ 2025-05-07T19:59:52.4550301Z 2025-05-07T19:59:52.4550582Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:52.4550999Z 2025-05-07T19:59:52.4551820Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:52.4552885Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:52.4553216Z ^ 2025-05-07T19:59:52.4553403Z 2025-05-07T19:59:52.4554212Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:52.4555268Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:52.4555585Z ^ 2025-05-07T19:59:52.4555767Z 2025-05-07T19:59:52.4556578Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:52.4557612Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:52.4557953Z ^ 2025-05-07T19:59:52.4558111Z 2025-05-07T19:59:52.4558952Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:52.4560089Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:59:52.4560539Z ^ 2025-05-07T19:59:52.4560700Z 2025-05-07T19:59:52.4560947Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:52.4561336Z 2025-05-07T19:59:52.4562153Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:52.4563213Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:52.4563526Z ^ 
2025-05-07T19:59:52.4563707Z 2025-05-07T19:59:52.4564520Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:52.4565582Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:52.4565889Z ^ 2025-05-07T19:59:52.4566045Z 2025-05-07T19:59:52.4566880Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:52.4567914Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:52.4568249Z ^ 2025-05-07T19:59:52.4568404Z 2025-05-07T19:59:52.4569238Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:52.4570428Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:59:52.4570872Z ^ 2025-05-07T19:59:52.4571033Z 2025-05-07T19:59:52.4571280Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:52.4571670Z 2025-05-07T19:59:52.4572486Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:52.4573587Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:52.4573903Z ^ 2025-05-07T19:59:52.4574122Z 2025-05-07T19:59:52.4574938Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:52.4575996Z if (output_d >= 0 && 
output_d < D) { 2025-05-07T19:59:52.4576303Z ^ 2025-05-07T19:59:52.4576465Z 2025-05-07T19:59:52.4577302Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:52.4578331Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:52.4578659Z ^ 2025-05-07T19:59:52.4578816Z 2025-05-07T19:59:52.4579808Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:52.4580941Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:59:52.4581378Z ^ 2025-05-07T19:59:52.4581538Z 2025-05-07T19:59:52.4581785Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:52.4582432Z 2025-05-07T19:59:52.4583249Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:52.4584312Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:52.4584632Z ^ 2025-05-07T19:59:52.4584821Z 2025-05-07T19:59:52.4585633Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:52.4586694Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:52.4587010Z ^ 2025-05-07T19:59:52.4587166Z 2025-05-07T19:59:52.4588010Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 
2025-05-07T19:59:52.4589043Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:52.4589380Z ^ 2025-05-07T19:59:52.4589536Z 2025-05-07T19:59:53.6223036Z [323/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_inference.so -o fbgemm_gpu_tbe_inference.so CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_weighted_codegen_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host.cpp.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_split_lookup.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu.o 
CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu.o 
CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm.so fbgemm_gpu_tbe_cache.so fbgemm_gpu_config.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so" -Wl,--as-needed asmjit.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -lrt -lpthread -ldl && : 2025-05-07T20:00:00.5567069Z [324/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_kernel.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_ssd_unweighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_kernel.cu.o 2025-05-07T20:00:07.0810420Z [325/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_ssd_weighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_kernel.cu.o 2025-05-07T20:00:43.7132849Z [326/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_none_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_warp.cu.o 2025-05-07T20:00:52.6241749Z [327/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T20:00:53.7794208Z [328/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu.o 2025-05-07T20:00:55.1778532Z [329/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_none_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_warp.cu.o 2025-05-07T20:00:56.5907526Z [330/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE 
-Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu.o 2025-05-07T20:00:58.3543912Z [331/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode 
arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_none_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_cuda.cu.o 2025-05-07T20:00:58.8589799Z [332/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_kernel.cu.o 2025-05-07T20:01:00.5168147Z [333/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu.o 2025-05-07T20:01:00.6815900Z [334/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode 
arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_none_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_cuda.cu.o 2025-05-07T20:01:09.2475217Z [335/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp 2025-05-07T20:01:11.0257523Z [336/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp 2025-05-07T20:01:11.0897672Z [337/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp 2025-05-07T20:01:11.5888971Z [338/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE 
-Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_none_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_cta.cu.o 2025-05-07T20:01:12.2473586Z [339/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode 
arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_none_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_cuda.cu.o 2025-05-07T20:01:15.0091121Z [340/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include 
-isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T20:01:19.6988128Z [341/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp 2025-05-07T20:01:21.0405819Z [342/608] 
/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp.o.d -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp 2025-05-07T20:01:21.0963711Z [343/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp 2025-05-07T20:01:21.2648956Z [344/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 
-DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp 2025-05-07T20:01:21.7718120Z [345/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp 2025-05-07T20:01:22.9360345Z [346/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_none_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_cta.cu.o 2025-05-07T20:01:24.6363890Z [347/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_dense_split_weighted_vbe_meta.cpp 2025-05-07T20:01:27.4413769Z [348/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_kernel.cu.o 2025-05-07T20:01:29.4017433Z [349/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 
-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp 2025-05-07T20:01:29.4205190Z [350/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include 
-DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu.o 2025-05-07T20:01:31.1256192Z [351/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp 2025-05-07T20:01:31.1849173Z [352/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_adam_split_weighted_vbe_meta.cpp 2025-05-07T20:01:34.3749082Z [353/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu.o 2025-05-07T20:01:37.2890461Z [354/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode 
arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T20:01:37.5380550Z [355/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu.o 2025-05-07T20:01:45.3696185Z [356/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO 
-DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 
-D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu.o 2025-05-07T20:01:45.9594187Z [357/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode 
arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_ssd_weighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_kernel.cu.o 2025-05-07T20:01:46.0111860Z [358/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu.o 2025-05-07T20:01:46.7090569Z [359/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO 
-DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 
-D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu.o 2025-05-07T20:01:53.2524146Z [360/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include 
-DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu.o 2025-05-07T20:02:00.4830493Z [361/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu.o 2025-05-07T20:02:02.0920614Z [362/608] 
/github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations 
--expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu.o 2025-05-07T20:02:07.2823422Z [363/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu.o 2025-05-07T20:02:07.5442419Z [364/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_ssd_unweighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_kernel.cu.o 
2025-05-07T20:02:09.8208102Z [365/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion 
-Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu.o 2025-05-07T20:02:13.4891264Z [366/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T20:02:15.8083804Z [367/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu.o 2025-05-07T20:02:21.4964495Z [368/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu.o 2025-05-07T20:02:22.8201980Z [369/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu.o 2025-05-07T20:02:30.9032013Z [370/608] /github/home/miniconda/envs/build_binary/bin/nvcc 
-forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ 
-D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu.o 2025-05-07T20:02:32.6834616Z [371/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu.o 2025-05-07T20:02:34.1032547Z [372/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T20:02:48.7134216Z [373/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu.o 2025-05-07T20:02:52.1351068Z [374/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu.o 2025-05-07T20:02:56.8382155Z [375/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO 
-DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 
-D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu.o 2025-05-07T20:03:01.2470303Z [376/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include 
-DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu.o 2025-05-07T20:03:01.3479266Z [377/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu.o 2025-05-07T20:03:08.8103445Z [378/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl 
--expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T20:03:08.9332072Z [379/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include 
-isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu.o 2025-05-07T20:03:21.9223182Z [380/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu.o 2025-05-07T20:03:24.6513327Z [381/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode 
arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu.o 2025-05-07T20:03:26.5018962Z [382/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu.o 2025-05-07T20:03:28.7165672Z [383/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED 
-DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu.o 2025-05-07T20:03:29.1867921Z [384/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode 
arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu.o 2025-05-07T20:03:31.5794545Z [385/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_dense_split_weighted_meta.cpp 2025-05-07T20:03:34.1715325Z [386/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu.o 2025-05-07T20:03:34.3019248Z [387/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp 2025-05-07T20:03:36.0222096Z [388/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_dense_split_unweighted_meta.cpp 2025-05-07T20:03:43.8582367Z [389/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:03:49.8305854Z [390/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:03:49.8683124Z [391/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:03:52.6121537Z [392/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_split_dense.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_split_dense.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_split_dense.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_dense.cpp 2025-05-07T20:03:55.0135512Z [393/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:03:56.1727102Z [394/608] 
/github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations 
--expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu.o 2025-05-07T20:03:56.8826719Z [395/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu.o 2025-05-07T20:03:57.1974592Z [396/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:03:58.4307347Z [397/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu.o 2025-05-07T20:03:59.9034461Z [398/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu.o 2025-05-07T20:04:00.6321892Z [399/608] /github/home/miniconda/envs/build_binary/bin/nvcc 
-forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ 
-D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu.o 2025-05-07T20:04:02.9515676Z [400/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode 
arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:04:03.7645494Z [401/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_dense_split_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_cuda.cu.o 2025-05-07T20:04:04.4607716Z [402/608] /github/home/miniconda/envs/build_binary/bin/nvcc 
-forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ 
-D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu.o 2025-05-07T20:04:06.3793807Z [403/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode 
arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:04:12.5117685Z [404/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu.o 2025-05-07T20:04:17.5665219Z [405/608] /github/home/miniconda/envs/build_binary/bin/nvcc 
-forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ 
-D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:18.3608607Z [406/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adagrad.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adagrad.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adagrad.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_adagrad.cpp 2025-05-07T20:04:20.5419371Z [407/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_sgd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_sgd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_sgd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_approx_sgd.cpp 2025-05-07T20:04:20.8907776Z [408/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:04:21.7060554Z [409/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:04:22.4611184Z [410/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:22.5912281Z [411/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_sgd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_sgd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_sgd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_sgd.cpp 2025-05-07T20:04:23.1514836Z [412/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lamb.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lamb.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lamb.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_lamb.cpp 2025-05-07T20:04:23.3810607Z [413/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lars_sgd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lars_sgd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lars_sgd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_lars_sgd.cpp 2025-05-07T20:04:24.4234430Z [414/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:24.5945424Z [415/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:25.0170429Z [416/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_adam.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_adam.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_adam.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_partial_rowwise_adam.cpp 2025-05-07T20:04:25.2612543Z [417/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_none.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_none.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_none.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_none.cpp 2025-05-07T20:04:26.7508313Z [418/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad.cpp 2025-05-07T20:04:27.0839918Z [419/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adam.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adam.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adam.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_adam.cpp 2025-05-07T20:04:27.1003406Z [420/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_lamb.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_lamb.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_lamb.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_partial_rowwise_lamb.cpp 2025-05-07T20:04:27.5804635Z [421/608] /github/home/miniconda/envs/build_binary/bin/nvcc 
-forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ 
-D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:27.7726253Z [422/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include 
-DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:28.4049526Z [423/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:28.7616164Z [424/608] 
/github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations 
--expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:04:29.8957784Z [425/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_adam_split_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_cuda.cu.o 2025-05-07T20:04:30.0650086Z [426/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_rowwise_adagrad.cpp 2025-05-07T20:04:30.9699749Z [427/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:31.9197790Z [428/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp 2025-05-07T20:04:32.0288055Z [429/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_adagrad_split_weighted_meta.cpp 2025-05-07T20:04:32.1914497Z [430/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include 
-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_adagrad_split_unweighted_meta.cpp 2025-05-07T20:04:32.5466812Z [431/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T20:04:33.1387857Z [432/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp 2025-05-07T20:04:33.4064757Z [433/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp 2025-05-07T20:04:33.8528279Z [434/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_sgd_split_weighted_meta.cpp 2025-05-07T20:04:33.9756344Z [435/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_sgd_split_unweighted_meta.cpp 2025-05-07T20:04:34.6987727Z [436/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE 
-Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T20:04:34.7697834Z [437/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_weighted_meta.cpp.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_adam_split_weighted_meta.cpp 2025-05-07T20:04:34.9965872Z [438/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 
-mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_adam_split_unweighted_meta.cpp 2025-05-07T20:04:35.0625352Z [439/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode 
arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:35.1380090Z [440/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include 
-isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_weighted_adagrad.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_weighted_adagrad.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_weighted_adagrad.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_rowwise_weighted_adagrad.cpp 2025-05-07T20:04:35.4379157Z [441/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp 2025-05-07T20:04:36.2706841Z [442/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:36.4041345Z [443/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_lamb_split_weighted_meta.cpp 2025-05-07T20:04:36.5740286Z [444/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_lars_sgd_split_weighted_meta.cpp 2025-05-07T20:04:36.8296400Z [445/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_lamb_split_unweighted_meta.cpp 2025-05-07T20:04:37.4336968Z [446/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp 2025-05-07T20:04:37.5189637Z [447/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp 2025-05-07T20:04:37.9472125Z [448/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp 2025-05-07T20:04:38.2955795Z [449/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp 2025-05-07T20:04:38.4977008Z [450/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL 
-DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_ops.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_ops.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_ops.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/index_select/batch_index_select_dim0_ops.cpp 2025-05-07T20:04:39.4698951Z [451/608] 
/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp.o.d -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp 2025-05-07T20:04:39.7619806Z [452/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_none_split_weighted_meta.cpp 2025-05-07T20:04:40.5683012Z [453/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode 
arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu.o 2025-05-07T20:04:41.0557359Z [454/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:04:41.1185341Z [455/608] 
/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_unweighted_meta.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_none_split_unweighted_meta.cpp 2025-05-07T20:04:41.7118562Z [456/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp 2025-05-07T20:04:41.8840760Z [457/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp 2025-05-07T20:04:44.2007338Z [458/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/memory_utils/memory_utils.cpp 2025-05-07T20:04:46.0660319Z [459/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include 
-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/memory_utils/memory_utils_ops.cpp 2025-05-07T20:04:47.0378055Z [460/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp 2025-05-07T20:04:47.4181173Z [461/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu.o 2025-05-07T20:04:47.4203736Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu(236): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:04:47.4205561Z const auto offset_idx = idx * D_emb; 2025-05-07T20:04:47.4206127Z ^ 2025-05-07T20:04:47.4206398Z 2025-05-07T20:04:47.4206833Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:47.4207496Z 2025-05-07T20:04:47.4208829Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu(236): warning #177-D: variable "offset_idx" was declared but never referenced 
2025-05-07T20:04:47.4210497Z const auto offset_idx = idx * D_emb; 2025-05-07T20:04:47.4211054Z ^ 2025-05-07T20:04:47.4211306Z 2025-05-07T20:04:47.4212187Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:47.4212865Z 2025-05-07T20:04:47.4214167Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu(236): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:04:47.4219788Z const auto offset_idx = idx * D_emb; 2025-05-07T20:04:47.4220332Z ^ 2025-05-07T20:04:47.4220598Z 2025-05-07T20:04:47.4221011Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:47.4221842Z 2025-05-07T20:04:47.4223130Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu(236): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:04:47.4224868Z const auto offset_idx = idx * D_emb; 2025-05-07T20:04:47.4225437Z ^ 2025-05-07T20:04:47.4225719Z 2025-05-07T20:04:47.4226155Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:47.4226814Z 2025-05-07T20:04:48.3954416Z [462/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include 
-isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_models.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_models.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_models.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/eeg_models.cpp 2025-05-07T20:04:49.5816905Z [463/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_ssd_rowwise_adagrad.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_ssd_rowwise_adagrad.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_ssd_rowwise_adagrad.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_ssd_rowwise_adagrad.cpp 2025-05-07T20:04:49.8204510Z [464/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp 2025-05-07T20:04:50.2311885Z [465/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib 
-L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward_split_host.so -o fbgemm_gpu_tbe_training_backward_split_host.so CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adagrad.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_sgd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adam.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lamb.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lars_sgd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_adam.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_lamb.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_none.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_sgd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_weighted_adagrad.cpp.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_ssd_rowwise_adagrad.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_config.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so && : 2025-05-07T20:04:50.9125160Z [466/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_cpu.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/src/layout_transform_ops/layout_transform_ops_cpu.cpp 2025-05-07T20:04:50.9278032Z [467/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp.o.d -o 
CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp 2025-05-07T20:04:52.0371587Z [468/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp.o -MF 
CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp 2025-05-07T20:04:52.6953361Z [469/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp 2025-05-07T20:04:53.3420695Z [470/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_host.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_host.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_host.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/index_select/batch_index_select_dim0_host.cpp 2025-05-07T20:04:53.4236908Z [471/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode 
arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:04:53.9763676Z [472/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp 2025-05-07T20:04:54.1418729Z [473/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_ops_meta.cpp 2025-05-07T20:04:54.6980675Z [474/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_ops_cpu.cpp 2025-05-07T20:04:55.2711329Z [475/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp 2025-05-07T20:04:55.8193489Z [476/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_embedding_inplace_ops_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp 2025-05-07T20:04:55.9477710Z [477/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_embedding_inplace_ops_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp 2025-05-07T20:04:56.0579600Z [478/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.cpp 2025-05-07T20:04:56.2469180Z [479/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/input_combine_ops/input_combine_cpu.cpp 2025-05-07T20:04:56.8764997Z [480/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp 2025-05-07T20:04:59.0343468Z [481/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/topology_utils.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/topology_utils.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/topology_utils.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/topology_utils.cpp 2025-05-07T20:04:59.5720257Z [482/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_ops_meta.cpp 2025-05-07T20:05:01.9542445Z [483/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu.o 2025-05-07T20:05:01.9564818Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu(245): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:05:01.9566617Z const auto offset_idx = idx * D_emb; 2025-05-07T20:05:01.9567184Z ^ 2025-05-07T20:05:01.9567474Z 2025-05-07T20:05:01.9567913Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:01.9568581Z 2025-05-07T20:05:01.9569893Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu(245): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:05:01.9571613Z const auto offset_idx = idx * D_emb; 2025-05-07T20:05:01.9572148Z ^ 2025-05-07T20:05:01.9572388Z 2025-05-07T20:05:01.9572847Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:01.9573493Z 2025-05-07T20:05:01.9574734Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu(245): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:05:01.9576436Z const auto offset_idx = idx * D_emb; 2025-05-07T20:05:01.9576971Z ^ 2025-05-07T20:05:01.9577213Z 2025-05-07T20:05:01.9577668Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:01.9578294Z 2025-05-07T20:05:01.9579699Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu(245): warning #177-D: variable "offset_idx" was declared but never 
referenced 2025-05-07T20:05:01.9581449Z const auto offset_idx = idx * D_emb; 2025-05-07T20:05:01.9582266Z ^ 2025-05-07T20:05:01.9582503Z 2025-05-07T20:05:01.9583279Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:01.9583951Z 2025-05-07T20:05:03.4558602Z [484/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_forward.so -o fbgemm_gpu_tbe_training_forward.so CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp.o 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_cuda.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_v2_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_v2_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_kernel.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel_small.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_tbe_common.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so 
-lcudadevrt -lcudart_static -lrt -lpthread -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build && : 2025-05-07T20:05:03.8570400Z [485/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops_host.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops_host.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops_host.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/src/metric_ops/metric_ops_host.cpp 2025-05-07T20:05:04.3065770Z [486/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_gpu.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/src/layout_transform_ops/layout_transform_ops_gpu.cpp 2025-05-07T20:05:05.1480292Z [487/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_gpu.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/src/input_combine_ops/input_combine_gpu.cpp 2025-05-07T20:05:05.6099592Z [488/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_gpu.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_ops_gpu.cpp 2025-05-07T20:05:07.4737978Z [489/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG 
-std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_dense_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_warp.cu.o 2025-05-07T20:05:07.5965158Z [490/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp 2025-05-07T20:05:08.3151564Z [491/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_utils.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_utils.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_utils.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/eeg_utils.cpp 2025-05-07T20:05:10.0260458Z [492/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator_ops.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator_ops.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator_ops.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/indices_estimator_ops.cpp 2025-05-07T20:05:10.7217573Z [493/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp 2025-05-07T20:05:10.8292752Z [494/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/indices_estimator.cpp 2025-05-07T20:05:11.0018012Z [495/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_dense_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_cuda.cu.o 2025-05-07T20:05:11.0368935Z [496/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp 2025-05-07T20:05:11.4580822Z [497/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/indices_generator.cpp 2025-05-07T20:05:12.1155827Z [498/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator_ops.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator_ops.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator_ops.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/indices_generator_ops.cpp 2025-05-07T20:05:12.1926715Z [499/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp 2025-05-07T20:05:12.5610628Z [500/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu.o 2025-05-07T20:05:12.9658102Z [501/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp.o.d -o 
CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp 2025-05-07T20:05:14.4196537Z [502/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_gpu.cpp.o.d -o 
CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_ops_gpu.cpp 2025-05-07T20:05:19.5568967Z [503/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl 
--expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T20:05:20.2332068Z [504/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:05:22.8994202Z [505/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:05:24.1512766Z [506/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 
-gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_dense_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_warp.cu.o 2025-05-07T20:05:25.3954068Z [507/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_ops_cpu.cpp 2025-05-07T20:05:30.2597624Z [508/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_dense_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_cuda.cu.o 2025-05-07T20:05:35.0200919Z [509/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_cta.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_dense_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_cta.cu.o 2025-05-07T20:05:38.9006056Z [510/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_embedding_backward_split_grad_index_select.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_embedding_backward_split_grad_index_select.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_grad_index_select.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_embedding_backward_split_grad_index_select.cu.o 2025-05-07T20:05:40.5251643Z [511/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T20:05:44.1860051Z [512/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_batch_index_select_dim0_forward_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_codegen_cuda.cu.o 2025-05-07T20:05:47.6596821Z [513/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode 
arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_batch_index_select_dim0_forward_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel.cu.o 2025-05-07T20:05:49.6044278Z [514/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp 2025-05-07T20:05:50.2285756Z [515/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/input_combine_ops/input_combine.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine.cu.o 2025-05-07T20:05:51.2447664Z [516/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_cta.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_dense_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_cta.cu.o 2025-05-07T20:05:55.1541122Z [517/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/memory_utils/memory_utils.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cu.o 2025-05-07T20:06:01.8394367Z [518/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel_small.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel_small.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_batch_index_select_dim0_forward_kernel_small.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel_small.cu.o 2025-05-07T20:06:02.1657472Z [519/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/memory_utils/memory_utils_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cu.o 2025-05-07T20:06:12.9015966Z [520/608] /github/home/miniconda/envs/build_binary/bin/nvcc 
-forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ 
-D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/histogram_binning_calibration_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/histogram_binning_calibration_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/histogram_binning_calibration_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/histogram_binning_calibration_ops.cu.o 2025-05-07T20:06:15.7002552Z [521/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu.o 2025-05-07T20:06:16.4604309Z [522/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_batch_index_select_dim0_backward_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_warp.cu.o 2025-05-07T20:06:16.5290619Z [523/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_batch_index_select_dim0_backward_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_codegen_cuda.cu.o 2025-05-07T20:06:19.7765724Z [524/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode 
arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu.o 2025-05-07T20:06:20.3367465Z [525/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu.o 2025-05-07T20:06:22.0869919Z [526/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T20:06:25.2720664Z [527/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode 
arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu.o 2025-05-07T20:06:25.6451180Z [528/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu.o 2025-05-07T20:06:28.2070746Z [529/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_bmm_forward.cu.o -MF 
CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_bmm_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_dense_bmm_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_bmm_forward.cu.o 2025-05-07T20:06:28.8083755Z [530/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/metric_ops/metric_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops.cu.o 2025-05-07T20:06:29.5822322Z [531/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_add_2d_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_add_2d_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_index_add_2d_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_add_2d_forward.cu.o 2025-05-07T20:06:29.7226747Z [532/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/layout_transform_ops/layout_transform_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops.cu.o 2025-05-07T20:06:31.0981629Z [533/608] /github/home/miniconda/envs/build_binary/bin/nvcc 
-forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ 
-D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_select_2d_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_select_2d_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_index_select_2d_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_select_2d_forward.cu.o 2025-05-07T20:06:31.1175906Z [534/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a 
-Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_tensor_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops.cu.o 2025-05-07T20:06:31.1260634Z [535/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/asmjit.so 2025-05-07T20:06:31.1262772Z ################################################################################ 2025-05-07T20:06:31.1263428Z [CMAKE] Running post-build script ... 2025-05-07T20:06:31.1264356Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/asmjit.so 2025-05-07T20:06:31.1265225Z Removing all RPATHs ... 
2025-05-07T20:06:31.1265689Z ################################################################################ 2025-05-07T20:06:31.1487825Z [536/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm.so 1 2025-05-07T20:06:31.1489932Z ################################################################################ 2025-05-07T20:06:31.1490592Z [CMAKE] Running post-build script ... 2025-05-07T20:06:31.1491499Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm.so 2025-05-07T20:06:31.1492523Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:06:31.1493173Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:06:31.1493974Z ################################################################################ 2025-05-07T20:06:31.2230852Z [537/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:06:31.2233310Z ################################################################################ 2025-05-07T20:06:31.2233970Z [CMAKE] Running post-build script ... 2025-05-07T20:06:31.2234933Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:06:31.2235940Z Removing all RPATHs ... 
2025-05-07T20:06:31.2236369Z ################################################################################ 2025-05-07T20:06:32.9897222Z [538/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda 
-O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_backward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_backward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_softmax_backward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_backward.cu.o 2025-05-07T20:06:33.0017881Z [539/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:06:33.0020310Z ################################################################################ 2025-05-07T20:06:33.0020960Z [CMAKE] Running post-build script ... 2025-05-07T20:06:33.0021932Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:06:33.0025600Z Removing all RPATHs ... 2025-05-07T20:06:33.0026072Z ################################################################################ 2025-05-07T20:06:33.3457652Z [540/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:06:33.3460223Z ################################################################################ 2025-05-07T20:06:33.3460956Z [CMAKE] Running post-build script ... 2025-05-07T20:06:33.3461918Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:06:33.3462896Z Removing all RPATHs ... 
2025-05-07T20:06:33.3463397Z ################################################################################ 2025-05-07T20:06:33.3680675Z [541/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 1 2025-05-07T20:06:33.3683427Z ################################################################################ 2025-05-07T20:06:33.3683969Z [CMAKE] Running post-build script ... 2025-05-07T20:06:33.3685100Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:06:33.3686226Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:06:33.3686936Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:06:33.3687640Z ################################################################################ 2025-05-07T20:06:33.4134090Z [542/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_common.so 1 2025-05-07T20:06:33.4136429Z ################################################################################ 2025-05-07T20:06:33.4137102Z [CMAKE] Running post-build script ... 2025-05-07T20:06:33.4138108Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:06:33.4139140Z Resetting RPATH to $ORIGIN ... 
2025-05-07T20:06:33.4139905Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:06:33.4140672Z ################################################################################ 2025-05-07T20:06:33.5416392Z [543/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:06:33.5418554Z ################################################################################ 2025-05-07T20:06:33.5419149Z [CMAKE] Running post-build script ... 2025-05-07T20:06:33.5420095Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:06:33.5421011Z Removing all RPATHs ... 2025-05-07T20:06:33.5421439Z ################################################################################ 2025-05-07T20:06:33.7098524Z [544/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu.o 2025-05-07T20:06:33.9522441Z [545/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 1 2025-05-07T20:06:33.9525103Z ################################################################################ 2025-05-07T20:06:33.9525808Z [CMAKE] Running post-build script ... 
2025-05-07T20:06:33.9526936Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:06:33.9528015Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:06:33.9528659Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:06:33.9529418Z ################################################################################ 2025-05-07T20:06:33.9907692Z [546/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_softmax_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_forward.cu.o 2025-05-07T20:06:34.0605178Z [547/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 1 2025-05-07T20:06:34.0607528Z ################################################################################ 2025-05-07T20:06:34.0608112Z [CMAKE] Running post-build script ... 2025-05-07T20:06:34.0609179Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:06:34.0610302Z Resetting RPATH to $ORIGIN ... 
2025-05-07T20:06:34.0610981Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:06:34.0611665Z ################################################################################ 2025-05-07T20:06:34.2638383Z [548/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_unique_indices.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_unique_indices.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_unique_indices.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_unique_indices.cu.o 2025-05-07T20:06:34.3264965Z [549/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_embedding_inplace_ops_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update.cu.o -MF CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/embedding_inplace_ops/embedding_inplace_update.cu -o CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update.cu.o 2025-05-07T20:06:34.9128578Z [550/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib 
-Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_embedding_inplace_ops.so -o fbgemm_gpu_embedding_inplace_ops.so CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp.o CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp.o CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so" -Wl,--as-needed -lcudadevrt -lcudart_static -lrt -lpthread -ldl && : 2025-05-07T20:06:34.9474384Z [551/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:06:34.9476788Z ################################################################################ 2025-05-07T20:06:34.9477351Z [CMAKE] Running post-build script ... 2025-05-07T20:06:34.9478372Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:06:34.9479373Z Removing all RPATHs ... 2025-05-07T20:06:34.9479851Z ################################################################################ 2025-05-07T20:06:35.1758150Z [552/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_inference.so 1 2025-05-07T20:06:35.1759671Z ################################################################################ 2025-05-07T20:06:35.1760080Z [CMAKE] Running post-build script ... 2025-05-07T20:06:35.1760678Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:06:35.1761302Z Resetting RPATH to $ORIGIN ... 
2025-05-07T20:06:35.1761693Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:06:35.1762160Z ################################################################################ 2025-05-07T20:06:36.7150735Z [553/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu.o 2025-05-07T20:06:36.7270088Z [554/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu.o 2025-05-07T20:06:36.8364320Z [555/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/dense_to_jagged_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/dense_to_jagged_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/dense_to_jagged_forward.cu -o 
CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/dense_to_jagged_forward.cu.o 2025-05-07T20:06:38.2279456Z [556/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 
-Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu.o 2025-05-07T20:06:38.8345442Z [557/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_forward.so 1 2025-05-07T20:06:38.8347966Z ################################################################################ 2025-05-07T20:06:38.8348579Z [CMAKE] Running post-build script ... 2025-05-07T20:06:38.8349677Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:06:38.8350846Z Resetting RPATH to $ORIGIN ... 
2025-05-07T20:06:38.8351574Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:06:38.8352327Z ################################################################################ 2025-05-07T20:06:39.3599190Z [558/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu.o 2025-05-07T20:06:39.3877090Z [559/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_batch_index_select_dim0_backward_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_cta.cu.o 2025-05-07T20:06:40.0465355Z [560/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib 
-Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_index_select.so -o fbgemm_gpu_tbe_index_select.so CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_ops.cpp.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_host.cpp.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel_small.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_embedding_backward_split_grad_index_select.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_sparse_async_cumsum.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 
-Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -lrt -lpthread -ldl && : 2025-05-07T20:06:40.1920433Z [561/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include 
-isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu.o
2025-05-07T20:06:40.4824595Z [562/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_index_select.so 1
2025-05-07T20:06:40.4826513Z ################################################################################
2025-05-07T20:06:40.4827080Z [CMAKE] Running post-build script ...
2025-05-07T20:06:40.4827955Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_index_select.so
2025-05-07T20:06:40.4828978Z Resetting RPATH to $ORIGIN ...
2025-05-07T20:06:40.4829532Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN]
2025-05-07T20:06:40.4830124Z ################################################################################
2025-05-07T20:06:42.3724008Z [563/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu.o 2025-05-07T20:06:42.5952080Z [564/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fp8_rowwise.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fp8_rowwise.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fp8_rowwise.cu.o 2025-05-07T20:06:46.6599388Z [565/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_bfloat16.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_bfloat16.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_bfloat16.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_bfloat16.cu.o 2025-05-07T20:06:49.1456058Z [566/608] /github/home/miniconda/envs/build_binary/bin/nvcc 
-forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ 
-D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_8bit_rowwise.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_8bit_rowwise.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_8bit_rowwise.cu.o 2025-05-07T20:06:52.7669688Z [567/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_nbit_rowwise.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_nbit_rowwise.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_nbit_rowwise.cu.o 2025-05-07T20:06:54.6296383Z [568/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include 
-isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_mx.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_mx.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_mx.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_mx.cu.o 2025-05-07T20:07:01.4680376Z [569/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_hfp8.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_hfp8.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_hfp8.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_hfp8.cu.o 2025-05-07T20:07:01.7146095Z [570/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_msfp.cu.o -MF 
CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_msfp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_msfp.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_msfp.cu.o 2025-05-07T20:07:01.9521372Z [571/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu.o 2025-05-07T20:07:03.1430209Z [572/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_padded_fp8_rowwise.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_padded_fp8_rowwise.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_padded_fp8_rowwise.cu.o 2025-05-07T20:07:10.0666631Z [573/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu.o 
2025-05-07T20:07:12.1553002Z [574/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion 
-Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_bucketize_features.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_bucketize_features.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_bucketize_features.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_bucketize_features.cu.o 2025-05-07T20:07:15.4087884Z [575/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 
-gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_batched_unary_embeddings.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_batched_unary_embeddings.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_batched_unary_embeddings.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_batched_unary_embeddings.cu.o 2025-05-07T20:07:15.9259903Z [576/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cu.o
2025-05-07T20:07:15.9280812Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.cu(16): warning #177-D: variable "kMaxThreads" was declared but never referenced
2025-05-07T20:07:15.9282493Z static constexpr uint32_t kMaxThreads = 1024;
2025-05-07T20:07:15.9283051Z ^
2025-05-07T20:07:15.9283366Z
2025-05-07T20:07:15.9283787Z Remark: The warnings can be suppressed with "-diag-suppress <warning-number>"
2025-05-07T20:07:15.9284448Z
2025-05-07T20:07:15.9285477Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.cu(16): warning #177-D: variable "kMaxThreads" was declared but never referenced
2025-05-07T20:07:15.9286909Z static constexpr uint32_t kMaxThreads = 1024;
2025-05-07T20:07:15.9287511Z ^
2025-05-07T20:07:15.9288287Z
2025-05-07T20:07:15.9288735Z Remark: The warnings can be suppressed with "-diag-suppress <warning-number>"
2025-05-07T20:07:15.9289392Z
2025-05-07T20:07:15.9290361Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.cu(16): warning #177-D: variable "kMaxThreads" was declared but never referenced
2025-05-07T20:07:15.9291793Z static constexpr uint32_t kMaxThreads = 1024;
2025-05-07T20:07:15.9292428Z ^
2025-05-07T20:07:15.9292734Z
2025-05-07T20:07:15.9293160Z Remark: The warnings can be suppressed with "-diag-suppress <warning-number>"
2025-05-07T20:07:15.9293975Z
2025-05-07T20:07:15.9294916Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.cu(16): warning #177-D: variable "kMaxThreads" was declared but never referenced
2025-05-07T20:07:15.9296376Z static constexpr uint32_t kMaxThreads = 1024;
2025-05-07T20:07:15.9296959Z ^
2025-05-07T20:07:15.9297232Z
2025-05-07T20:07:15.9297669Z Remark: The warnings can be suppressed with "-diag-suppress <warning-number>"
2025-05-07T20:07:15.9298352Z
2025-05-07T20:07:18.3086449Z [577/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include
-isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_compute_frequency_sequence.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_compute_frequency_sequence.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_compute_frequency_sequence.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_compute_frequency_sequence.cu.o 2025-05-07T20:07:20.7692052Z [578/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_expand_into_jagged_permute.cu.o -MF 
CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_expand_into_jagged_permute.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_expand_into_jagged_permute.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_expand_into_jagged_permute.cu.o 2025-05-07T20:07:29.6876870Z [579/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_block_bucketize_features.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_block_bucketize_features.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_block_bucketize_features.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_block_bucketize_features.cu.o 2025-05-07T20:07:32.0818388Z [580/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_group_index.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_group_index.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_group_index.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_group_index.cu.o 2025-05-07T20:07:33.6993814Z [581/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_add.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_add.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_index_add.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_add.cu.o 2025-05-07T20:07:33.7515065Z [582/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED 
-DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_invert_permute.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_invert_permute.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_invert_permute.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_invert_permute.cu.o 2025-05-07T20:07:34.4303122Z [583/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_select.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_select.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_index_select.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_select.cu.o 2025-05-07T20:07:38.4089070Z [584/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_backward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_backward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_pack_segments_backward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_backward.cu.o 2025-05-07T20:07:42.5937941Z [585/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute102.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute102.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_permute102.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute102.cu.o 2025-05-07T20:07:43.3094547Z [586/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL 
-DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_1d.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_1d.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_permute_1d.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_1d.cu.o 2025-05-07T20:07:43.3610875Z [587/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_pack_segments_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_forward.cu.o 2025-05-07T20:07:44.7659851Z [588/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include 
-isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_embeddings.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_embeddings.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_permute_embeddings.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_embeddings.cu.o 2025-05-07T20:07:45.2632595Z [589/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu.o 2025-05-07T20:07:45.8553927Z [590/608] /github/home/miniconda/envs/build_binary/bin/nvcc 
-forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ 
-D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_range.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_range.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_range.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_range.cu.o 2025-05-07T20:07:47.0184276Z [591/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_zipf.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_zipf.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_zipf.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_zipf.cu.o 2025-05-07T20:07:48.1203357Z [592/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_2d.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_2d.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_permute_2d.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_2d.cu.o 2025-05-07T20:07:48.1471331Z [593/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_segment_sum_csr.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_segment_sum_csr.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_segment_sum_csr.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_segment_sum_csr.cu.o 2025-05-07T20:07:49.8344243Z [594/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE 
-Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_reorder_batched_ad.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_reorder_batched_ad.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_reorder_batched_ad.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_reorder_batched_ad.cu.o 2025-05-07T20:07:50.6186387Z [595/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_py.so -o fbgemm_gpu_py.so CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp.o 
CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_meta.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_meta.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_models.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_utils.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator_ops.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator_ops.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops_host.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/topology_utils.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/histogram_binning_calibration_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine.cu.o 
CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/dense_to_jagged_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_bmm_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_add_2d_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_select_2d_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_backward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_unique_indices.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu.o 
CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_bfloat16.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fp8_rowwise.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_8bit_rowwise.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_nbit_rowwise.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_hfp8.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_msfp.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_padded_fp8_rowwise.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_mx.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_block_bucketize_features.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_bucketize_features.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_batched_unary_embeddings.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_compute_frequency_sequence.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_expand_into_jagged_permute.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_group_index.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_add.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_select.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_invert_permute.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_backward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_1d.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_2d.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute102.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_embeddings.cu.o 
CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_range.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_reorder_batched_ad.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_segment_sum_csr.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_zipf.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm.so fbgemm_gpu_embedding_inplace_ops.so fbgemm_gpu_tbe_index_select.so fbgemm_gpu_tbe_cache.so fbgemm_gpu_tbe_optimizers.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so" -Wl,--as-needed fbgemm_gpu_sparse_async_cumsum.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -lrt -lpthread -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build && : 2025-05-07T20:07:50.8120515Z [596/608] cd 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_py.so 1
2025-05-07T20:07:50.8121828Z ################################################################################
2025-05-07T20:07:50.8122206Z [CMAKE] Running post-build script ...
2025-05-07T20:07:50.8122762Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_py.so
2025-05-07T20:07:50.8123302Z Resetting RPATH to $ORIGIN ...
2025-05-07T20:07:50.8123681Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN]
2025-05-07T20:07:50.8124109Z ################################################################################
2025-05-07T20:09:01.1696713Z [597/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu.o 2025-05-07T20:09:01.4488703Z [598/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T20:09:07.6819337Z [599/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode 
arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu.o 2025-05-07T20:09:09.5073857Z [600/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward.so -o fbgemm_gpu_tbe_training_backward.so 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/codegen/training/backward/embedding_backward_dense_host_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adagrad_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_sgd_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adam_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lamb_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lars_sgd_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_none_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_sgd_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_cpu.cpp.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_grad_embedding_ops.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_indice_weights_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_indice_weights_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_cuda.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_cuda.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_tbe_cache.so fbgemm_gpu_tbe_common.so fbgemm_gpu_sparse_async_cumsum.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so" -Wl,--as-needed fbgemm.so fbgemm_gpu_config.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so 
-lcudadevrt -lcudart_static -lrt -lpthread -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build && : 2025-05-07T20:09:10.1439239Z [601/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward_dense.so -o fbgemm_gpu_tbe_training_backward_dense.so CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_split_dense.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_warp.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_tbe_training_backward.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -lrt -lpthread -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build && : 2025-05-07T20:09:10.2697629Z 
[602/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward_gwd.so -o fbgemm_gpu_tbe_training_backward_gwd.so CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_tbe_training_backward.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -lrt -lpthread -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build -L"/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs" -L"/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib" && : 2025-05-07T20:09:10.3223097Z [603/608] cd 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 1 2025-05-07T20:09:10.3224459Z ################################################################################ 2025-05-07T20:09:10.3224882Z [CMAKE] Running post-build script ... 2025-05-07T20:09:10.3225567Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:09:10.3226229Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:09:10.3226641Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:09:10.3227080Z ################################################################################ 2025-05-07T20:09:10.7493912Z [604/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward.so 1 2025-05-07T20:09:10.7497797Z ################################################################################ 2025-05-07T20:09:10.7498840Z [CMAKE] Running post-build script ... 2025-05-07T20:09:10.7500374Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:09:10.7501044Z Resetting RPATH to $ORIGIN ... 
2025-05-07T20:09:10.7501503Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:09:10.7501965Z ################################################################################ 2025-05-07T20:09:11.9225935Z [605/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 1 2025-05-07T20:09:11.9228812Z ################################################################################ 2025-05-07T20:09:11.9229188Z [CMAKE] Running post-build script ... 2025-05-07T20:09:11.9229966Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:09:11.9230624Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:09:11.9231092Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:09:11.9231549Z ################################################################################ 2025-05-07T20:09:18.0528714Z [606/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward_vbe.so -o fbgemm_gpu_tbe_training_backward_vbe.so 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 
-Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_tbe_training_backward.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -lrt -lpthread -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build && : 2025-05-07T20:09:20.7018659Z [607/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 1 2025-05-07T20:09:20.7020655Z ################################################################################ 2025-05-07T20:09:20.7021046Z [CMAKE] Running post-build script ... 
2025-05-07T20:09:20.7021844Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:09:20.7022514Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:09:20.7022940Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:09:20.7023387Z ################################################################################ 2025-05-07T20:09:20.7024509Z [607/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build && /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/cmake/data/bin/cmake -P cmake_install.cmake 2025-05-07T20:09:20.7074967Z -- Install configuration: "Release" 2025-05-07T20:09:20.7076272Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/asmjit.so 2025-05-07T20:09:20.7104262Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm.so 2025-05-07T20:09:20.7105253Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_cache.so 2025-05-07T20:09:20.7125226Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_inference.so 2025-05-07T20:09:20.7126352Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm_gpu_config.so 2025-05-07T20:09:20.7148745Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_utils.so 2025-05-07T20:09:20.7169382Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:09:20.7170460Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_common.so 2025-05-07T20:09:20.7171505Z -- Installing: 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:09:20.7185138Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:09:20.7186234Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:09:20.7187387Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:09:20.7188733Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:09:20.7189845Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:09:20.7191298Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:09:20.7192454Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:09:20.7193581Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:09:20.7195648Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adagrad.py 2025-05-07T20:09:20.7196947Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad.py 2025-05-07T20:09:20.7198275Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_sgd.py 
2025-05-07T20:09:20.7199600Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adam.py 2025-05-07T20:09:20.7200884Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lamb.py 2025-05-07T20:09:20.7202154Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lars_sgd.py 2025-05-07T20:09:20.7203510Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_adam.py 2025-05-07T20:09:20.7204877Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_lamb.py 2025-05-07T20:09:20.7206194Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_none.py 2025-05-07T20:09:20.7207562Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_with_counter.py 2025-05-07T20:09:20.7208950Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_ssd.py 2025-05-07T20:09:20.7210265Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/__init__.py 2025-05-07T20:09:20.7211500Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args.py 2025-05-07T20:09:20.7212743Z -- Installing: 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args_ssd.py 2025-05-07T20:09:20.7214121Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_optimizer_codegen/split_embedding_optimizer_rowwise_adagrad.py 2025-05-07T20:09:20.7215465Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_optimizer_codegen/optimizer_args.py 2025-05-07T20:09:20.7216599Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:09:20.7226978Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm_gpu_py.so 2025-05-07T20:09:20.7281392Z 2025-05-07T20:09:20.7338647Z 2025-05-07T20:09:20.7339331Z copying fbgemm_gpu/__init__.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/__init__.py 2025-05-07T20:09:20.7340568Z copying fbgemm_gpu/batched_unary_embeddings_ops.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/batched_unary_embeddings_ops.py 2025-05-07T20:09:20.7341804Z copying fbgemm_gpu/enums.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/enums.py 2025-05-07T20:09:20.7342622Z copying fbgemm_gpu/metrics.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/metrics.py 2025-05-07T20:09:20.7344026Z copying fbgemm_gpu/permute_pooled_embedding_modules.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules.py 2025-05-07T20:09:20.7345497Z copying fbgemm_gpu/permute_pooled_embedding_modules_split.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules_split.py 2025-05-07T20:09:20.7346617Z copying fbgemm_gpu/quantize_comm.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/quantize_comm.py 2025-05-07T20:09:20.7347468Z copying fbgemm_gpu/quantize_utils.py -> 
_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/quantize_utils.py 2025-05-07T20:09:20.7348366Z copying fbgemm_gpu/runtime_monitor.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/runtime_monitor.py 2025-05-07T20:09:20.7349279Z copying fbgemm_gpu/sparse_ops.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sparse_ops.py 2025-05-07T20:09:20.7350301Z copying fbgemm_gpu/split_embedding_configs.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_configs.py 2025-05-07T20:09:20.7351453Z copying fbgemm_gpu/split_embedding_inference_converter.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_inference_converter.py 2025-05-07T20:09:20.7352623Z copying fbgemm_gpu/split_embedding_optimizer_ops.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_optimizer_ops.py 2025-05-07T20:09:20.7353696Z copying fbgemm_gpu/split_embedding_utils.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_utils.py 2025-05-07T20:09:20.7354804Z copying fbgemm_gpu/split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops.py 2025-05-07T20:09:20.7356063Z copying fbgemm_gpu/split_table_batched_embeddings_ops_common.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_common.py 2025-05-07T20:09:20.7357494Z copying fbgemm_gpu/split_table_batched_embeddings_ops_inference.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_inference.py 2025-05-07T20:09:20.7358884Z copying fbgemm_gpu/split_table_batched_embeddings_ops_training.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training.py 2025-05-07T20:09:20.7360287Z copying fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py 2025-05-07T20:09:20.7361671Z 
copying fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py 2025-05-07T20:09:20.7362809Z copying fbgemm_gpu/tbe_input_multiplexer.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe_input_multiplexer.py 2025-05-07T20:09:20.7363632Z copying fbgemm_gpu/uvm.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/uvm.py 2025-05-07T20:09:20.7364312Z creating directory _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/config 2025-05-07T20:09:20.7365052Z copying fbgemm_gpu/config/__init__.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/config/__init__.py 2025-05-07T20:09:20.7365991Z copying fbgemm_gpu/config/feature_list.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/config/feature_list.py 2025-05-07T20:09:20.7366797Z creating directory _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/docs 2025-05-07T20:09:20.7367502Z copying fbgemm_gpu/docs/__init__.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/docs/__init__.py 2025-05-07T20:09:20.7368335Z copying fbgemm_gpu/docs/common.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/docs/common.py 2025-05-07T20:09:20.7369168Z copying fbgemm_gpu/docs/examples.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/docs/examples.py 2025-05-07T20:09:20.7370165Z copying fbgemm_gpu/docs/jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/docs/jagged_tensor_ops.py 2025-05-07T20:09:20.7371262Z copying fbgemm_gpu/docs/merge_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/docs/merge_pooled_embedding_ops.py 2025-05-07T20:09:20.7372434Z copying fbgemm_gpu/docs/permute_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/docs/permute_pooled_embedding_ops.py 2025-05-07T20:09:20.7373735Z copying fbgemm_gpu/docs/quantize_ops.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/docs/quantize_ops.py 
2025-05-07T20:09:20.7374659Z copying fbgemm_gpu/docs/sparse_ops.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/docs/sparse_ops.py 2025-05-07T20:09:20.7375510Z copying fbgemm_gpu/docs/version.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/docs/version.py 2025-05-07T20:09:20.7376303Z creating directory _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/quantize 2025-05-07T20:09:20.7377082Z copying fbgemm_gpu/quantize/__init__.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/quantize/__init__.py 2025-05-07T20:09:20.7378094Z copying fbgemm_gpu/quantize/quantize_ops.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/quantize/quantize_ops.py 2025-05-07T20:09:20.7378910Z creating directory _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll 2025-05-07T20:09:20.7379694Z copying fbgemm_gpu/sll/__init__.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/__init__.py 2025-05-07T20:09:20.7380423Z creating directory _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe 2025-05-07T20:09:20.7381106Z copying fbgemm_gpu/tbe/__init__.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/__init__.py 2025-05-07T20:09:20.7381844Z creating directory _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/triton 2025-05-07T20:09:20.7382792Z copying fbgemm_gpu/triton/__init__.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/triton/__init__.py 2025-05-07T20:09:20.7383643Z copying fbgemm_gpu/triton/common.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/triton/common.py 2025-05-07T20:09:20.7384534Z copying fbgemm_gpu/triton/quantize.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/triton/quantize.py 2025-05-07T20:09:20.7385440Z copying fbgemm_gpu/triton/quantize_ref.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/triton/quantize_ref.py 2025-05-07T20:09:20.7386260Z creating directory _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/utils 2025-05-07T20:09:20.7387013Z copying fbgemm_gpu/utils/__init__.py -> 
_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/utils/__init__.py 2025-05-07T20:09:20.7387871Z copying fbgemm_gpu/utils/filestore.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/utils/filestore.py 2025-05-07T20:09:20.7388755Z copying fbgemm_gpu/utils/loader.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/utils/loader.py 2025-05-07T20:09:20.7389641Z copying fbgemm_gpu/utils/torch_library.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/utils/torch_library.py 2025-05-07T20:09:20.7390467Z creating directory _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/cpu 2025-05-07T20:09:20.7391252Z copying fbgemm_gpu/sll/cpu/__init__.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/cpu/__init__.py 2025-05-07T20:09:20.7392120Z copying fbgemm_gpu/sll/cpu/cpu_sll.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/cpu/cpu_sll.py 2025-05-07T20:09:20.7392904Z creating directory _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/meta 2025-05-07T20:09:20.7393660Z copying fbgemm_gpu/sll/meta/__init__.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/meta/__init__.py 2025-05-07T20:09:20.7394573Z copying fbgemm_gpu/sll/meta/meta_sll.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/meta/meta_sll.py 2025-05-07T20:09:20.7395378Z creating directory _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton 2025-05-07T20:09:20.7396235Z copying fbgemm_gpu/sll/triton/__init__.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/__init__.py 2025-05-07T20:09:20.7397189Z copying fbgemm_gpu/sll/triton/common.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/common.py 2025-05-07T20:09:20.7398382Z copying fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py 2025-05-07T20:09:20.7399777Z copying fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py -> 
_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py 2025-05-07T20:09:20.7401021Z copying fbgemm_gpu/sll/triton/triton_jagged_bmm.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm.py 2025-05-07T20:09:20.7402205Z copying fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py 2025-05-07T20:09:20.7403627Z copying fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py 2025-05-07T20:09:20.7405202Z copying fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py 2025-05-07T20:09:20.7406718Z copying fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py 2025-05-07T20:09:20.7408176Z copying fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py 2025-05-07T20:09:20.7409687Z copying fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py 2025-05-07T20:09:20.7411046Z copying fbgemm_gpu/sll/triton/triton_jagged_softmax.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_softmax.py 2025-05-07T20:09:20.7412417Z copying fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py 2025-05-07T20:09:20.7413530Z creating directory _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/bench 
2025-05-07T20:09:20.7414321Z copying fbgemm_gpu/tbe/bench/__init__.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/bench/__init__.py 2025-05-07T20:09:20.7415319Z copying fbgemm_gpu/tbe/bench/bench_config.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/bench/bench_config.py 2025-05-07T20:09:20.7416306Z copying fbgemm_gpu/tbe/bench/bench_runs.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/bench/bench_runs.py 2025-05-07T20:09:20.7417279Z copying fbgemm_gpu/tbe/bench/eeg_cli.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/bench/eeg_cli.py 2025-05-07T20:09:20.7418413Z copying fbgemm_gpu/tbe/bench/embedding_ops_common_config.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/bench/embedding_ops_common_config.py 2025-05-07T20:09:20.7419657Z copying fbgemm_gpu/tbe/bench/eval_compression.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/bench/eval_compression.py 2025-05-07T20:09:20.7420712Z copying fbgemm_gpu/tbe/bench/reporter.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/bench/reporter.py 2025-05-07T20:09:20.7421739Z copying fbgemm_gpu/tbe/bench/tbe_data_config.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config.py 2025-05-07T20:09:20.7422834Z copying fbgemm_gpu/tbe/bench/tbe_data_config_loader.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_loader.py 2025-05-07T20:09:20.7424070Z copying fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py 2025-05-07T20:09:20.7425198Z copying fbgemm_gpu/tbe/bench/utils.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/bench/utils.py 2025-05-07T20:09:20.7425970Z creating directory _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/cache 2025-05-07T20:09:20.7426773Z copying fbgemm_gpu/tbe/cache/__init__.py -> 
_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/cache/__init__.py 2025-05-07T20:09:20.7427869Z copying fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py 2025-05-07T20:09:20.7428825Z creating directory _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:20.7429600Z copying fbgemm_gpu/tbe/ssd/__init__.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/ssd/__init__.py 2025-05-07T20:09:20.7430458Z copying fbgemm_gpu/tbe/ssd/common.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/ssd/common.py 2025-05-07T20:09:20.7431419Z copying fbgemm_gpu/tbe/ssd/inference.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/ssd/inference.py 2025-05-07T20:09:20.7432368Z copying fbgemm_gpu/tbe/ssd/training.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/ssd/training.py 2025-05-07T20:09:20.7433167Z creating directory _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/utils 2025-05-07T20:09:20.7433962Z copying fbgemm_gpu/tbe/utils/__init__.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/utils/__init__.py 2025-05-07T20:09:20.7434859Z copying fbgemm_gpu/tbe/utils/common.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/utils/common.py 2025-05-07T20:09:20.7435793Z copying fbgemm_gpu/tbe/utils/offsets.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/utils/offsets.py 2025-05-07T20:09:20.7436726Z copying fbgemm_gpu/tbe/utils/quantize.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/utils/quantize.py 2025-05-07T20:09:20.7437692Z copying fbgemm_gpu/tbe/utils/requests.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/utils/requests.py 2025-05-07T20:09:20.7438516Z creating directory _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/stats 2025-05-07T20:09:20.7439299Z copying fbgemm_gpu/tbe/stats/__init__.py -> 
_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/stats/__init__.py 2025-05-07T20:09:20.7440374Z copying fbgemm_gpu/tbe/stats/bench_params_reporter.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/stats/bench_params_reporter.py 2025-05-07T20:09:20.7441297Z creating directory _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:20.7442154Z copying fbgemm_gpu/tbe/ssd/utils/__init__.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/ssd/utils/__init__.py 2025-05-07T20:09:20.7443381Z copying fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py 2025-05-07T20:09:20.7444421Z creating directory _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/triton/jagged 2025-05-07T20:09:20.7445278Z copying fbgemm_gpu/triton/jagged/__init__.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/triton/jagged/__init__.py 2025-05-07T20:09:20.7446410Z copying fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py 2025-05-07T20:09:20.7447140Z 2025-05-07T20:09:20.7554382Z INFO:root:running bdist_wheel 2025-05-07T20:09:20.7600070Z INFO:root:running build 2025-05-07T20:09:20.7600409Z INFO:root:running build_py 2025-05-07T20:09:20.7604199Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:20.7606037Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:20.7608153Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/batched_unary_embeddings_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:20.7609541Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/enums.py -> 
_skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:20.7610955Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/metrics.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:20.7612625Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:20.7614514Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules_split.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:20.7615997Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/quantize_comm.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:20.7617945Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/quantize_utils.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:20.7619322Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/runtime_monitor.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:20.7620869Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sparse_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:20.7622980Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_configs.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:20.7624556Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_inference_converter.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:20.7626148Z INFO:root:copying 
_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_optimizer_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:20.7627591Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_utils.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:20.7629007Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:20.7630540Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_common.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:20.7632200Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_inference.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:20.7634645Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:20.7637644Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:20.7639201Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:20.7640658Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe_input_multiplexer.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:20.7642018Z INFO:root:copying 
_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/uvm.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:20.7644084Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/config 2025-05-07T20:09:20.7645356Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/config/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/config 2025-05-07T20:09:20.7647113Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/config/feature_list.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/config 2025-05-07T20:09:20.7649768Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/docs 2025-05-07T20:09:20.7650958Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/docs/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/docs 2025-05-07T20:09:20.7652596Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/docs/common.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/docs 2025-05-07T20:09:20.7654076Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/docs/examples.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/docs 2025-05-07T20:09:20.7655543Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/docs/jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/docs 2025-05-07T20:09:20.7657087Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/docs/merge_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/docs 2025-05-07T20:09:20.7658605Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/docs/permute_pooled_embedding_ops.py -> 
_skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/docs 2025-05-07T20:09:20.7660155Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/docs/quantize_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/docs 2025-05-07T20:09:20.7661572Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/docs/sparse_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/docs 2025-05-07T20:09:20.7663632Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/docs/version.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/docs 2025-05-07T20:09:20.7665704Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/quantize 2025-05-07T20:09:20.7666913Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/quantize/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/quantize 2025-05-07T20:09:20.7668566Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/quantize/quantize_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/quantize 2025-05-07T20:09:20.7670502Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll 2025-05-07T20:09:20.7671705Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll 2025-05-07T20:09:20.7673814Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe 2025-05-07T20:09:20.7675127Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe 2025-05-07T20:09:20.7677381Z INFO:root:creating 
_skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/triton 2025-05-07T20:09:20.7678586Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/triton/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/triton 2025-05-07T20:09:20.7680122Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/triton/common.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/triton 2025-05-07T20:09:20.7681604Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/triton/quantize.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/triton 2025-05-07T20:09:20.7683681Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/triton/quantize_ref.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/triton 2025-05-07T20:09:20.7685908Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/utils 2025-05-07T20:09:20.7687185Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/utils/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/utils 2025-05-07T20:09:20.7688682Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/utils/filestore.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/utils 2025-05-07T20:09:20.7690178Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/utils/loader.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/utils 2025-05-07T20:09:20.7691793Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/utils/torch_library.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/utils 2025-05-07T20:09:20.7693846Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/cpu 2025-05-07T20:09:20.7695033Z 
INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/cpu/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/cpu 2025-05-07T20:09:20.7696653Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/cpu/cpu_sll.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/cpu 2025-05-07T20:09:20.7699242Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/meta 2025-05-07T20:09:20.7700534Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/meta/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/meta 2025-05-07T20:09:20.7702284Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/meta/meta_sll.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/meta 2025-05-07T20:09:20.7705398Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton 2025-05-07T20:09:20.7706593Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton 2025-05-07T20:09:20.7708268Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/common.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton 2025-05-07T20:09:20.7709943Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton 2025-05-07T20:09:20.7711611Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton 2025-05-07T20:09:20.7713234Z INFO:root:copying 
_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton 2025-05-07T20:09:20.7714905Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton 2025-05-07T20:09:20.7716564Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton 2025-05-07T20:09:20.7718359Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton 2025-05-07T20:09:20.7720130Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton 2025-05-07T20:09:20.7721822Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton 2025-05-07T20:09:20.7723597Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton 2025-05-07T20:09:20.7725326Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_softmax.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton 2025-05-07T20:09:20.7727005Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py -> 
_skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton 2025-05-07T20:09:20.7728581Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/bench 2025-05-07T20:09:20.7729862Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/bench/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/bench 2025-05-07T20:09:20.7731539Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/bench/bench_config.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/bench 2025-05-07T20:09:20.7733075Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/bench/bench_runs.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/bench 2025-05-07T20:09:20.7735060Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/bench/eeg_cli.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/bench 2025-05-07T20:09:20.7736714Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/bench/embedding_ops_common_config.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/bench 2025-05-07T20:09:20.7738278Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/bench/eval_compression.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/bench 2025-05-07T20:09:20.7739895Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/bench/reporter.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/bench 2025-05-07T20:09:20.7741400Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/bench 2025-05-07T20:09:20.7743277Z INFO:root:copying 
_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_loader.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/bench 2025-05-07T20:09:20.7744863Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/bench 2025-05-07T20:09:20.7746457Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/bench/utils.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/bench 2025-05-07T20:09:20.7748389Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/cache 2025-05-07T20:09:20.7749542Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/cache/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/cache 2025-05-07T20:09:20.7751229Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/cache 2025-05-07T20:09:20.7753355Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:20.7754507Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/ssd/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:20.7756753Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/ssd/common.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:20.7758336Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/ssd/inference.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:20.7759877Z INFO:root:copying 
_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/ssd/training.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:20.7762876Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/utils 2025-05-07T20:09:20.7764097Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/utils/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/utils 2025-05-07T20:09:20.7766076Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/utils/common.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/utils 2025-05-07T20:09:20.7767606Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/utils/offsets.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/utils 2025-05-07T20:09:20.7769102Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/utils/quantize.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/utils 2025-05-07T20:09:20.7770638Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/utils/requests.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/utils 2025-05-07T20:09:20.7772795Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/stats 2025-05-07T20:09:20.7773969Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/stats/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/stats 2025-05-07T20:09:20.7775696Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/stats/bench_params_reporter.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/stats 2025-05-07T20:09:20.7777733Z INFO:root:creating 
_skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:20.7779069Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/ssd/utils/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:20.7781212Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:20.7783752Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/triton/jagged 2025-05-07T20:09:20.7784986Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/triton/jagged/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/triton/jagged 2025-05-07T20:09:20.7786586Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/triton/jagged 2025-05-07T20:09:20.7838751Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/asmjit.so -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:20.7882653Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm.so -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:20.8211526Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_cache.so -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:20.9283443Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_inference.so -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:24.3631198Z INFO:root:copying 
_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm_gpu_config.so -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:24.3636555Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_utils.so -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:24.4944092Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm_gpu_sparse_async_cumsum.so -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:24.5055849Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_common.so -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:24.5276863Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_optimizers.so -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:24.5980140Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_forward.so -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:27.3358195Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_pt2.so -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:27.3910936Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward.so -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:34.8321709Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_gwd.so -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:35.9650796Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_vbe.so -> 
_skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:38.5937946Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_dense.so -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:39.0585277Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_split_host.so -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:39.0872094Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_index_select.so -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:39.3582650Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:39.3601887Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adagrad.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:39.3604159Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:39.3606239Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_sgd.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:39.3608734Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adam.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_codegen_lookup_invokers 
2025-05-07T20:09:39.3615188Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lamb.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:39.3627378Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lars_sgd.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:39.3634483Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_adam.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:39.3657677Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_lamb.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:39.3662775Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_none.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:39.3674670Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_with_counter.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:39.3681803Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_ssd.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_codegen_lookup_invokers 
2025-05-07T20:09:39.3688341Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:39.3697987Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:39.3703291Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args_ssd.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:39.3707751Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:09:39.3709380Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_optimizer_codegen/split_embedding_optimizer_rowwise_adagrad.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:09:39.3727406Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_optimizer_codegen/optimizer_args.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:09:39.3742864Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm_gpu_embedding_inplace_ops.so -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:39.3779066Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm_gpu_py.so -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:40.0424321Z INFO:root:copying 
_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:40.0425954Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/batched_unary_embeddings_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:40.0427337Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/enums.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:40.0428611Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/metrics.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:40.0430234Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:40.0431943Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules_split.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:40.0433549Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/quantize_comm.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:40.0435250Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/quantize_utils.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:40.0436891Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/runtime_monitor.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:40.0438548Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sparse_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:40.0440746Z INFO:root:copying 
_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_configs.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:40.0442465Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_inference_converter.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:40.0444194Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_optimizer_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:40.0445823Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_utils.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:40.0447410Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:40.0449907Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_common.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:40.0452022Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_inference.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:40.0454532Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:40.0457939Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:40.0459752Z INFO:root:copying 
_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:40.0461535Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe_input_multiplexer.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:40.0463052Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/uvm.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:40.0464770Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/config/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/config 2025-05-07T20:09:40.0466518Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/config/feature_list.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/config 2025-05-07T20:09:40.0468085Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/docs/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/docs 2025-05-07T20:09:40.0469750Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/docs/common.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/docs 2025-05-07T20:09:40.0471513Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/docs/examples.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/docs 2025-05-07T20:09:40.0473104Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/docs/jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/docs 2025-05-07T20:09:40.0474782Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/docs/merge_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/docs 
2025-05-07T20:09:40.0477060Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/docs/permute_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/docs 2025-05-07T20:09:40.0478887Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/docs/quantize_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/docs 2025-05-07T20:09:40.0480506Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/docs/sparse_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/docs 2025-05-07T20:09:40.0482469Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/docs/version.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/docs 2025-05-07T20:09:40.0484562Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/quantize/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/quantize 2025-05-07T20:09:40.0488138Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/quantize/quantize_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/quantize 2025-05-07T20:09:40.0489661Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll 2025-05-07T20:09:40.0491368Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe 2025-05-07T20:09:40.0493113Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/triton/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/triton 2025-05-07T20:09:40.0494769Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/triton/common.py -> 
_skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/triton 2025-05-07T20:09:40.0496389Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/triton/quantize.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/triton 2025-05-07T20:09:40.0498193Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/triton/quantize_ref.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/triton 2025-05-07T20:09:40.0499932Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/utils/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/utils 2025-05-07T20:09:40.0502230Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/utils/filestore.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/utils 2025-05-07T20:09:40.0503735Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/utils/loader.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/utils 2025-05-07T20:09:40.0505330Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/utils/torch_library.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/utils 2025-05-07T20:09:40.0507171Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/cpu/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/cpu 2025-05-07T20:09:40.0508817Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/cpu/cpu_sll.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/cpu 2025-05-07T20:09:40.0510630Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/meta/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/meta 2025-05-07T20:09:40.0512293Z INFO:root:copying 
_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/meta/meta_sll.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/meta 2025-05-07T20:09:40.0514008Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton 2025-05-07T20:09:40.0515760Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/common.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton 2025-05-07T20:09:40.0517512Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton 2025-05-07T20:09:40.0519483Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton 2025-05-07T20:09:40.0521366Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton 2025-05-07T20:09:40.0523200Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton 2025-05-07T20:09:40.0524910Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton 2025-05-07T20:09:40.0526711Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton 
2025-05-07T20:09:40.0528469Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton 2025-05-07T20:09:40.0530165Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton 2025-05-07T20:09:40.0532071Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton 2025-05-07T20:09:40.0533810Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_softmax.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton 2025-05-07T20:09:40.0535459Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton 2025-05-07T20:09:40.0537408Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/bench/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/bench 2025-05-07T20:09:40.0539229Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/bench/bench_config.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/bench 2025-05-07T20:09:40.0540993Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/bench/bench_runs.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/bench 2025-05-07T20:09:40.0542833Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/bench/eeg_cli.py -> 
_skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/bench 2025-05-07T20:09:40.0544527Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/bench/embedding_ops_common_config.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/bench 2025-05-07T20:09:40.0546136Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/bench/eval_compression.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/bench 2025-05-07T20:09:40.0549737Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/bench/reporter.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/bench 2025-05-07T20:09:40.0551261Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/bench 2025-05-07T20:09:40.0552966Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_loader.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/bench 2025-05-07T20:09:40.0554735Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/bench 2025-05-07T20:09:40.0556367Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/bench/utils.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/bench 2025-05-07T20:09:40.0557895Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/cache/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/cache 2025-05-07T20:09:40.0559627Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py -> 
_skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/cache 2025-05-07T20:09:40.0561320Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/ssd/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:40.0562763Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/ssd/common.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:40.0564309Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/ssd/inference.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:40.0566107Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/ssd/training.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:40.0568857Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/utils/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/utils 2025-05-07T20:09:40.0570552Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/utils/common.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/utils 2025-05-07T20:09:40.0572183Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/utils/offsets.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/utils 2025-05-07T20:09:40.0573722Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/utils/quantize.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/utils 2025-05-07T20:09:40.0575337Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/utils/requests.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/utils 2025-05-07T20:09:40.0577085Z INFO:root:copying 
_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/stats/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/stats 2025-05-07T20:09:40.0578901Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/stats/bench_params_reporter.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/stats 2025-05-07T20:09:40.0580546Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/ssd/utils/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:40.0582562Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:40.0584380Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/triton/jagged/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/triton/jagged 2025-05-07T20:09:40.0586101Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/triton/jagged 2025-05-07T20:09:40.0606791Z INFO:skbuild:copied 90 files 2025-05-07T20:09:40.0607153Z INFO:root:running build_ext 2025-05-07T20:09:40.0609233Z INFO:root:installing to _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel 2025-05-07T20:09:40.0609782Z INFO:root:running install 2025-05-07T20:09:40.0664011Z INFO:root:running install_lib 2025-05-07T20:09:40.0665850Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel 2025-05-07T20:09:40.0667739Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu 2025-05-07T20:09:40.0668823Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/config 
2025-05-07T20:09:40.0670211Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/config/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/config 2025-05-07T20:09:40.0671895Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/config/feature_list.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/config 2025-05-07T20:09:40.0673096Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/docs 2025-05-07T20:09:40.0674284Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/docs/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:40.0675929Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/docs/common.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:40.0677479Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/docs/examples.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:40.0679094Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/docs/jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:40.0680788Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/docs/merge_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:40.0682617Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/docs/permute_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:40.0684309Z 
INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/docs/quantize_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:40.0685923Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/docs/sparse_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:40.0687482Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/docs/version.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:40.0688699Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/quantize 2025-05-07T20:09:40.0689946Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/quantize/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/quantize 2025-05-07T20:09:40.0691588Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/quantize/quantize_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/quantize 2025-05-07T20:09:40.0692799Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/sll 2025-05-07T20:09:40.0693556Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/sll/cpu 2025-05-07T20:09:40.0694712Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/cpu/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/cpu 2025-05-07T20:09:40.0696321Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/cpu/cpu_sll.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/cpu 2025-05-07T20:09:40.0697511Z INFO:root:creating 
_skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/sll/meta 2025-05-07T20:09:40.0698705Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/meta/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/meta 2025-05-07T20:09:40.0700419Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/meta/meta_sll.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/meta 2025-05-07T20:09:40.0701630Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/sll/triton 2025-05-07T20:09:40.0702887Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:40.0704799Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton/common.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:40.0706549Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:40.0708450Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:40.0710265Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton/triton_jagged_bmm.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:40.0712068Z INFO:root:copying 
_skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:40.0713917Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:40.0715882Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:40.0717835Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:40.0719720Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:40.0721658Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:40.0723535Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton/triton_jagged_softmax.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:40.0725402Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py -> 
_skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:40.0727142Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll 2025-05-07T20:09:40.0728297Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe 2025-05-07T20:09:40.0729113Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/bench 2025-05-07T20:09:40.0730361Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/bench/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:40.0732028Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/bench/bench_config.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:40.0733721Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/bench/bench_runs.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:40.0735414Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/bench/eeg_cli.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:40.0737153Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/bench/embedding_ops_common_config.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:40.0738908Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/bench/eval_compression.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 
2025-05-07T20:09:40.0740682Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/bench/reporter.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:40.0742385Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/bench/tbe_data_config.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:40.0744101Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/bench/tbe_data_config_loader.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:40.0745899Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:40.0747641Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/bench/utils.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:40.0748860Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/cache 2025-05-07T20:09:40.0750103Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/cache/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/cache 2025-05-07T20:09:40.0751827Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/cache 2025-05-07T20:09:40.0753155Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:40.0753972Z INFO:root:creating 
_skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:40.0755317Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/ssd/utils/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:40.0757122Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:40.0758926Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/ssd/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd 2025-05-07T20:09:40.0760546Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/ssd/common.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd 2025-05-07T20:09:40.0762193Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/ssd/inference.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd 2025-05-07T20:09:40.0763876Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/ssd/training.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd 2025-05-07T20:09:40.0765120Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/utils 2025-05-07T20:09:40.0766335Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/utils/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/utils 2025-05-07T20:09:40.0781624Z INFO:root:copying 
_skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/utils/common.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/utils 2025-05-07T20:09:40.0783485Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/utils/offsets.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/utils 2025-05-07T20:09:40.0785193Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/utils/quantize.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/utils 2025-05-07T20:09:40.0786890Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/utils/requests.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/utils 2025-05-07T20:09:40.0788125Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/stats 2025-05-07T20:09:40.0789370Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/stats/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/stats 2025-05-07T20:09:40.0791080Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/stats/bench_params_reporter.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/stats 2025-05-07T20:09:40.0792705Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe 2025-05-07T20:09:40.0793872Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/triton 2025-05-07T20:09:40.0794741Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/triton/jagged 
2025-05-07T20:09:40.0796012Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/triton/jagged/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton/jagged 2025-05-07T20:09:40.0797933Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton/jagged 2025-05-07T20:09:40.0799687Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/triton/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton 2025-05-07T20:09:40.0801303Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/triton/common.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton 2025-05-07T20:09:40.0802939Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/triton/quantize.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton 2025-05-07T20:09:40.0804628Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/triton/quantize_ref.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton 2025-05-07T20:09:40.0805868Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/utils 2025-05-07T20:09:40.0807155Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/utils/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/utils 2025-05-07T20:09:40.0808921Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/utils/filestore.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/utils 
2025-05-07T20:09:40.0810493Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/utils/loader.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/utils 2025-05-07T20:09:40.0812118Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/utils/torch_library.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/utils 2025-05-07T20:09:40.0813680Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/asmjit.so -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:40.0815117Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/fbgemm.so -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:40.0816634Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/fbgemm_gpu_tbe_cache.so -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:40.0943054Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/fbgemm_gpu_tbe_inference.so -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:40.3671639Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/fbgemm_gpu_config.so -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:40.3673379Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/fbgemm_gpu_tbe_utils.so -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:40.3781887Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/fbgemm_gpu_sparse_async_cumsum.so -> 
_skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:40.3791592Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/fbgemm_gpu_tbe_common.so -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:40.3815840Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/fbgemm_gpu_tbe_optimizers.so -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:40.3870027Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/fbgemm_gpu_tbe_training_forward.so -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:40.6040920Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_pt2.so -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:40.6085447Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/fbgemm_gpu_tbe_training_backward.so -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:41.1690551Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_gwd.so -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:41.2560167Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_vbe.so -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:41.4567926Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_dense.so -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:41.4925426Z 
INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_split_host.so -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:41.4946469Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/fbgemm_gpu_tbe_index_select.so -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:41.5162815Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:41.5164600Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adagrad.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:41.5166890Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:41.5169136Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_sgd.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:41.5171327Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adam.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:41.5173532Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lamb.py -> 
_skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:41.5175707Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lars_sgd.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:41.5178153Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_adam.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:41.5180590Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_lamb.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:41.5183079Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_none.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:41.5185343Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_with_counter.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:41.5187754Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_ssd.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:41.5190048Z 
INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_codegen_lookup_invokers/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:41.5192186Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:41.5194389Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args_ssd.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:41.5196012Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:09:41.5197880Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_optimizer_codegen/split_embedding_optimizer_rowwise_adagrad.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:09:41.5199927Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_optimizer_codegen/optimizer_args.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:09:41.5201702Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/fbgemm_gpu_embedding_inplace_ops.so -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:41.5203202Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/fbgemm_gpu_py.so -> 
_skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:41.5642881Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:41.5644748Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/batched_unary_embeddings_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:41.5646682Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/enums.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:41.5648354Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/metrics.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:41.5650173Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/permute_pooled_embedding_modules.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:41.5652059Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/permute_pooled_embedding_modules_split.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:41.5653837Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/quantize_comm.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:41.5655580Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/quantize_utils.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:41.5657161Z INFO:root:copying 
_skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/runtime_monitor.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:41.5658753Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sparse_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:41.5660442Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_configs.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:41.5662235Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_inference_converter.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:41.5663996Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_optimizer_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:41.5665750Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_utils.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:41.5667361Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:41.5669077Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_table_batched_embeddings_ops_common.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:41.5670840Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_table_batched_embeddings_ops_inference.py -> 
_skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:41.5672584Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_table_batched_embeddings_ops_training.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:41.5674383Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:41.5676190Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:41.5677874Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe_input_multiplexer.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:41.5679389Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/uvm.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:41.5680294Z INFO:skbuild:copied 125 files 2025-05-07T20:09:41.5680606Z INFO:root:running install_egg_info 2025-05-07T20:09:41.5708485Z INFO:root:running egg_info 2025-05-07T20:09:41.5741505Z INFO:root:writing fbgemm_gpu_nightly.egg-info/PKG-INFO 2025-05-07T20:09:41.5745303Z INFO:root:writing dependency_links to fbgemm_gpu_nightly.egg-info/dependency_links.txt 2025-05-07T20:09:41.5747928Z INFO:root:writing requirements to fbgemm_gpu_nightly.egg-info/requires.txt 2025-05-07T20:09:41.5749446Z INFO:root:writing top-level names to fbgemm_gpu_nightly.egg-info/top_level.txt 2025-05-07T20:09:41.5852339Z INFO:root:reading manifest file 'fbgemm_gpu_nightly.egg-info/SOURCES.txt' 2025-05-07T20:09:41.5892118Z INFO:root:writing 
manifest file 'fbgemm_gpu_nightly.egg-info/SOURCES.txt' 2025-05-07T20:09:41.5893298Z INFO:root:Copying fbgemm_gpu_nightly.egg-info to _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu_nightly-2025.5.7-py3.9.egg-info 2025-05-07T20:09:41.5899959Z INFO:root:running install_scripts 2025-05-07T20:09:41.5900707Z INFO:skbuild:copied 0 files 2025-05-07T20:09:44.3239349Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu_nightly-2025.5.7.dist-info/WHEEL 2025-05-07T20:09:44.3241441Z INFO:wheel:creating '/__w/FBGEMM/FBGEMM/fbgemm_gpu/dist/.tmp-wvfgktr9/fbgemm_gpu_nightly-2025.5.7-cp39-cp39-manylinux_2_28_x86_64.whl' and adding '_skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel' to it 2025-05-07T20:09:44.3245831Z INFO:wheel:adding 'fbgemm_gpu/__init__.py' 2025-05-07T20:09:44.3521778Z INFO:wheel:adding 'fbgemm_gpu/asmjit.so' 2025-05-07T20:09:44.3534982Z INFO:wheel:adding 'fbgemm_gpu/batched_unary_embeddings_ops.py' 2025-05-07T20:09:44.3535439Z INFO:wheel:adding 'fbgemm_gpu/enums.py' 2025-05-07T20:09:44.5142326Z INFO:wheel:adding 'fbgemm_gpu/fbgemm.so' 2025-05-07T20:09:44.5277281Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_config.so' 2025-05-07T20:09:44.5414229Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_embedding_inplace_ops.so' 2025-05-07T20:09:46.3069171Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_py.so' 2025-05-07T20:09:46.5121404Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_sparse_async_cumsum.so' 2025-05-07T20:09:47.2278350Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_cache.so' 2025-05-07T20:09:47.3410478Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_common.so' 2025-05-07T20:09:47.9351603Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_index_select.so' 2025-05-07T20:10:06.0672260Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_inference.so' 2025-05-07T20:10:07.3359284Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_optimizers.so' 2025-05-07T20:10:35.4481116Z INFO:wheel:adding 
'fbgemm_gpu/fbgemm_gpu_tbe_training_backward.so' 2025-05-07T20:10:38.2718748Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward_dense.so' 2025-05-07T20:10:41.9278382Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward_gwd.so' 2025-05-07T20:10:42.5203073Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward_pt2.so' 2025-05-07T20:10:42.6945074Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward_split_host.so' 2025-05-07T20:10:51.4183298Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward_vbe.so' 2025-05-07T20:11:02.5057134Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_forward.so' 2025-05-07T20:11:03.9768457Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_utils.so' 2025-05-07T20:11:04.0128794Z INFO:wheel:adding 'fbgemm_gpu/metrics.py' 2025-05-07T20:11:04.0129302Z INFO:wheel:adding 'fbgemm_gpu/permute_pooled_embedding_modules.py' 2025-05-07T20:11:04.0130006Z INFO:wheel:adding 'fbgemm_gpu/permute_pooled_embedding_modules_split.py' 2025-05-07T20:11:04.0134315Z INFO:wheel:adding 'fbgemm_gpu/quantize_comm.py' 2025-05-07T20:11:04.0136369Z INFO:wheel:adding 'fbgemm_gpu/quantize_utils.py' 2025-05-07T20:11:04.0139555Z INFO:wheel:adding 'fbgemm_gpu/runtime_monitor.py' 2025-05-07T20:11:04.0153765Z INFO:wheel:adding 'fbgemm_gpu/sparse_ops.py' 2025-05-07T20:11:04.0154600Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_configs.py' 2025-05-07T20:11:04.0157715Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_inference_converter.py' 2025-05-07T20:11:04.0159460Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_optimizer_ops.py' 2025-05-07T20:11:04.0160935Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_utils.py' 2025-05-07T20:11:04.0162802Z INFO:wheel:adding 'fbgemm_gpu/split_table_batched_embeddings_ops.py' 2025-05-07T20:11:04.0166157Z INFO:wheel:adding 'fbgemm_gpu/split_table_batched_embeddings_ops_common.py' 2025-05-07T20:11:04.0188032Z INFO:wheel:adding 'fbgemm_gpu/split_table_batched_embeddings_ops_inference.py' 
2025-05-07T20:11:04.0230054Z INFO:wheel:adding 'fbgemm_gpu/split_table_batched_embeddings_ops_training.py' 2025-05-07T20:11:04.0235320Z INFO:wheel:adding 'fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py' 2025-05-07T20:11:04.0240314Z INFO:wheel:adding 'fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py' 2025-05-07T20:11:04.0240853Z INFO:wheel:adding 'fbgemm_gpu/tbe_input_multiplexer.py' 2025-05-07T20:11:04.0241289Z INFO:wheel:adding 'fbgemm_gpu/uvm.py' 2025-05-07T20:11:04.0242649Z INFO:wheel:adding 'fbgemm_gpu/config/__init__.py' 2025-05-07T20:11:04.0244619Z INFO:wheel:adding 'fbgemm_gpu/config/feature_list.py' 2025-05-07T20:11:04.0246503Z INFO:wheel:adding 'fbgemm_gpu/docs/__init__.py' 2025-05-07T20:11:04.0248037Z INFO:wheel:adding 'fbgemm_gpu/docs/common.py' 2025-05-07T20:11:04.0249886Z INFO:wheel:adding 'fbgemm_gpu/docs/examples.py' 2025-05-07T20:11:04.0252651Z INFO:wheel:adding 'fbgemm_gpu/docs/jagged_tensor_ops.py' 2025-05-07T20:11:04.0254433Z INFO:wheel:adding 'fbgemm_gpu/docs/merge_pooled_embedding_ops.py' 2025-05-07T20:11:04.0256867Z INFO:wheel:adding 'fbgemm_gpu/docs/permute_pooled_embedding_ops.py' 2025-05-07T20:11:04.0258533Z INFO:wheel:adding 'fbgemm_gpu/docs/quantize_ops.py' 2025-05-07T20:11:04.0264668Z INFO:wheel:adding 'fbgemm_gpu/docs/sparse_ops.py' 2025-05-07T20:11:04.0266715Z INFO:wheel:adding 'fbgemm_gpu/docs/version.py' 2025-05-07T20:11:04.0268720Z INFO:wheel:adding 'fbgemm_gpu/quantize/__init__.py' 2025-05-07T20:11:04.0270513Z INFO:wheel:adding 'fbgemm_gpu/quantize/quantize_ops.py' 2025-05-07T20:11:04.0272765Z INFO:wheel:adding 'fbgemm_gpu/sll/__init__.py' 2025-05-07T20:11:04.0274829Z INFO:wheel:adding 'fbgemm_gpu/sll/cpu/__init__.py' 2025-05-07T20:11:04.0281252Z INFO:wheel:adding 'fbgemm_gpu/sll/cpu/cpu_sll.py' 2025-05-07T20:11:04.0284092Z INFO:wheel:adding 'fbgemm_gpu/sll/meta/__init__.py' 2025-05-07T20:11:04.0286887Z INFO:wheel:adding 'fbgemm_gpu/sll/meta/meta_sll.py' 2025-05-07T20:11:04.0289263Z INFO:wheel:adding 
'fbgemm_gpu/sll/triton/__init__.py' 2025-05-07T20:11:04.0291545Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/common.py' 2025-05-07T20:11:04.0293585Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py' 2025-05-07T20:11:04.0296217Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py' 2025-05-07T20:11:04.0299983Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_bmm.py' 2025-05-07T20:11:04.0304237Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py' 2025-05-07T20:11:04.0306377Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py' 2025-05-07T20:11:04.0308886Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py' 2025-05-07T20:11:04.0314309Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py' 2025-05-07T20:11:04.0319870Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py' 2025-05-07T20:11:04.0322077Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py' 2025-05-07T20:11:04.0326070Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_softmax.py' 2025-05-07T20:11:04.0331525Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py' 2025-05-07T20:11:04.0334328Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/__init__.py' 2025-05-07T20:11:04.0337291Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adagrad.py' 2025-05-07T20:11:04.0341338Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adam.py' 2025-05-07T20:11:04.0343641Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args.py' 2025-05-07T20:11:04.0345541Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args_ssd.py' 2025-05-07T20:11:04.0348664Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lamb.py' 
2025-05-07T20:11:04.0352072Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lars_sgd.py' 2025-05-07T20:11:04.0355059Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_none.py' 2025-05-07T20:11:04.0358479Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_adam.py' 2025-05-07T20:11:04.0361630Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_lamb.py' 2025-05-07T20:11:04.0364914Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad.py' 2025-05-07T20:11:04.0368108Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_ssd.py' 2025-05-07T20:11:04.0371717Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_with_counter.py' 2025-05-07T20:11:04.0374601Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_sgd.py' 2025-05-07T20:11:04.0376834Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_optimizer_codegen/optimizer_args.py' 2025-05-07T20:11:04.0379508Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_optimizer_codegen/split_embedding_optimizer_rowwise_adagrad.py' 2025-05-07T20:11:04.0381352Z INFO:wheel:adding 'fbgemm_gpu/tbe/__init__.py' 2025-05-07T20:11:04.0383688Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/__init__.py' 2025-05-07T20:11:04.0385938Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/bench_config.py' 2025-05-07T20:11:04.0390840Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/bench_runs.py' 2025-05-07T20:11:04.0393613Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/eeg_cli.py' 2025-05-07T20:11:04.0395973Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/embedding_ops_common_config.py' 2025-05-07T20:11:04.0398043Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/eval_compression.py' 2025-05-07T20:11:04.0399683Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/reporter.py' 2025-05-07T20:11:04.0403079Z INFO:wheel:adding 
'fbgemm_gpu/tbe/bench/tbe_data_config.py' 2025-05-07T20:11:04.0405802Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/tbe_data_config_loader.py' 2025-05-07T20:11:04.0408627Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py' 2025-05-07T20:11:04.0410367Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/utils.py' 2025-05-07T20:11:04.0412250Z INFO:wheel:adding 'fbgemm_gpu/tbe/cache/__init__.py' 2025-05-07T20:11:04.0413949Z INFO:wheel:adding 'fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py' 2025-05-07T20:11:04.0415609Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/__init__.py' 2025-05-07T20:11:04.0417114Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/common.py' 2025-05-07T20:11:04.0423326Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/inference.py' 2025-05-07T20:11:04.0450462Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/training.py' 2025-05-07T20:11:04.0451986Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/utils/__init__.py' 2025-05-07T20:11:04.0454802Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py' 2025-05-07T20:11:04.0456646Z INFO:wheel:adding 'fbgemm_gpu/tbe/stats/__init__.py' 2025-05-07T20:11:04.0459779Z INFO:wheel:adding 'fbgemm_gpu/tbe/stats/bench_params_reporter.py' 2025-05-07T20:11:04.0461688Z INFO:wheel:adding 'fbgemm_gpu/tbe/utils/__init__.py' 2025-05-07T20:11:04.0463526Z INFO:wheel:adding 'fbgemm_gpu/tbe/utils/common.py' 2025-05-07T20:11:04.0465135Z INFO:wheel:adding 'fbgemm_gpu/tbe/utils/offsets.py' 2025-05-07T20:11:04.0467809Z INFO:wheel:adding 'fbgemm_gpu/tbe/utils/quantize.py' 2025-05-07T20:11:04.0473416Z INFO:wheel:adding 'fbgemm_gpu/tbe/utils/requests.py' 2025-05-07T20:11:04.0475860Z INFO:wheel:adding 'fbgemm_gpu/triton/__init__.py' 2025-05-07T20:11:04.0477673Z INFO:wheel:adding 'fbgemm_gpu/triton/common.py' 2025-05-07T20:11:04.0485868Z INFO:wheel:adding 'fbgemm_gpu/triton/quantize.py' 2025-05-07T20:11:04.0490493Z INFO:wheel:adding 'fbgemm_gpu/triton/quantize_ref.py' 2025-05-07T20:11:04.0492655Z INFO:wheel:adding 
'fbgemm_gpu/triton/jagged/__init__.py' 2025-05-07T20:11:04.0500857Z INFO:wheel:adding 'fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py' 2025-05-07T20:11:04.0503342Z INFO:wheel:adding 'fbgemm_gpu/utils/__init__.py' 2025-05-07T20:11:04.0505650Z INFO:wheel:adding 'fbgemm_gpu/utils/filestore.py' 2025-05-07T20:11:04.0507466Z INFO:wheel:adding 'fbgemm_gpu/utils/loader.py' 2025-05-07T20:11:04.0509640Z INFO:wheel:adding 'fbgemm_gpu/utils/torch_library.py' 2025-05-07T20:11:04.0512641Z INFO:wheel:adding 'fbgemm_gpu_nightly-2025.5.7.dist-info/METADATA' 2025-05-07T20:11:04.0513672Z INFO:wheel:adding 'fbgemm_gpu_nightly-2025.5.7.dist-info/WHEEL' 2025-05-07T20:11:04.0514656Z INFO:wheel:adding 'fbgemm_gpu_nightly-2025.5.7.dist-info/top_level.txt' 2025-05-07T20:11:04.0521279Z INFO:wheel:adding 'fbgemm_gpu_nightly-2025.5.7.dist-info/RECORD' 2025-05-07T20:11:04.0525377Z INFO:root:removing _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel 2025-05-07T20:11:04.2177581Z ╒════════════════════════════╤════════════════════════════════════════════════╕ 2025-05-07T20:11:04.2178192Z │ │ Version │ 2025-05-07T20:11:04.2178776Z ╞════════════════════════════╪════════════════════════════════════════════════╡ 2025-05-07T20:11:04.2179393Z │ PyTorch │ 2.8.0.dev20250507+cu126 │ 2025-05-07T20:11:04.2180045Z ├────────────────────────────┼────────────────────────────────────────────────┤ 2025-05-07T20:11:04.2180616Z │ CUDA (Declared by PyTorch) │ 12.6 │ 2025-05-07T20:11:04.2181190Z ├────────────────────────────┼────────────────────────────────────────────────┤ 2025-05-07T20:11:04.2181715Z │ CUDA (Actual) │ nvcc: NVIDIA (R) Cuda compiler driver │ 2025-05-07T20:11:04.2182421Z │ │ Copyright (c) 2005-2024 NVIDIA Corporation │ 2025-05-07T20:11:04.2182912Z │ │ Built on Tue_Oct_29_23:50:19_PDT_2024 │ 2025-05-07T20:11:04.2183375Z │ │ Cuda compilation tools, release 12.6, V12.6.85 │ 2025-05-07T20:11:04.2183857Z │ │ Build cuda_12.6.r12.6/compiler.35059454_0 │ 2025-05-07T20:11:04.2184402Z 
╘════════════════════════════╧════════════════════════════════════════════════╛ 2025-05-07T20:11:04.5242623Z Successfully built fbgemm_gpu_nightly-2025.5.7-cp39-cp39-manylinux_2_28_x86_64.whl 2025-05-07T20:11:04.6110993Z 2025-05-07T20:11:04.6261525Z ################################################################################ 2025-05-07T20:11:04.6262021Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.9/cmake-build/asmjit.so 2025-05-07T20:11:04.6262475Z [CHECK] Listing out library size: 2025-05-07T20:11:04.6262899Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.9/cmake-build/asmjit.so 2025-05-07T20:11:04.6263263Z 2025-05-07T20:11:04.6275744Z 1 ./_skbuild/linux-x86_64-3.9/cmake-build/asmjit.so 2025-05-07T20:11:04.6276464Z 2025-05-07T20:11:04.6276964Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/asmjit.so 2025-05-07T20:11:04.6277861Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/asmjit.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:04.6278439Z 2025-05-07T20:11:04.6343312Z GLIBC_2.2.5 2025-05-07T20:11:04.6344281Z GLIBC_2.14 2025-05-07T20:11:04.6344796Z 2025-05-07T20:11:04.6345375Z 2025-05-07T20:11:04.6346523Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/asmjit.so 2025-05-07T20:11:04.6349283Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/asmjit.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:04.6350972Z 2025-05-07T20:11:04.6416579Z 2025-05-07T20:11:04.6416599Z 2025-05-07T20:11:04.6443897Z + nm -gDC ./_skbuild/linux-x86_64-3.9/cmake-build/asmjit.so > /tmp/tmp.tTuB5OidE6.symbols.txt 2025-05-07T20:11:04.6444346Z 2025-05-07T20:11:04.6482340Z 2025-05-07T20:11:04.6517561Z [CHECK] Total Number of symbols: 803 2025-05-07T20:11:04.6535005Z [CHECK] Number of fbgemm symbols: 0 2025-05-07T20:11:04.6562536Z + nm -gDCu 
./_skbuild/linux-x86_64-3.9/cmake-build/asmjit.so > /tmp/tmp.V4NXMXcahK.usymbols.txt 2025-05-07T20:11:04.6564201Z 2025-05-07T20:11:04.6580571Z 2025-05-07T20:11:04.6608020Z [CHECK] Listing out undefined symbols (49 total): 2025-05-07T20:11:04.6622162Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:04.6622764Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:04.6623122Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:04.6623488Z U __errno_location@GLIBC_2.2.5 2025-05-07T20:11:04.6623862Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:04.6624203Z U __popcountdi2@GCC_3.4 2025-05-07T20:11:04.6624552Z U abort@GLIBC_2.2.5 2025-05-07T20:11:04.6624850Z U close@GLIBC_2.2.5 2025-05-07T20:11:04.6625158Z U fputs@GLIBC_2.2.5 2025-05-07T20:11:04.6625550Z U free@GLIBC_2.2.5 2025-05-07T20:11:04.6625853Z U ftruncate64@GLIBC_2.2.5 2025-05-07T20:11:04.6626146Z U fwrite@GLIBC_2.2.5 2025-05-07T20:11:04.6626552Z U getenv@GLIBC_2.2.5 2025-05-07T20:11:04.6626847Z U getpagesize@GLIBC_2.2.5 2025-05-07T20:11:04.6627131Z U madvise@GLIBC_2.2.5 2025-05-07T20:11:04.6627418Z U malloc@GLIBC_2.2.5 2025-05-07T20:11:04.6627683Z U memcmp@GLIBC_2.2.5 2025-05-07T20:11:04.6627962Z U memcpy@GLIBC_2.14 2025-05-07T20:11:04.6628225Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:04.6628508Z U memset@GLIBC_2.2.5 2025-05-07T20:11:04.6628767Z U mmap@GLIBC_2.2.5 2025-05-07T20:11:04.6629060Z U mprotect@GLIBC_2.2.5 2025-05-07T20:11:04.6629352Z U munmap@GLIBC_2.2.5 2025-05-07T20:11:04.6629652Z U open64@GLIBC_2.2.5 2025-05-07T20:11:04.6630010Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:04.6630389Z U pthread_mutex_destroy@GLIBC_2.2.5 2025-05-07T20:11:04.6630745Z U pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:11:04.6631069Z U pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:11:04.6631407Z U read@GLIBC_2.2.5 2025-05-07T20:11:04.6631690Z U realloc@GLIBC_2.2.5 2025-05-07T20:11:04.6631991Z U shm_open 2025-05-07T20:11:04.6632239Z U shm_unlink 2025-05-07T20:11:04.6632526Z U snprintf@GLIBC_2.2.5 
2025-05-07T20:11:04.6632844Z U stderr@GLIBC_2.2.5 2025-05-07T20:11:04.6633129Z U strcmp@GLIBC_2.2.5 2025-05-07T20:11:04.6633433Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:04.6633714Z U strtol@GLIBC_2.2.5 2025-05-07T20:11:04.6634015Z U syscall@GLIBC_2.2.5 2025-05-07T20:11:04.6634299Z U sysconf@GLIBC_2.2.5 2025-05-07T20:11:04.6634601Z U uname@GLIBC_2.2.5 2025-05-07T20:11:04.6634876Z U unlink@GLIBC_2.2.5 2025-05-07T20:11:04.6635179Z U vsnprintf@GLIBC_2.2.5 2025-05-07T20:11:04.6635582Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:04.6636147Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:04.6636655Z U vtable for __cxxabiv1::__vmi_class_type_info@CXXABI_1.3 2025-05-07T20:11:04.6637036Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:04.6637381Z w _ITM_registerTMCloneTable 2025-05-07T20:11:04.6637693Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:04.6638010Z w __gmon_start__ 2025-05-07T20:11:04.6638396Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:04.6638797Z + ldd ./_skbuild/linux-x86_64-3.9/cmake-build/asmjit.so 2025-05-07T20:11:04.6639044Z 2025-05-07T20:11:04.6663838Z linux-vdso.so.1 (0x00007fff6b988000) 2025-05-07T20:11:04.6664291Z libtorch_cpu.so => not found 2025-05-07T20:11:04.6665078Z libtorch_cuda.so => not found 2025-05-07T20:11:04.6665620Z libtorch.so => not found 2025-05-07T20:11:04.6665998Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007fd2863ef000) 2025-05-07T20:11:04.6666483Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fd2863c1000) 2025-05-07T20:11:04.6666895Z libc.so.6 => /lib64/libc.so.6 (0x00007fd2861b7000) 2025-05-07T20:11:04.6667296Z libm.so.6 => /lib64/libm.so.6 (0x00007fd2860dc000) 2025-05-07T20:11:04.6667672Z /lib64/ld-linux-x86-64.so.2 (0x00007fd2866d2000) 2025-05-07T20:11:04.6667950Z 2025-05-07T20:11:04.6668073Z [CHECK] Displaying ELF information: 2025-05-07T20:11:04.6668466Z + readelf -d ./_skbuild/linux-x86_64-3.9/cmake-build/asmjit.so 2025-05-07T20:11:04.6668795Z 
2025-05-07T20:11:04.6701064Z 2025-05-07T20:11:04.6702770Z Dynamic section at offset 0x78e78 contains 33 entries: 2025-05-07T20:11:04.6703976Z Tag Type Name/Value 2025-05-07T20:11:04.6704476Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:04.6705066Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:04.6705712Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:04.6706287Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:04.6706829Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:04.6707374Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:04.6707902Z 0x000000000000000e (SONAME) Library soname: [asmjit.so] 2025-05-07T20:11:04.6708338Z 0x000000000000000c (INIT) 0x1a000 2025-05-07T20:11:04.6708711Z 0x000000000000000d (FINI) 0x5af2c 2025-05-07T20:11:04.6709055Z 0x0000000000000019 (INIT_ARRAY) 0x780a0 2025-05-07T20:11:04.6709432Z 0x000000000000001b (INIT_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:04.6709789Z 0x000000000000001a (FINI_ARRAY) 0x780a8 2025-05-07T20:11:04.6710188Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:04.6710573Z 0x0000000000000004 (HASH) 0x200 2025-05-07T20:11:04.6710925Z 0x000000006ffffef5 (GNU_HASH) 0x1e18 2025-05-07T20:11:04.6711301Z 0x0000000000000005 (STRTAB) 0x86e0 2025-05-07T20:11:04.6711652Z 0x0000000000000006 (SYMTAB) 0x3b80 2025-05-07T20:11:04.6712051Z 0x000000000000000a (STRSZ) 45341 (bytes) 2025-05-07T20:11:04.6712434Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:04.6712825Z 0x0000000000000003 (PLTGOT) 0x790d8 2025-05-07T20:11:04.6713228Z 0x0000000000000002 (PLTRELSZ) 8064 (bytes) 2025-05-07T20:11:04.6713601Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:04.6713971Z 0x0000000000000017 (JMPREL) 0x17220 2025-05-07T20:11:04.6714322Z 0x0000000000000007 (RELA) 0x13ed8 2025-05-07T20:11:04.6714726Z 0x0000000000000008 (RELASZ) 13128 (bytes) 
2025-05-07T20:11:04.6715108Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:04.6715491Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:04.6715999Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:04.6716402Z 0x000000006ffffffe (VERNEED) 0x13e48 2025-05-07T20:11:04.6716842Z 0x000000006fffffff (VERNEEDNUM) 3 2025-05-07T20:11:04.6717204Z 0x000000006ffffff0 (VERSYM) 0x137fe 2025-05-07T20:11:04.6717576Z 0x000000006ffffff9 (RELACOUNT) 3 2025-05-07T20:11:04.6717911Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:04.6718151Z 2025-05-07T20:11:04.6719852Z ################################################################################ 2025-05-07T20:11:04.6720103Z 2025-05-07T20:11:04.6720107Z 2025-05-07T20:11:04.6720264Z ################################################################################ 2025-05-07T20:11:04.6720767Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:11:04.6721282Z [CHECK] Listing out library size: 2025-05-07T20:11:04.6721779Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:11:04.6722164Z 2025-05-07T20:11:04.6722353Z 1 ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:11:04.6722651Z 2025-05-07T20:11:04.6723052Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:11:04.6724023Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_config.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:04.6724641Z 2025-05-07T20:11:04.6765817Z GLIBC_2.2.5 2025-05-07T20:11:04.6766471Z GLIBC_2.14 2025-05-07T20:11:04.6766857Z 2025-05-07T20:11:04.6766871Z 2025-05-07T20:11:04.6768049Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:11:04.6771050Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_config.so 
| grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:04.6772398Z 2025-05-07T20:11:04.6818819Z GLIBCXX_3.4 2025-05-07T20:11:04.6819737Z GLIBCXX_3.4.9 2025-05-07T20:11:04.6820404Z GLIBCXX_3.4.21 2025-05-07T20:11:04.6820779Z 2025-05-07T20:11:04.6820792Z 2025-05-07T20:11:04.6839207Z + nm -gDC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_config.so > /tmp/tmp.wzQIxUqUSp.symbols.txt 2025-05-07T20:11:04.6840650Z 2025-05-07T20:11:04.6855659Z 2025-05-07T20:11:04.6881804Z [CHECK] Total Number of symbols: 107 2025-05-07T20:11:04.6901244Z [CHECK] Number of fbgemm symbols: 4 2025-05-07T20:11:04.6923537Z + nm -gDCu ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_config.so > /tmp/tmp.1k6e0I4Ik0.usymbols.txt 2025-05-07T20:11:04.6924081Z 2025-05-07T20:11:04.6939508Z 2025-05-07T20:11:04.6964552Z [CHECK] Listing out undefined symbols (57 total): 2025-05-07T20:11:04.6980718Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:04.6981328Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:04.6981688Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:04.6982240Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:04.6982610Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:04.6982969Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:04.6983310Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:04.6983666Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:04.6983991Z U __errno_location@GLIBC_2.2.5 2025-05-07T20:11:04.6984357Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:04.6984690Z U c10::BoolType::get() 2025-05-07T20:11:04.6985006Z U c10::StringType::get() 2025-05-07T20:11:04.6985336Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:04.6986121Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:04.6987546Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 
2025-05-07T20:11:04.6988481Z U getenv@GLIBC_2.2.5 2025-05-07T20:11:04.6988805Z U memcmp@GLIBC_2.2.5 2025-05-07T20:11:04.6989117Z U memcpy@GLIBC_2.14 2025-05-07T20:11:04.6989402Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:04.6989747Z U memset@GLIBC_2.2.5 2025-05-07T20:11:04.6990084Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:04.6990488Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:04.6990889Z U std::_Rb_tree_decrement(std::_Rb_tree_node_base*)@GLIBCXX_3.4 2025-05-07T20:11:04.6991568Z U std::_Rb_tree_insert_and_rebalance(bool, std::_Rb_tree_node_base*, std::_Rb_tree_node_base*, std::_Rb_tree_node_base&)@GLIBCXX_3.4 2025-05-07T20:11:04.6992492Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:04.6993096Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:04.6993491Z U std::__throw_invalid_argument(char const*)@GLIBCXX_3.4 2025-05-07T20:11:04.6993894Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:04.6994302Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:04.6994708Z U std::__throw_out_of_range(char const*)@GLIBCXX_3.4 2025-05-07T20:11:04.6995295Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:11:04.6996330Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:04.6997141Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:04.6997495Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:04.6997865Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:11:04.6998200Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:04.6998552Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:11:04.6998884Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:04.6999218Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:04.6999524Z U strtol@GLIBC_2.2.5 2025-05-07T20:11:04.6999832Z U torch::CppFunction::~CppFunction() 
2025-05-07T20:11:04.7000656Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:04.7001837Z U torch::Library::_def(std::variant&&, torch::CppFunction&&, std::vector > const&) & 2025-05-07T20:11:04.7002841Z U torch::jit::parseSchemaOrName(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:04.7003508Z U typeinfo for std::invalid_argument@GLIBCXX_3.4 2025-05-07T20:11:04.7003915Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:04.7004356Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:04.7004820Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:04.7005430Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:04.7006261Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:04.7006916Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:11:04.7007474Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:04.7007941Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:04.7008263Z w _ITM_registerTMCloneTable 2025-05-07T20:11:04.7008604Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:04.7008932Z w __gmon_start__ 2025-05-07T20:11:04.7009232Z w __pthread_key_create 2025-05-07T20:11:04.7009569Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:04.7010026Z + ldd ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:11:04.7010317Z 2025-05-07T20:11:04.7020244Z linux-vdso.so.1 (0x00007ffe43f04000) 2025-05-07T20:11:04.7021071Z libc10.so => not found 2025-05-07T20:11:04.7021388Z libtorch_cpu.so => not found 2025-05-07T20:11:04.7021687Z libtorch_cuda.so => not found 2025-05-07T20:11:04.7022007Z libtorch.so => not found 2025-05-07T20:11:04.7022352Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007fa27c952000) 2025-05-07T20:11:04.7022827Z libgcc_s.so.1 => 
/lib64/libgcc_s.so.1 (0x00007fa27c922000) 2025-05-07T20:11:04.7023265Z libc.so.6 => /lib64/libc.so.6 (0x00007fa27c71a000) 2025-05-07T20:11:04.7023648Z libm.so.6 => /lib64/libm.so.6 (0x00007fa27c63f000) 2025-05-07T20:11:04.7024064Z /lib64/ld-linux-x86-64.so.2 (0x00007fa27cbc6000) 2025-05-07T20:11:04.7024325Z 2025-05-07T20:11:04.7024446Z [CHECK] Displaying ELF information: 2025-05-07T20:11:04.7024915Z + readelf -d ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:11:04.7025257Z 2025-05-07T20:11:04.7051950Z 2025-05-07T20:11:04.7052459Z Dynamic section at offset 0xab00 contains 34 entries: 2025-05-07T20:11:04.7053504Z Tag Type Name/Value 2025-05-07T20:11:04.7054115Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:04.7054670Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:04.7055260Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:04.7055812Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:04.7056386Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:04.7056944Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:04.7057468Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:04.7058024Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_config.so] 2025-05-07T20:11:04.7058484Z 0x000000000000000c (INIT) 0x4000 2025-05-07T20:11:04.7058841Z 0x000000000000000d (FINI) 0x817c 2025-05-07T20:11:04.7059184Z 0x0000000000000019 (INIT_ARRAY) 0xaa58 2025-05-07T20:11:04.7059701Z 0x000000000000001b (INIT_ARRAYSZ) 16 (bytes) 2025-05-07T20:11:04.7060069Z 0x000000000000001a (FINI_ARRAY) 0xaa68 2025-05-07T20:11:04.7060532Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:04.7060905Z 0x0000000000000004 (HASH) 0x200 2025-05-07T20:11:04.7061250Z 0x000000006ffffef5 (GNU_HASH) 0x700 2025-05-07T20:11:04.7061615Z 0x0000000000000005 (STRTAB) 0x13b0 
2025-05-07T20:11:04.7061950Z 0x0000000000000006 (SYMTAB) 0x990 2025-05-07T20:11:04.7062334Z 0x000000000000000a (STRSZ) 6889 (bytes) 2025-05-07T20:11:04.7062706Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:04.7063089Z 0x0000000000000003 (PLTGOT) 0xad70 2025-05-07T20:11:04.7063466Z 0x0000000000000002 (PLTRELSZ) 1272 (bytes) 2025-05-07T20:11:04.7063859Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:04.7064224Z 0x0000000000000017 (JMPREL) 0x34a8 2025-05-07T20:11:04.7066225Z 0x0000000000000007 (RELA) 0x3028 2025-05-07T20:11:04.7066633Z 0x0000000000000008 (RELASZ) 1152 (bytes) 2025-05-07T20:11:04.7067050Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:04.7067416Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:04.7067768Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:04.7068157Z 0x000000006ffffffe (VERNEED) 0x2f78 2025-05-07T20:11:04.7068534Z 0x000000006fffffff (VERNEEDNUM) 3 2025-05-07T20:11:04.7068905Z 0x000000006ffffff0 (VERSYM) 0x2e9a 2025-05-07T20:11:04.7069265Z 0x000000006ffffff9 (RELACOUNT) 4 2025-05-07T20:11:04.7069591Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:04.7069814Z 2025-05-07T20:11:04.7069958Z ################################################################################ 2025-05-07T20:11:04.7070229Z 2025-05-07T20:11:04.7070234Z 2025-05-07T20:11:04.7070355Z ################################################################################ 2025-05-07T20:11:04.7070838Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm.so 2025-05-07T20:11:04.7071310Z [CHECK] Listing out library size: 2025-05-07T20:11:04.7071724Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm.so 2025-05-07T20:11:04.7072047Z 2025-05-07T20:11:04.7072228Z 6 ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm.so 2025-05-07T20:11:04.7072480Z 2025-05-07T20:11:04.7072826Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm.so 2025-05-07T20:11:04.7073749Z + objdump -TC 
./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:04.7074302Z 2025-05-07T20:11:04.7326884Z GLIBC_2.2.5 2025-05-07T20:11:04.7327255Z GLIBC_2.3 2025-05-07T20:11:04.7328073Z GLIBC_2.14 2025-05-07T20:11:04.7328312Z 2025-05-07T20:11:04.7328317Z 2025-05-07T20:11:04.7328702Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm.so 2025-05-07T20:11:04.7329780Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:04.7330530Z 2025-05-07T20:11:04.7582836Z GLIBCXX_3.4 2025-05-07T20:11:04.7583124Z GLIBCXX_3.4.9 2025-05-07T20:11:04.7583360Z GLIBCXX_3.4.11 2025-05-07T20:11:04.7583625Z GLIBCXX_3.4.14 2025-05-07T20:11:04.7583849Z GLIBCXX_3.4.15 2025-05-07T20:11:04.7584101Z GLIBCXX_3.4.18 2025-05-07T20:11:04.7584335Z GLIBCXX_3.4.21 2025-05-07T20:11:04.7584505Z 2025-05-07T20:11:04.7584509Z 2025-05-07T20:11:04.7605055Z + nm -gDC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm.so > /tmp/tmp.aZEy34dS9t.symbols.txt 2025-05-07T20:11:04.7605530Z 2025-05-07T20:11:04.7822603Z 2025-05-07T20:11:04.7856450Z [CHECK] Total Number of symbols: 4871 2025-05-07T20:11:04.7874512Z [CHECK] Number of fbgemm symbols: 3365 2025-05-07T20:11:04.7889715Z + nm -gDCu ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm.so > /tmp/tmp.0jFA3nZagK.usymbols.txt 2025-05-07T20:11:04.7891034Z 2025-05-07T20:11:04.7918671Z 2025-05-07T20:11:04.7942658Z [CHECK] Listing out undefined symbols (135 total): 2025-05-07T20:11:04.7959670Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:04.7960457Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:11:04.7960850Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:04.7961193Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:04.7961577Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:04.7961922Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:11:04.7962295Z U 
__cxa_guard_abort@CXXABI_1.3 2025-05-07T20:11:04.7962636Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:04.7963016Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:04.7963420Z U __cxa_init_primary_exception@CXXABI_1.3.11 2025-05-07T20:11:04.7963797Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:04.7964414Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:11:04.7964979Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:11:04.7965477Z U __cxa_throw_bad_array_new_length@CXXABI_1.3.8 2025-05-07T20:11:04.7965831Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:04.7966183Z U __once_proxy@GLIBCXX_3.4.11 2025-05-07T20:11:04.7966498Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:11:04.7966867Z U abort@GLIBC_2.2.5 2025-05-07T20:11:04.7967312Z U asmjit::_abi_1_13::BaseAssembler::bind(asmjit::_abi_1_13::Label const&) 2025-05-07T20:11:04.7967770Z U asmjit::_abi_1_13::BaseAssembler::newLabel() 2025-05-07T20:11:04.7968313Z U asmjit::_abi_1_13::BaseEmitter::_emitI(unsigned int, asmjit::_abi_1_13::Operand_ const&) 2025-05-07T20:11:04.7969094Z U asmjit::_abi_1_13::BaseEmitter::_emitI(unsigned int, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&) 2025-05-07T20:11:04.7970084Z U asmjit::_abi_1_13::BaseEmitter::_emitI(unsigned int, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&) 2025-05-07T20:11:04.7971277Z U asmjit::_abi_1_13::BaseEmitter::_emitI(unsigned int, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&) 2025-05-07T20:11:04.7972427Z U asmjit::_abi_1_13::BaseEmitter::emitArgsAssignment(asmjit::_abi_1_13::FuncFrame const&, asmjit::_abi_1_13::FuncArgsAssignment const&) 2025-05-07T20:11:04.7973212Z U asmjit::_abi_1_13::BaseEmitter::emitEpilog(asmjit::_abi_1_13::FuncFrame const&) 2025-05-07T20:11:04.7973837Z U asmjit::_abi_1_13::BaseEmitter::emitProlog(asmjit::_abi_1_13::FuncFrame const&) 2025-05-07T20:11:04.7974454Z U 
asmjit::_abi_1_13::CodeHolder::CodeHolder(asmjit::_abi_1_13::Support::Temporary const*) 2025-05-07T20:11:04.7975128Z U asmjit::_abi_1_13::CodeHolder::init(asmjit::_abi_1_13::Environment const&, unsigned long) 2025-05-07T20:11:04.7975677Z U asmjit::_abi_1_13::CodeHolder::~CodeHolder() 2025-05-07T20:11:04.7976217Z U asmjit::_abi_1_13::FuncArgsAssignment::updateFuncFrame(asmjit::_abi_1_13::FuncFrame&) const 2025-05-07T20:11:04.7977001Z U asmjit::_abi_1_13::FuncDetail::init(asmjit::_abi_1_13::FuncSignature const&, asmjit::_abi_1_13::Environment const&) 2025-05-07T20:11:04.7977622Z U asmjit::_abi_1_13::FuncFrame::finalize() 2025-05-07T20:11:04.7978067Z U asmjit::_abi_1_13::FuncFrame::init(asmjit::_abi_1_13::FuncDetail const&) 2025-05-07T20:11:04.7978720Z U asmjit::_abi_1_13::JitRuntime::JitRuntime(asmjit::_abi_1_13::JitAllocator::CreateParams const*) 2025-05-07T20:11:04.7979478Z U asmjit::_abi_1_13::JitRuntime::_add(void**, asmjit::_abi_1_13::CodeHolder*) 2025-05-07T20:11:04.7980184Z U asmjit::_abi_1_13::JitRuntime::~JitRuntime() 2025-05-07T20:11:04.7980814Z U asmjit::_abi_1_13::x86::Assembler::Assembler(asmjit::_abi_1_13::CodeHolder*) 2025-05-07T20:11:04.7981318Z U asmjit::_abi_1_13::x86::Assembler::~Assembler() 2025-05-07T20:11:04.7981728Z U cpuinfo_get_packages 2025-05-07T20:11:04.7982299Z U cpuinfo_get_packages_count 2025-05-07T20:11:04.7982705Z U cpuinfo_initialize 2025-05-07T20:11:04.7983039Z U cpuinfo_isa 2025-05-07T20:11:04.7983328Z U fma@GLIBC_2.2.5 2025-05-07T20:11:04.7983658Z U fmaf@GLIBC_2.2.5 2025-05-07T20:11:04.7983953Z U fminf@GLIBC_2.2.5 2025-05-07T20:11:04.7984279Z U free@GLIBC_2.2.5 2025-05-07T20:11:04.7984581Z U fwrite@GLIBC_2.2.5 2025-05-07T20:11:04.7984915Z U getenv@GLIBC_2.2.5 2025-05-07T20:11:04.7985309Z U log2@GLIBC_2.2.5 2025-05-07T20:11:04.7985638Z U log2f@GLIBC_2.2.5 2025-05-07T20:11:04.7985969Z U lrintf@GLIBC_2.2.5 2025-05-07T20:11:04.7986269Z U memcmp@GLIBC_2.2.5 2025-05-07T20:11:04.7986593Z U memcpy@GLIBC_2.14 2025-05-07T20:11:04.7986895Z U 
memmove@GLIBC_2.2.5 2025-05-07T20:11:04.7987231Z U memset@GLIBC_2.2.5 2025-05-07T20:11:04.7987574Z U nearbyint@GLIBC_2.2.5 2025-05-07T20:11:04.7988039Z U nearbyintf@GLIBC_2.2.5 2025-05-07T20:11:04.7988393Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:04.7988809Z U operator delete[](void*)@GLIBCXX_3.4 2025-05-07T20:11:04.7989193Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:04.7989589Z U operator new[](unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:04.7989967Z U posix_memalign@GLIBC_2.2.5 2025-05-07T20:11:04.7990275Z U pow@GLIBC_2.2.5 2025-05-07T20:11:04.7990575Z U sqrtf@GLIBC_2.2.5 2025-05-07T20:11:04.7990975Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:11:04.7991495Z U std::_Rb_tree_decrement(std::_Rb_tree_node_base*)@GLIBCXX_3.4 2025-05-07T20:11:04.7991975Z U std::_Rb_tree_increment(std::_Rb_tree_node_base*)@GLIBCXX_3.4 2025-05-07T20:11:04.7992627Z U std::_Rb_tree_insert_and_rebalance(bool, std::_Rb_tree_node_base*, std::_Rb_tree_node_base*, std::_Rb_tree_node_base&)@GLIBCXX_3.4 2025-05-07T20:11:04.7993384Z U std::__atomic_futex_unsigned_base::_M_futex_notify_all(unsigned int*)@GLIBCXX_3.4.21 2025-05-07T20:11:04.7994421Z U std::__atomic_futex_unsigned_base::_M_futex_wait_until(unsigned int*, unsigned int, bool, std::chrono::duration >, std::chrono::duration >)@GLIBCXX_3.4.21 2025-05-07T20:11:04.7995655Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:11:04.7996411Z U std::__detail::_Prime_rehash_policy::_M_next_bkt(unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:11:04.7996954Z U std::__exception_ptr::exception_ptr::_M_addref() 2025-05-07T20:11:04.7997360Z U std::__exception_ptr::exception_ptr::_M_release() 2025-05-07T20:11:04.7997863Z U std::__exception_ptr::exception_ptr::exception_ptr(void*)@CXXABI_1.3.11 2025-05-07T20:11:04.7998400Z U 
std::__future_base::_Result_base::_Result_base()@GLIBCXX_3.4.15 2025-05-07T20:11:04.7999051Z U std::__future_base::_Result_base::~_Result_base()@GLIBCXX_3.4.15 2025-05-07T20:11:04.7999510Z U std::__once_call@GLIBCXX_3.4.11 2025-05-07T20:11:04.7999860Z U std::__once_callable@GLIBCXX_3.4.11 2025-05-07T20:11:04.8000243Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:04.8000599Z U std::__throw_bad_array_new_length() 2025-05-07T20:11:04.8000970Z U std::__throw_bad_cast()@GLIBCXX_3.4 2025-05-07T20:11:04.8001364Z U std::__throw_bad_function_call()@GLIBCXX_3.4.14 2025-05-07T20:11:04.8001937Z U std::__throw_future_error(int)@GLIBCXX_3.4.14 2025-05-07T20:11:04.8002376Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:04.8002788Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:04.8003231Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:11:04.8003651Z U std::bad_alloc::~bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:04.8005847Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:04.8006734Z U std::cerr@GLIBCXX_3.4 2025-05-07T20:11:04.8007101Z U std::cout@GLIBCXX_3.4 2025-05-07T20:11:04.8007507Z U std::ctype::_M_widen_init() const@GLIBCXX_3.4.11 2025-05-07T20:11:04.8007956Z U std::future_category()@GLIBCXX_3.4.15 2025-05-07T20:11:04.8008348Z U std::future_error::~future_error()@GLIBCXX_3.4.14 2025-05-07T20:11:04.8008810Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:04.8009189Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:04.8009895Z U std::logic_error::logic_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:11:04.8010684Z U std::logic_error::logic_error(std::logic_error const&)@GLIBCXX_3.4.21 2025-05-07T20:11:04.8011263Z U std::ostream& std::ostream::_M_insert(double)@GLIBCXX_3.4.9 2025-05-07T20:11:04.8011811Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 
2025-05-07T20:11:04.8012379Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:04.8012898Z U std::ostream::flush()@GLIBCXX_3.4 2025-05-07T20:11:04.8013293Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:04.8013672Z U std::ostream::put(char)@GLIBCXX_3.4 2025-05-07T20:11:04.8014183Z U std::rethrow_exception(std::__exception_ptr::exception_ptr)@CXXABI_1.3.3 2025-05-07T20:11:04.8014745Z U std::runtime_error::runtime_error(char const*)@GLIBCXX_3.4.21 2025-05-07T20:11:04.8015354Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:11:04.8015854Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:04.8016200Z U stderr@GLIBC_2.2.5 2025-05-07T20:11:04.8016526Z U strcmp@GLIBC_2.2.5 2025-05-07T20:11:04.8016818Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:04.8017124Z U strstr@GLIBC_2.2.5 2025-05-07T20:11:04.8017407Z U tolower@GLIBC_2.2.5 2025-05-07T20:11:04.8017728Z U toupper@GLIBC_2.2.5 2025-05-07T20:11:04.8018101Z U typeinfo for std::__future_base::_Result_base@GLIBCXX_3.4.15 2025-05-07T20:11:04.8018539Z U typeinfo for std::bad_alloc@GLIBCXX_3.4 2025-05-07T20:11:04.8018938Z U typeinfo for std::future_error@GLIBCXX_3.4.14 2025-05-07T20:11:04.8019315Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:11:04.8020037Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:04.8020492Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:04.8020953Z U vtable for std::bad_alloc@GLIBCXX_3.4 2025-05-07T20:11:04.8021347Z U vtable for std::future_error@GLIBCXX_3.4.14 2025-05-07T20:11:04.8021763Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:04.8022150Z w _ITM_registerTMCloneTable 2025-05-07T20:11:04.8022493Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:04.8022854Z w __gmon_start__ 2025-05-07T20:11:04.8023163Z w __pthread_key_create 2025-05-07T20:11:04.8023538Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:11:04.8023908Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:11:04.8024283Z w 
pthread_once 2025-05-07T20:11:04.8024587Z w pthread_rwlock_rdlock 2025-05-07T20:11:04.8024951Z w pthread_rwlock_unlock 2025-05-07T20:11:04.8025316Z w pthread_rwlock_wrlock 2025-05-07T20:11:04.8025650Z w pthread_self@GLIBC_2.2.5 2025-05-07T20:11:04.8026060Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:04.8026538Z + ldd ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm.so 2025-05-07T20:11:04.8027001Z 2025-05-07T20:11:04.8027146Z linux-vdso.so.1 (0x00007ffe3a193000) 2025-05-07T20:11:04.8027468Z libc10.so => not found 2025-05-07T20:11:04.8028048Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/asmjit.so (0x00007f14d5785000) 2025-05-07T20:11:04.8028688Z libtorch.so => not found 2025-05-07T20:11:04.8028975Z libtorch_cpu.so => not found 2025-05-07T20:11:04.8029331Z libtorch_cuda.so => not found 2025-05-07T20:11:04.8029687Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f14d5521000) 2025-05-07T20:11:04.8030128Z libm.so.6 => /lib64/libm.so.6 (0x00007f14d5446000) 2025-05-07T20:11:04.8030529Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f14d5418000) 2025-05-07T20:11:04.8030957Z libc.so.6 => /lib64/libc.so.6 (0x00007f14d5210000) 2025-05-07T20:11:04.8031390Z /lib64/ld-linux-x86-64.so.2 (0x00007f14d5d48000) 2025-05-07T20:11:04.8031770Z libtorch_cpu.so => not found 2025-05-07T20:11:04.8032096Z libtorch_cuda.so => not found 2025-05-07T20:11:04.8032496Z libtorch.so => not found 2025-05-07T20:11:04.8032656Z 2025-05-07T20:11:04.8032797Z [CHECK] Displaying ELF information: 2025-05-07T20:11:04.8033168Z + readelf -d ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm.so 2025-05-07T20:11:04.8033472Z 2025-05-07T20:11:04.8041960Z 2025-05-07T20:11:04.8042642Z Dynamic section at offset 0x51fb38 contains 38 entries: 2025-05-07T20:11:04.8043463Z Tag Type Name/Value 2025-05-07T20:11:04.8043929Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:04.8044476Z 0x0000000000000001 (NEEDED) Shared library: [asmjit.so] 
2025-05-07T20:11:04.8045007Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:04.8045577Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:04.8046155Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:04.8046706Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:04.8047267Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:11:04.8047790Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:04.8048339Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:04.8048884Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:11:04.8049471Z 0x000000000000000e (SONAME) Library soname: [fbgemm.so] 2025-05-07T20:11:04.8050005Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:11:04.8050432Z 0x000000000000000c (INIT) 0xf6000 2025-05-07T20:11:04.8050814Z 0x000000000000000d (FINI) 0x4c8fb0 2025-05-07T20:11:04.8051177Z 0x0000000000000019 (INIT_ARRAY) 0x51dac0 2025-05-07T20:11:04.8051573Z 0x000000000000001b (INIT_ARRAYSZ) 56 (bytes) 2025-05-07T20:11:04.8051948Z 0x000000000000001a (FINI_ARRAY) 0x51daf8 2025-05-07T20:11:04.8052350Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:04.8052733Z 0x0000000000000004 (HASH) 0x238 2025-05-07T20:11:04.8053084Z 0x000000006ffffef5 (GNU_HASH) 0x6e20 2025-05-07T20:11:04.8053462Z 0x0000000000000005 (STRTAB) 0x2b0a0 2025-05-07T20:11:04.8053814Z 0x0000000000000006 (SYMTAB) 0xe7e0 2025-05-07T20:11:04.8054209Z 0x000000000000000a (STRSZ) 708057 (bytes) 2025-05-07T20:11:04.8054590Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:04.8054984Z 0x0000000000000003 (PLTGOT) 0x520dd8 2025-05-07T20:11:04.8055363Z 0x0000000000000002 (PLTRELSZ) 24312 (bytes) 2025-05-07T20:11:04.8055891Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:04.8056257Z 0x0000000000000017 (JMPREL) 0xef8e0 2025-05-07T20:11:04.8056602Z 
0x0000000000000007 (RELA) 0xda610 2025-05-07T20:11:04.8057221Z 0x0000000000000008 (RELASZ) 86736 (bytes) 2025-05-07T20:11:04.8057639Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:04.8058020Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:04.8058368Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:04.8058774Z 0x000000006ffffffe (VERNEED) 0xda490 2025-05-07T20:11:04.8059129Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:11:04.8059686Z 0x000000006ffffff0 (VERSYM) 0xd7e7a 2025-05-07T20:11:04.8060068Z 0x000000006ffffff9 (RELACOUNT) 9 2025-05-07T20:11:04.8060405Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:04.8060621Z 2025-05-07T20:11:04.8060771Z ################################################################################ 2025-05-07T20:11:04.8061011Z 2025-05-07T20:11:04.8061045Z 2025-05-07T20:11:04.8061170Z ################################################################################ 2025-05-07T20:11:04.8061708Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:11:04.8062245Z [CHECK] Listing out library size: 2025-05-07T20:11:04.8062715Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:11:04.8063105Z 2025-05-07T20:11:04.8063346Z 3 ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:11:04.8063659Z 2025-05-07T20:11:04.8064062Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:11:04.8065096Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_common.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:04.8065701Z 2025-05-07T20:11:04.8117849Z GLIBC_2.2.5 2025-05-07T20:11:04.8118513Z GLIBC_2.3 2025-05-07T20:11:04.8119113Z GLIBC_2.14 2025-05-07T20:11:04.8119444Z 2025-05-07T20:11:04.8119458Z 2025-05-07T20:11:04.8120697Z [CHECK] Listing out the GLIBCXX versions referenced by: 
./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:11:04.8123808Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_common.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:04.8125139Z 2025-05-07T20:11:04.8179002Z GLIBCXX_3.4 2025-05-07T20:11:04.8179940Z GLIBCXX_3.4.9 2025-05-07T20:11:04.8180585Z GLIBCXX_3.4.14 2025-05-07T20:11:04.8181179Z GLIBCXX_3.4.20 2025-05-07T20:11:04.8181804Z GLIBCXX_3.4.21 2025-05-07T20:11:04.8182841Z GLIBCXX_3.4.29 2025-05-07T20:11:04.8183204Z 2025-05-07T20:11:04.8183218Z 2025-05-07T20:11:04.8200332Z + nm -gDC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_common.so > /tmp/tmp.IHitWXTFXF.symbols.txt 2025-05-07T20:11:04.8201001Z 2025-05-07T20:11:04.8223079Z 2025-05-07T20:11:04.8249951Z [CHECK] Total Number of symbols: 505 2025-05-07T20:11:04.8261438Z [CHECK] Number of fbgemm symbols: 47 2025-05-07T20:11:04.8279173Z + nm -gDCu ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_common.so > /tmp/tmp.iDYjZexxKg.usymbols.txt 2025-05-07T20:11:04.8279749Z 2025-05-07T20:11:04.8300961Z 2025-05-07T20:11:04.8327578Z [CHECK] Listing out undefined symbols (195 total): 2025-05-07T20:11:04.8342335Z U GOMP_barrier 2025-05-07T20:11:04.8343254Z U GOMP_parallel 2025-05-07T20:11:04.8344296Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:04.8344937Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:04.8345329Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:11:04.8345788Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:11:04.8346201Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:11:04.8346634Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:11:04.8347046Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:11:04.8347625Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:11:04.8348110Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:11:04.8348505Z U 
__cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:11:04.8348891Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:04.8349236Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:04.8349611Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:04.8349999Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:11:04.8350380Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:11:04.8350720Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:04.8351090Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:04.8351453Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:04.8351820Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:11:04.8352173Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:04.8352509Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:11:04.8353050Z U at::Tensor::index_put_(std::initializer_list, at::Tensor const&) 2025-05-07T20:11:04.8353662Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:11:04.8354176Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:11:04.8355172Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:04.8356123Z U at::_ops::is_nonzero::call(at::Tensor const&) 2025-05-07T20:11:04.8356612Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:11:04.8357156Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:11:04.8357966Z U at::_ops::slice_Tensor::call(at::Tensor const&, long, std::optional, std::optional, c10::SymInt) 2025-05-07T20:11:04.8359192Z U at::_ops::to_dtype_layout::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, bool, bool, std::optional) 2025-05-07T20:11:04.8360049Z U at::_ops::zero_::call(at::Tensor&) 2025-05-07T20:11:04.8360831Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:04.8361661Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:11:04.8362009Z 
U at::get_num_threads() 2025-05-07T20:11:04.8362350Z U at::get_thread_num() 2025-05-07T20:11:04.8362675Z U at::in_parallel_region() 2025-05-07T20:11:04.8362981Z U at::init_num_threads() 2025-05-07T20:11:04.8363325Z U at::internal::set_thread_num(int) 2025-05-07T20:11:04.8363679Z U at::toAccumulateType(c10::ScalarType, bool) 2025-05-07T20:11:04.8364044Z U c10::BoolType::get() 2025-05-07T20:11:04.8364398Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:04.8365060Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:04.8365659Z U c10::Error::what() const 2025-05-07T20:11:04.8366006Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:11:04.8366467Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:04.8366899Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:04.8367275Z U c10::IntType::get() 2025-05-07T20:11:04.8367743Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:11:04.8368181Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:11:04.8368659Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:04.8369127Z U c10::ParallelGuard::ParallelGuard(bool) 2025-05-07T20:11:04.8369521Z U c10::ParallelGuard::~ParallelGuard() 2025-05-07T20:11:04.8369923Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:11:04.8370345Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:04.8371016Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:04.8371681Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:04.8372093Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:11:04.8372456Z U c10::SymInt::promote_to_negative() 2025-05-07T20:11:04.8372838Z U c10::SymInt::sym_ne(c10::SymInt const&) const 2025-05-07T20:11:04.8373207Z U c10::SymInt::toSymNode() const 2025-05-07T20:11:04.8373526Z U c10::SymIntType::get() 2025-05-07T20:11:04.8373906Z U 
c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:11:04.8374272Z U c10::TensorType::get() 2025-05-07T20:11:04.8374625Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:04.8375573Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:11:04.8376518Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:11:04.8376958Z U c10::cuda::CUDAKernelLaunchRegistry::get_singleton_ref() 2025-05-07T20:11:04.8377493Z U c10::cuda::CUDAKernelLaunchRegistry::get_uvm_assertions_ptr_for_current_device() 2025-05-07T20:11:04.8378218Z U c10::cuda::CUDAKernelLaunchRegistry::insert(char const*, char const*, unsigned int, char const*, int) 2025-05-07T20:11:04.8378804Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:11:04.8379152Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:11:04.8379614Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:11:04.8380147Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:11:04.8380590Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:11:04.8381110Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:11:04.8381608Z U c10::cuda::device_count() 2025-05-07T20:11:04.8382217Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:11:04.8382625Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:11:04.8383063Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:11:04.8383511Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:11:04.8383944Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:11:04.8384383Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:11:04.8385168Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:04.8386121Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:04.8387065Z U c10::detail::torchCheckFail(char const*, char 
const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:04.8388151Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:04.8402407Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:04.8403413Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:11:04.8403946Z U c10::impl::GPUTrace::haveState 2025-05-07T20:11:04.8404303Z U c10::impl::device_guard_impl_registry 2025-05-07T20:11:04.8404693Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:11:04.8405048Z U c10::operator<(c10::SymInt const&, int) 2025-05-07T20:11:04.8405460Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:11:04.8405895Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:11:04.8406322Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:11:04.8406730Z U c10::operator>(c10::SymInt const&, int) 2025-05-07T20:11:04.8407088Z U c10::operator>=(c10::SymInt const&, int) 2025-05-07T20:11:04.8407464Z U c10::warn(c10::Warning const&) 2025-05-07T20:11:04.8407879Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:04.8408345Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:11:04.8408719Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:11:04.8409113Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:11:04.8409493Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:11:04.8409844Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:11:04.8410214Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:11:04.8410563Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:11:04.8410950Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:11:04.8411302Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:11:04.8411670Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:11:04.8412038Z U cudaStreamQuery@libcudart.so.12 
2025-05-07T20:11:04.8412391Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:11:04.8412770Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:11:04.8413742Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:04.8415366Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:04.8417207Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:04.8419201Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:04.8421139Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:04.8423061Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:04.8424987Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:04.8426908Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:04.8428771Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:04.8430699Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, 
bool, bool, bool) 2025-05-07T20:11:04.8432613Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:04.8434535Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:04.8435758Z U fbgemm::fbgemmAlignedAlloc(unsigned long, unsigned long, bool) 2025-05-07T20:11:04.8436247Z U fbgemm::fbgemmAlignedFree(void*) 2025-05-07T20:11:04.8436767Z U fbgemm_gpu::config::is_feature_enabled(fbgemm_gpu::config::FeatureGateName const&) 2025-05-07T20:11:04.8437344Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:11:04.8437815Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:04.8438243Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:11:04.8438679Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:04.8439139Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:04.8439627Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:11:04.8440074Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:04.8440462Z U memcpy@GLIBC_2.14 2025-05-07T20:11:04.8440809Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:04.8441125Z U memset@GLIBC_2.2.5 2025-05-07T20:11:04.8441466Z U omp_get_max_threads 2025-05-07T20:11:04.8441780Z U omp_get_num_threads 2025-05-07T20:11:04.8442125Z U omp_get_thread_num 2025-05-07T20:11:04.8442494Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:04.8442941Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:04.8443580Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:04.8444598Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:04.8445482Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const &@GLIBCXX_3.4.29 
2025-05-07T20:11:04.8446092Z U std::__exception_ptr::exception_ptr::_M_addref() 2025-05-07T20:11:04.8446529Z U std::__exception_ptr::exception_ptr::_M_release() 2025-05-07T20:11:04.8446934Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:04.8447500Z U std::__throw_bad_function_call()@GLIBCXX_3.4.14 2025-05-07T20:11:04.8447952Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:04.8448353Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:04.8448812Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:11:04.8449403Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:04.8450122Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:11:04.8451206Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:04.8452481Z U std::basic_ostream >& std::operator<< >(std::basic_ostream >&, char const*)@GLIBCXX_3.4 2025-05-07T20:11:04.8453246Z U std::current_exception()@CXXABI_1.3.3 2025-05-07T20:11:04.8453642Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:11:04.8454018Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:04.8454416Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:04.8454808Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:11:04.8455161Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:04.8455538Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:11:04.8455884Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:04.8456326Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:04.8456914Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:04.8457414Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:04.8457987Z U std::pair fbgemm::radix_sort_parallel(int*, int*, int*, int*, long, long, bool) 2025-05-07T20:11:04.8458946Z U std::pair*> fbgemm::radix_sort_parallel >(int*, 
std::pair*, int*, std::pair*, long, long, bool) 2025-05-07T20:11:04.8460404Z U std::pair*> fbgemm::radix_sort_parallel >(int*, std::pair*, int*, std::pair*, long, long, bool) 2025-05-07T20:11:04.8461320Z U std::rethrow_exception(std::__exception_ptr::exception_ptr)@CXXABI_1.3.3 2025-05-07T20:11:04.8461794Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:04.8462171Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:04.8463083Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:04.8464314Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:04.8465215Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:04.8466009Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:04.8466621Z U typeinfo for c10::Error 2025-05-07T20:11:04.8466999Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:11:04.8467451Z U unsigned char* at::TensorBase::data_ptr() const 2025-05-07T20:11:04.8468001Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:04.8468512Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:04.8468970Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:04.8469392Z U vtable for c10::Error 2025-05-07T20:11:04.8469984Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:04.8470875Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:04.8471581Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:11:04.8472261Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:04.8472754Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:04.8473079Z w _ITM_registerTMCloneTable 2025-05-07T20:11:04.8473419Z w __cxa_finalize@GLIBC_2.2.5 
2025-05-07T20:11:04.8473722Z w __gmon_start__ 2025-05-07T20:11:04.8474019Z w __pthread_key_create 2025-05-07T20:11:04.8474381Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:04.8474825Z + ldd ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:11:04.8475127Z 2025-05-07T20:11:04.8475261Z linux-vdso.so.1 (0x00007ffe40dd2000) 2025-05-07T20:11:04.8475548Z libc10.so => not found 2025-05-07T20:11:04.8475815Z libc10_cuda.so => not found 2025-05-07T20:11:04.8476334Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm.so (0x00007f8117600000) 2025-05-07T20:11:04.8477229Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_config.so (0x00007f8118010000) 2025-05-07T20:11:04.8477877Z libtorch.so => not found 2025-05-07T20:11:04.8478135Z libtorch_cpu.so => not found 2025-05-07T20:11:04.8478436Z libtorch_cuda.so => not found 2025-05-07T20:11:04.8478712Z libcudart.so.12 => not found 2025-05-07T20:11:04.8479068Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f811739c000) 2025-05-07T20:11:04.8479479Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f8117fe0000) 2025-05-07T20:11:04.8479883Z libc.so.6 => /lib64/libc.so.6 (0x00007f8117194000) 2025-05-07T20:11:04.8480247Z /lib64/ld-linux-x86-64.so.2 (0x00007f8118020000) 2025-05-07T20:11:04.8480594Z libc10.so => not found 2025-05-07T20:11:04.8481115Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/asmjit.so (0x00007f8117f63000) 2025-05-07T20:11:04.8481659Z libtorch.so => not found 2025-05-07T20:11:04.8482120Z libtorch_cpu.so => not found 2025-05-07T20:11:04.8482586Z libtorch_cuda.so => not found 2025-05-07T20:11:04.8482973Z libm.so.6 => /lib64/libm.so.6 (0x00007f81170b9000) 2025-05-07T20:11:04.8483317Z libc10.so => not found 2025-05-07T20:11:04.8483611Z libtorch_cpu.so => not found 2025-05-07T20:11:04.8483906Z libtorch_cuda.so => not found 
2025-05-07T20:11:04.8484224Z libtorch.so => not found 2025-05-07T20:11:04.8484530Z libtorch_cpu.so => not found 2025-05-07T20:11:04.8484818Z libtorch_cuda.so => not found 2025-05-07T20:11:04.8485126Z libtorch.so => not found 2025-05-07T20:11:04.8485296Z 2025-05-07T20:11:04.8485420Z [CHECK] Displaying ELF information: 2025-05-07T20:11:04.8485898Z + readelf -d ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:11:04.8486254Z 2025-05-07T20:11:04.8486291Z 2025-05-07T20:11:04.8486489Z Dynamic section at offset 0x2c4138 contains 40 entries: 2025-05-07T20:11:04.8486896Z Tag Type Name/Value 2025-05-07T20:11:04.8487366Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:04.8487899Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:04.8488570Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm.so] 2025-05-07T20:11:04.8489117Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_config.so] 2025-05-07T20:11:04.8489734Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:04.8490304Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:04.8490855Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:04.8491475Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:11:04.8492026Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:04.8492601Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:04.8493130Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:04.8493695Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:11:04.8494339Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_common.so] 2025-05-07T20:11:04.8495009Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:11:04.8495434Z 0x000000000000000c (INIT) 0x13000 2025-05-07T20:11:04.8495765Z 0x000000000000000d (FINI) 
0x7422c 2025-05-07T20:11:04.8496130Z 0x0000000000000019 (INIT_ARRAY) 0x2c4cf8 2025-05-07T20:11:04.8496471Z 0x000000000000001b (INIT_ARRAYSZ) 72 (bytes) 2025-05-07T20:11:04.8496838Z 0x000000000000001a (FINI_ARRAY) 0x2c4d40 2025-05-07T20:11:04.8497198Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:04.8497527Z 0x0000000000000004 (HASH) 0x238 2025-05-07T20:11:04.8497879Z 0x000000006ffffef5 (GNU_HASH) 0x18b0 2025-05-07T20:11:04.8498206Z 0x0000000000000005 (STRTAB) 0x5790 2025-05-07T20:11:04.8498546Z 0x0000000000000006 (SYMTAB) 0x2820 2025-05-07T20:11:04.8498886Z 0x000000000000000a (STRSZ) 40152 (bytes) 2025-05-07T20:11:04.8499263Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:04.8499685Z 0x0000000000000003 (PLTGOT) 0x2c53f8 2025-05-07T20:11:04.8500257Z 0x0000000000000002 (PLTRELSZ) 6768 (bytes) 2025-05-07T20:11:04.8500646Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:04.8500985Z 0x0000000000000017 (JMPREL) 0x10f38 2025-05-07T20:11:04.8501355Z 0x0000000000000007 (RELA) 0xf990 2025-05-07T20:11:04.8501719Z 0x0000000000000008 (RELASZ) 5544 (bytes) 2025-05-07T20:11:04.8502129Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:04.8502458Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:04.8502827Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:04.8503197Z 0x000000006ffffffe (VERNEED) 0xf860 2025-05-07T20:11:04.8503572Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:11:04.8503951Z 0x000000006ffffff0 (VERSYM) 0xf468 2025-05-07T20:11:04.8504303Z 0x000000006ffffff9 (RELACOUNT) 17 2025-05-07T20:11:04.8504660Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:04.8504878Z 2025-05-07T20:11:04.8505004Z ################################################################################ 2025-05-07T20:11:04.8505268Z 2025-05-07T20:11:04.8505272Z 2025-05-07T20:11:04.8505392Z ################################################################################ 2025-05-07T20:11:04.8505931Z [CHECK] BUILT LIBRARY: 
./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:11:04.8506433Z [CHECK] Listing out library size: 2025-05-07T20:11:04.8506929Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:11:04.8507307Z 2025-05-07T20:11:04.8507511Z 21 ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:11:04.8507847Z 2025-05-07T20:11:04.8508241Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:11:04.8509308Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_cache.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:04.8509939Z 2025-05-07T20:11:04.8510057Z GLIBC_2.2.5 2025-05-07T20:11:04.8510309Z GLIBC_2.14 2025-05-07T20:11:04.8510437Z 2025-05-07T20:11:04.8510441Z 2025-05-07T20:11:04.8510844Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:11:04.8511920Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_cache.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:04.8512539Z 2025-05-07T20:11:04.8583507Z GLIBCXX_3.4 2025-05-07T20:11:04.8584143Z GLIBCXX_3.4.9 2025-05-07T20:11:04.8584775Z GLIBCXX_3.4.11 2025-05-07T20:11:04.8585364Z GLIBCXX_3.4.20 2025-05-07T20:11:04.8586286Z GLIBCXX_3.4.21 2025-05-07T20:11:04.8586866Z GLIBCXX_3.4.29 2025-05-07T20:11:04.8587277Z 2025-05-07T20:11:04.8587290Z 2025-05-07T20:11:04.8606234Z + nm -gDC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_cache.so > /tmp/tmp.ckTPl5V2c7.symbols.txt 2025-05-07T20:11:04.8607701Z 2025-05-07T20:11:04.8651002Z 2025-05-07T20:11:04.8676836Z [CHECK] Total Number of symbols: 811 2025-05-07T20:11:04.8688910Z [CHECK] Number of fbgemm symbols: 80 2025-05-07T20:11:04.8704431Z + nm -gDCu 
./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_cache.so > /tmp/tmp.762mlMKTSP.usymbols.txt 2025-05-07T20:11:04.8704934Z 2025-05-07T20:11:04.8725745Z 2025-05-07T20:11:04.8751571Z [CHECK] Listing out undefined symbols (152 total): 2025-05-07T20:11:04.8765753Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:04.8766343Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:04.8766780Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:11:04.8767231Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:11:04.8767673Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:11:04.8768071Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:11:04.8768487Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:11:04.8768888Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:11:04.8769273Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:11:04.8769668Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:04.8769999Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:04.8770358Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:04.8770693Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:11:04.8771055Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:04.8771429Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:04.8771770Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:04.8772134Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:04.8772506Z U at::CUDAGeneratorImpl::device_type() 2025-05-07T20:11:04.8772969Z U at::CUDAGeneratorImpl::philox_cuda_state(unsigned long) 2025-05-07T20:11:04.8773758Z U at::_ops::arange::call(c10::Scalar const&, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:04.8774993Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:04.8776423Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 
2025-05-07T20:11:04.8777435Z U at::_ops::fill__Scalar::call(at::Tensor&, c10::Scalar const&) 2025-05-07T20:11:04.8778557Z U at::_ops::full_like::call(at::Tensor const&, c10::Scalar const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:04.8779787Z U at::_ops::index_select::call(at::Tensor const&, long, at::Tensor const&) 2025-05-07T20:11:04.8780588Z U at::_ops::slice_Tensor::call(at::Tensor const&, long, std::optional, std::optional, c10::SymInt) 2025-05-07T20:11:04.8781613Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:04.8783080Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:04.8784121Z U at::cuda::detail::getDefaultCUDAGenerator(signed char) 2025-05-07T20:11:04.8784570Z U at::cuda::getDeviceProperties(signed char) 2025-05-07T20:11:04.8784975Z U c10::BoolType::get() 2025-05-07T20:11:04.8785393Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:04.8785810Z U c10::GeneratorImpl::device() const 2025-05-07T20:11:04.8786264Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:04.8786722Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:04.8787115Z U c10::IntType::get() 2025-05-07T20:11:04.8787581Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:04.8788103Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:04.8788566Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:04.8789268Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:04.8789980Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:04.8790394Z U c10::SymInt::promote_to_negative() 2025-05-07T20:11:04.8790794Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:04.8791209Z U c10::TensorType::get() 2025-05-07T20:11:04.8791559Z U 
c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:04.8792572Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:11:04.8793599Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:11:04.8794104Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:11:04.8794490Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:11:04.8794951Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:11:04.8795314Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:11:04.8795679Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:11:04.8796136Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:11:04.8796624Z U c10::cuda::current_device() 2025-05-07T20:11:04.8796933Z U c10::cuda::device_count() 2025-05-07T20:11:04.8797293Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:11:04.8797670Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:11:04.8798078Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:11:04.8798495Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:11:04.8798940Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:11:04.8799388Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:11:04.8800317Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:04.8801247Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:04.8802745Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:04.8803911Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:04.8804593Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:11:04.8804976Z U c10::impl::GPUTrace::haveState 2025-05-07T20:11:04.8805366Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr 
const&) 2025-05-07T20:11:04.8805854Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:11:04.8806283Z U c10::impl::device_guard_impl_registry 2025-05-07T20:11:04.8806709Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:11:04.8807150Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:11:04.8807535Z U c10::throwNullDataPtrError() 2025-05-07T20:11:04.8807915Z U c10::warn(c10::Warning const&) 2025-05-07T20:11:04.8808262Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:11:04.8808721Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:04.8809172Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:11:04.8809569Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:11:04.8809964Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:11:04.8810356Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:11:04.8810760Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:11:04.8811128Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:11:04.8811517Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:11:04.8811866Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:11:04.8812240Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:11:04.8812603Z U cudaFuncGetAttributes@libcudart.so.12 2025-05-07T20:11:04.8812970Z U cudaGetDevice@libcudart.so.12 2025-05-07T20:11:04.8813329Z U cudaGetDeviceCount@libcudart.so.12 2025-05-07T20:11:04.8813679Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:11:04.8814042Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:11:04.8814389Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:11:04.8814750Z U cudaMemcpyAsync@libcudart.so.12 2025-05-07T20:11:04.8815091Z U cudaMemsetAsync@libcudart.so.12 2025-05-07T20:11:04.8815612Z U cudaOccupancyMaxActiveBlocksPerMultiprocessorWithFlags@libcudart.so.12 2025-05-07T20:11:04.8816154Z U cudaPeekAtLastError@libcudart.so.12 2025-05-07T20:11:04.8816500Z U cudaSetDevice@libcudart.so.12 2025-05-07T20:11:04.8816845Z U 
cudaStreamQuery@libcudart.so.12 2025-05-07T20:11:04.8817196Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:11:04.8817560Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:11:04.8817945Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:04.8818351Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:11:04.8818742Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:04.8819117Z U log2f@GLIBC_2.2.5 2025-05-07T20:11:04.8819622Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:04.8820056Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:11:04.8820542Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:04.8820897Z U memcpy@GLIBC_2.14 2025-05-07T20:11:04.8821201Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:04.8821542Z U memset@GLIBC_2.2.5 2025-05-07T20:11:04.8821891Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:04.8822295Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:04.8822895Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:04.8823997Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:04.8824871Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const &@GLIBCXX_3.4.29 2025-05-07T20:11:04.8825449Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:04.8825833Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:04.8826230Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:04.8826682Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:11:04.8827121Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:11:04.8827615Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:04.8828342Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:11:04.8829418Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, 
char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:04.8830252Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:04.8830611Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:04.8830957Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:11:04.8831312Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:04.8831646Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:11:04.8832102Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:04.8832491Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:04.8832991Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:04.8833435Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:04.8833719Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:04.8834028Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:04.8834830Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:04.8835944Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:04.8836758Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:04.8837478Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:04.8838324Z U unsigned char* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:04.8838857Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:04.8839308Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:04.8839935Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:04.8840592Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:04.8841433Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:04.8842117Z U vtable for std::basic_ios >@GLIBCXX_3.4 
2025-05-07T20:11:04.8842675Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:04.8843185Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:04.8843524Z w _ITM_registerTMCloneTable 2025-05-07T20:11:04.8843845Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:04.8844162Z w __gmon_start__ 2025-05-07T20:11:04.8844433Z w __pthread_key_create 2025-05-07T20:11:04.8844757Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:11:04.8845084Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:11:04.8845472Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:04.8845936Z + ldd ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:11:04.8846267Z 2025-05-07T20:11:04.8846407Z linux-vdso.so.1 (0x00007ffce8f39000) 2025-05-07T20:11:04.8846715Z libtorch.so => not found 2025-05-07T20:11:04.8846963Z libc10.so => not found 2025-05-07T20:11:04.8847223Z libc10_cuda.so => not found 2025-05-07T20:11:04.8847507Z libtorch_cpu.so => not found 2025-05-07T20:11:04.8847796Z libtorch_cuda.so => not found 2025-05-07T20:11:04.8848087Z libcudart.so.12 => not found 2025-05-07T20:11:04.8848426Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f3a31b9c000) 2025-05-07T20:11:04.8848843Z libm.so.6 => /lib64/libm.so.6 (0x00007f3a31ac1000) 2025-05-07T20:11:04.8849228Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f3a33564000) 2025-05-07T20:11:04.8849618Z libc.so.6 => /lib64/libc.so.6 (0x00007f3a318b9000) 2025-05-07T20:11:04.8849978Z /lib64/ld-linux-x86-64.so.2 (0x00007f3a33598000) 2025-05-07T20:11:04.8850237Z 2025-05-07T20:11:04.8850349Z [CHECK] Displaying ELF information: 2025-05-07T20:11:04.8850799Z + readelf -d ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:11:04.8851141Z 2025-05-07T20:11:04.8851145Z 2025-05-07T20:11:04.8851308Z Dynamic section at offset 0x14c3b48 contains 37 entries: 2025-05-07T20:11:04.8851709Z Tag Type Name/Value 2025-05-07T20:11:04.8852128Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 
2025-05-07T20:11:04.8852655Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:04.8853175Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:04.8853693Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:04.8854405Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:04.8855110Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:11:04.8855651Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:04.8856164Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:11:04.8856677Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:04.8857186Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:04.8857713Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_cache.so] 2025-05-07T20:11:04.8858189Z 0x000000000000000c (INIT) 0x2a000 2025-05-07T20:11:04.8858552Z 0x000000000000000d (FINI) 0xe445c 2025-05-07T20:11:04.8858913Z 0x0000000000000019 (INIT_ARRAY) 0x14c31b0 2025-05-07T20:11:04.8859308Z 0x000000000000001b (INIT_ARRAYSZ) 208 (bytes) 2025-05-07T20:11:04.8859769Z 0x000000000000001a (FINI_ARRAY) 0x14c3280 2025-05-07T20:11:04.8860135Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:04.8860537Z 0x0000000000000004 (HASH) 0x200 2025-05-07T20:11:04.8860875Z 0x000000006ffffef5 (GNU_HASH) 0x1eb8 2025-05-07T20:11:04.8861227Z 0x0000000000000005 (STRTAB) 0x8730 2025-05-07T20:11:04.8861555Z 0x0000000000000006 (SYMTAB) 0x3b10 2025-05-07T20:11:04.8861906Z 0x000000000000000a (STRSZ) 113474 (bytes) 2025-05-07T20:11:04.8862278Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:04.8862633Z 0x0000000000000003 (PLTGOT) 0x14c3de8 2025-05-07T20:11:04.8863006Z 0x0000000000000002 (PLTRELSZ) 8736 (bytes) 2025-05-07T20:11:04.8863359Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:04.8863672Z 0x0000000000000017 (JMPREL) 0x27c90 
2025-05-07T20:11:04.8864003Z 0x0000000000000007 (RELA) 0x249f0 2025-05-07T20:11:04.8864345Z 0x0000000000000008 (RELASZ) 12960 (bytes) 2025-05-07T20:11:04.8864704Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:04.8865034Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:04.8865362Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:04.8865714Z 0x000000006ffffffe (VERNEED) 0x248d0 2025-05-07T20:11:04.8866033Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:11:04.8866363Z 0x000000006ffffff0 (VERSYM) 0x24272 2025-05-07T20:11:04.8866687Z 0x000000006ffffff9 (RELACOUNT) 39 2025-05-07T20:11:04.8867011Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:04.8867213Z 2025-05-07T20:11:04.8867323Z ################################################################################ 2025-05-07T20:11:04.8867567Z 2025-05-07T20:11:04.8867571Z 2025-05-07T20:11:04.8867684Z ################################################################################ 2025-05-07T20:11:04.8868195Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:11:04.8868684Z [CHECK] Listing out library size: 2025-05-07T20:11:04.8869150Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:11:04.8869540Z 2025-05-07T20:11:04.8869771Z 9 ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:11:04.8870082Z 2025-05-07T20:11:04.8870481Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:11:04.8871517Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_optimizers.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:04.8872248Z 2025-05-07T20:11:04.8912775Z GLIBC_2.2.5 2025-05-07T20:11:04.8913177Z GLIBC_2.3 2025-05-07T20:11:04.8913399Z GLIBC_2.14 2025-05-07T20:11:04.8913512Z 2025-05-07T20:11:04.8913523Z 2025-05-07T20:11:04.8914426Z [CHECK] 
Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:11:04.8915524Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_optimizers.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:04.8916187Z 2025-05-07T20:11:04.8984571Z GLIBCXX_3.4 2025-05-07T20:11:04.8984827Z GLIBCXX_3.4.9 2025-05-07T20:11:04.8985033Z GLIBCXX_3.4.11 2025-05-07T20:11:04.8985255Z GLIBCXX_3.4.18 2025-05-07T20:11:04.8985451Z GLIBCXX_3.4.21 2025-05-07T20:11:04.8985667Z GLIBCXX_3.4.29 2025-05-07T20:11:04.8986758Z 2025-05-07T20:11:04.8986763Z 2025-05-07T20:11:04.9009061Z + nm -gDC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_optimizers.so > /tmp/tmp.RabNZCedR8.symbols.txt 2025-05-07T20:11:04.9010556Z 2025-05-07T20:11:04.9037889Z 2025-05-07T20:11:04.9062063Z [CHECK] Total Number of symbols: 342 2025-05-07T20:11:04.9077058Z [CHECK] Number of fbgemm symbols: 14 2025-05-07T20:11:04.9090587Z + nm -gDCu ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_optimizers.so > /tmp/tmp.cYvA09nx3r.usymbols.txt 2025-05-07T20:11:04.9092082Z 2025-05-07T20:11:04.9106037Z 2025-05-07T20:11:04.9133026Z [CHECK] Listing out undefined symbols (129 total): 2025-05-07T20:11:04.9145245Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:04.9146136Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:04.9146716Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:04.9147092Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:11:04.9147550Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:11:04.9147964Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:11:04.9148349Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:11:04.9148758Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:11:04.9149101Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:11:04.9149476Z U 
__cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:11:04.9149825Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:04.9150150Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:04.9150482Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:04.9150785Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:04.9151113Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:11:04.9151435Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:04.9151779Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:11:04.9152109Z U at::CUDAGeneratorImpl::device_type() 2025-05-07T20:11:04.9152530Z U at::CUDAGeneratorImpl::philox_cuda_state(unsigned long) 2025-05-07T20:11:04.9153029Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:11:04.9153500Z U at::cuda::detail::getDefaultCUDAGenerator(signed char) 2025-05-07T20:11:04.9153888Z U c10::BoolType::get() 2025-05-07T20:11:04.9154052Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:04.9154156Z U c10::FloatType::get() 2025-05-07T20:11:04.9154292Z U c10::GeneratorImpl::device() const 2025-05-07T20:11:04.9154466Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:04.9154608Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:04.9154730Z U c10::IntType::get() 2025-05-07T20:11:04.9154895Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:11:04.9155021Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:11:04.9155188Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:04.9155330Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:11:04.9155754Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:04.9155915Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:04.9156015Z U c10::TensorType::get() 2025-05-07T20:11:04.9156133Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:04.9156894Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 
2025-05-07T20:11:04.9157085Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:11:04.9157241Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:11:04.9157384Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:11:04.9157497Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:11:04.9157612Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:11:04.9157740Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:11:04.9158018Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:11:04.9158237Z U c10::cuda::device_count() 2025-05-07T20:11:04.9158363Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:11:04.9158504Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:11:04.9158657Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:11:04.9158783Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:11:04.9158947Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:11:04.9159054Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:11:04.9159542Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:04.9159794Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:04.9160272Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:04.9160606Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:04.9160715Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:11:04.9160824Z U c10::impl::GPUTrace::haveState 2025-05-07T20:11:04.9160950Z U c10::impl::device_guard_impl_registry 2025-05-07T20:11:04.9161079Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:11:04.9161207Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:11:04.9161306Z U c10::warn(c10::Warning const&) 
2025-05-07T20:11:04.9161503Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:04.9161621Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:11:04.9161751Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:11:04.9161863Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:11:04.9161982Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:11:04.9162092Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:11:04.9162214Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:11:04.9162337Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:11:04.9162468Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:11:04.9162580Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:11:04.9162693Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:11:04.9162808Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:11:04.9162926Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:11:04.9163077Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:11:04.9163199Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:11:04.9163318Z U float at::Tensor::item() const 2025-05-07T20:11:04.9163501Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:04.9163647Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:04.9163819Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:04.9163964Z U memcpy@GLIBC_2.14 2025-05-07T20:11:04.9164066Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:04.9164162Z U memset@GLIBC_2.2.5 2025-05-07T20:11:04.9164314Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:04.9164464Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:04.9164818Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:04.9165203Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:04.9165563Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const &@GLIBCXX_3.4.29 2025-05-07T20:11:04.9165949Z U 
std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:11:04.9166105Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:04.9166227Z U std::__throw_bad_array_new_length() 2025-05-07T20:11:04.9166375Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:04.9166518Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:04.9166687Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:11:04.9166930Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:04.9167276Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:11:04.9167873Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:04.9168380Z U std::basic_ostream >& std::operator<< >(std::basic_ostream >&, char const*)@GLIBCXX_3.4 2025-05-07T20:11:04.9168535Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:04.9168665Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:04.9168791Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:11:04.9168949Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:04.9169070Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:11:04.9169193Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:04.9169409Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:04.9169653Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:04.9169789Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:04.9169910Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:04.9170046Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:04.9170179Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:04.9170761Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:04.9171245Z U 
torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:04.9171504Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:04.9172913Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:04.9173078Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:04.9173271Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:04.9173463Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:04.9173811Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:04.9174177Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:04.9174539Z U vtable for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:04.9174739Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:11:04.9174989Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:04.9175135Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:04.9175255Z w _ITM_registerTMCloneTable 2025-05-07T20:11:04.9175364Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:04.9175490Z w __gmon_start__ 2025-05-07T20:11:04.9175591Z w __pthread_key_create 2025-05-07T20:11:04.9175705Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:11:04.9175822Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:11:04.9175996Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:04.9176225Z + ldd ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:11:04.9176232Z 2025-05-07T20:11:04.9185283Z linux-vdso.so.1 (0x00007ffec6dc3000) 2025-05-07T20:11:04.9185610Z libtorch.so => not found 2025-05-07T20:11:04.9185896Z libc10.so => not found 2025-05-07T20:11:04.9186161Z libc10_cuda.so => not found 2025-05-07T20:11:04.9186483Z libtorch_cpu.so => not found 2025-05-07T20:11:04.9186756Z 
libtorch_cuda.so => not found 2025-05-07T20:11:04.9187033Z libcudart.so.12 => not found 2025-05-07T20:11:04.9187540Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007fe80739c000) 2025-05-07T20:11:04.9187983Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fe80802a000) 2025-05-07T20:11:04.9188351Z libc.so.6 => /lib64/libc.so.6 (0x00007fe807194000) 2025-05-07T20:11:04.9188718Z /lib64/ld-linux-x86-64.so.2 (0x00007fe80805e000) 2025-05-07T20:11:04.9189106Z libm.so.6 => /lib64/libm.so.6 (0x00007fe807f4f000) 2025-05-07T20:11:04.9189134Z 2025-05-07T20:11:04.9189450Z [CHECK] Displaying ELF information: 2025-05-07T20:11:04.9190182Z + readelf -d ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:11:04.9190198Z 2025-05-07T20:11:04.9221163Z 2025-05-07T20:11:04.9221821Z Dynamic section at offset 0x8a8558 contains 37 entries: 2025-05-07T20:11:04.9222202Z Tag Type Name/Value 2025-05-07T20:11:04.9222958Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:04.9223194Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:04.9223407Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:04.9223621Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:04.9223860Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:04.9224079Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:11:04.9224291Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:04.9224525Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:04.9224727Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:04.9224956Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:11:04.9225393Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_optimizers.so] 2025-05-07T20:11:04.9225574Z 0x000000000000000c (INIT) 0x10000 2025-05-07T20:11:04.9225703Z 
0x000000000000000d (FINI) 0x3464c 2025-05-07T20:11:04.9225832Z 0x0000000000000019 (INIT_ARRAY) 0x8a82d8 2025-05-07T20:11:04.9225998Z 0x000000000000001b (INIT_ARRAYSZ) 48 (bytes) 2025-05-07T20:11:04.9226127Z 0x000000000000001a (FINI_ARRAY) 0x8a8308 2025-05-07T20:11:04.9226300Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:04.9226444Z 0x0000000000000004 (HASH) 0x238 2025-05-07T20:11:04.9226571Z 0x000000006ffffef5 (GNU_HASH) 0x1168 2025-05-07T20:11:04.9226689Z 0x0000000000000005 (STRTAB) 0x3c60 2025-05-07T20:11:04.9226829Z 0x0000000000000006 (SYMTAB) 0x1c38 2025-05-07T20:11:04.9227007Z 0x000000000000000a (STRSZ) 36562 (bytes) 2025-05-07T20:11:04.9227160Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:04.9227314Z 0x0000000000000003 (PLTGOT) 0x8a87f8 2025-05-07T20:11:04.9227459Z 0x0000000000000002 (PLTRELSZ) 3600 (bytes) 2025-05-07T20:11:04.9227694Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:04.9227809Z 0x0000000000000017 (JMPREL) 0xeb48 2025-05-07T20:11:04.9227948Z 0x0000000000000007 (RELA) 0xcf10 2025-05-07T20:11:04.9228082Z 0x0000000000000008 (RELASZ) 7224 (bytes) 2025-05-07T20:11:04.9228211Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:04.9228350Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:04.9228484Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:04.9228609Z 0x000000006ffffffe (VERNEED) 0xcde0 2025-05-07T20:11:04.9228724Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:11:04.9228872Z 0x000000006ffffff0 (VERSYM) 0xcb32 2025-05-07T20:11:04.9228991Z 0x000000006ffffff9 (RELACOUNT) 90 2025-05-07T20:11:04.9229102Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:04.9229109Z 2025-05-07T20:11:04.9229256Z ################################################################################ 2025-05-07T20:11:04.9229260Z 2025-05-07T20:11:04.9229264Z 2025-05-07T20:11:04.9229382Z ################################################################################ 2025-05-07T20:11:04.9229654Z [CHECK] BUILT LIBRARY: 
./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:11:04.9229795Z [CHECK] Listing out library size: 2025-05-07T20:11:04.9230065Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:11:04.9230070Z 2025-05-07T20:11:04.9232236Z 17 ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:11:04.9234137Z 2025-05-07T20:11:04.9235116Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:11:04.9235632Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_utils.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:04.9235640Z 2025-05-07T20:11:04.9297184Z GLIBC_2.2.5 2025-05-07T20:11:04.9298114Z GLIBC_2.14 2025-05-07T20:11:04.9298145Z 2025-05-07T20:11:04.9298159Z 2025-05-07T20:11:04.9299651Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:11:04.9301246Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_utils.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:04.9301262Z 2025-05-07T20:11:04.9359327Z GLIBCXX_3.4 2025-05-07T20:11:04.9360196Z GLIBCXX_3.4.9 2025-05-07T20:11:04.9360462Z GLIBCXX_3.4.20 2025-05-07T20:11:04.9360703Z GLIBCXX_3.4.21 2025-05-07T20:11:04.9360987Z GLIBCXX_3.4.29 2025-05-07T20:11:04.9361028Z 2025-05-07T20:11:04.9361041Z 2025-05-07T20:11:04.9380336Z + nm -gDC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_utils.so > /tmp/tmp.gMDV4SSTmf.symbols.txt 2025-05-07T20:11:04.9380483Z 2025-05-07T20:11:04.9406823Z 2025-05-07T20:11:04.9429837Z [CHECK] Total Number of symbols: 469 2025-05-07T20:11:04.9445051Z [CHECK] Number of fbgemm symbols: 12 2025-05-07T20:11:04.9461666Z + nm -gDCu ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_utils.so > /tmp/tmp.ut7oLrRQrO.usymbols.txt 
2025-05-07T20:11:04.9461709Z 2025-05-07T20:11:04.9477998Z 2025-05-07T20:11:04.9503279Z [CHECK] Listing out undefined symbols (155 total): 2025-05-07T20:11:04.9521096Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:04.9521319Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:04.9521486Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:11:04.9521668Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:11:04.9521980Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:11:04.9522134Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:11:04.9522308Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:11:04.9522444Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:11:04.9522592Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:11:04.9522721Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:11:04.9522861Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:04.9522982Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:04.9523096Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:04.9523235Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:11:04.9523354Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:04.9523469Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:04.9523578Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:04.9523709Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:11:04.9523835Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:04.9524100Z U at::_ops::add_Tensor::call(at::Tensor const&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:11:04.9524296Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:11:04.9525041Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:04.9525805Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:04.9525972Z U at::_ops::fill__Scalar::call(at::Tensor&, c10::Scalar 
const&) 2025-05-07T20:11:04.9526152Z U at::_ops::mul_Scalar::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:11:04.9526349Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:11:04.9526566Z U at::_ops::sub__Tensor::call(at::Tensor&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:11:04.9526687Z U at::_ops::zero_::call(at::Tensor&) 2025-05-07T20:11:04.9527180Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:04.9527743Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:04.9527848Z U c10::BoolType::get() 2025-05-07T20:11:04.9528035Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:04.9528251Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:04.9528359Z U c10::IntType::get() 2025-05-07T20:11:04.9528599Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:11:04.9528724Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:11:04.9528944Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:04.9529161Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:04.9529308Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:04.9529705Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:04.9529874Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:04.9530022Z U c10::SymInt::promote_to_negative() 2025-05-07T20:11:04.9530139Z U c10::SymInt::toSymNode() const 2025-05-07T20:11:04.9530275Z U c10::SymIntType::get() 2025-05-07T20:11:04.9530431Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:11:04.9530589Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:04.9530698Z U c10::TensorType::get() 2025-05-07T20:11:04.9530848Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:04.9531549Z U 
c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:11:04.9531710Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:11:04.9531833Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:11:04.9531961Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:11:04.9532081Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:11:04.9532222Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:11:04.9532344Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:11:04.9532590Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:11:04.9532728Z U c10::cuda::current_device() 2025-05-07T20:11:04.9532835Z U c10::cuda::device_count() 2025-05-07T20:11:04.9532977Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:11:04.9533140Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:11:04.9533286Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:11:04.9533429Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:11:04.9533610Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:11:04.9533729Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:11:04.9534229Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:04.9534492Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:04.9534987Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:04.9535318Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:04.9535900Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:04.9536017Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:11:04.9536176Z U 
c10::impl::GPUTrace::haveState 2025-05-07T20:11:04.9536340Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:11:04.9536498Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:11:04.9536618Z U c10::impl::device_guard_impl_registry 2025-05-07T20:11:04.9536769Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:11:04.9536925Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:11:04.9537040Z U c10::throwNullDataPtrError() 2025-05-07T20:11:04.9537155Z U c10::warn(c10::Warning const&) 2025-05-07T20:11:04.9537262Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:11:04.9537451Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:04.9537612Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:11:04.9537737Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:11:04.9537861Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:11:04.9538018Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:11:04.9538127Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:11:04.9538249Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:11:04.9538358Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:11:04.9538492Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:11:04.9538607Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:11:04.9538735Z U cudaFuncGetAttributes@libcudart.so.12 2025-05-07T20:11:04.9538862Z U cudaGetDevice@libcudart.so.12 2025-05-07T20:11:04.9538976Z U cudaGetDeviceCount@libcudart.so.12 2025-05-07T20:11:04.9539114Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:11:04.9539249Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:11:04.9539468Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:11:04.9539584Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:11:04.9539703Z U cudaMemcpyAsync@libcudart.so.12 2025-05-07T20:11:04.9540011Z U cudaMemsetAsync@libcudart.so.12 2025-05-07T20:11:04.9540301Z U 
cudaOccupancyMaxActiveBlocksPerMultiprocessorWithFlags@libcudart.so.12 2025-05-07T20:11:04.9540433Z U cudaPeekAtLastError@libcudart.so.12 2025-05-07T20:11:04.9540565Z U cudaSetDevice@libcudart.so.12 2025-05-07T20:11:04.9540764Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:11:04.9540896Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:11:04.9541048Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:11:04.9541176Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:11:04.9541336Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:04.9541439Z U log2@GLIBC_2.2.5 2025-05-07T20:11:04.9541633Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:04.9541769Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:11:04.9541931Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:04.9542053Z U memcpy@GLIBC_2.14 2025-05-07T20:11:04.9542156Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:04.9542252Z U memset@GLIBC_2.2.5 2025-05-07T20:11:04.9542422Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:04.9542545Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:04.9542902Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:04.9543363Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:04.9543782Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const &@GLIBCXX_3.4.29 2025-05-07T20:11:04.9543899Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:04.9544071Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:04.9544234Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:04.9544409Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:11:04.9544677Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:04.9545042Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:11:04.9545672Z U 
std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:04.9545815Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:11:04.9546053Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:04.9546177Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:04.9546299Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:11:04.9546409Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:04.9546520Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:11:04.9546641Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:04.9546816Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:04.9547056Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:04.9547193Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:04.9547303Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:04.9547397Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:04.9547535Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:04.9548100Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:04.9548543Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:04.9548812Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:04.9549172Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:04.9549287Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:11:04.9549455Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:04.9549613Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:04.9549765Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:04.9550114Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 
2025-05-07T20:11:04.9550440Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:04.9550635Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:11:04.9550867Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:04.9551000Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:04.9551114Z w _ITM_registerTMCloneTable 2025-05-07T20:11:04.9551253Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:04.9551344Z w __gmon_start__ 2025-05-07T20:11:04.9551446Z w __pthread_key_create 2025-05-07T20:11:04.9551589Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:04.9551792Z + ldd ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:11:04.9551823Z 2025-05-07T20:11:04.9560906Z linux-vdso.so.1 (0x00007ffe1ddfb000) 2025-05-07T20:11:04.9561029Z libtorch.so => not found 2025-05-07T20:11:04.9561143Z libc10.so => not found 2025-05-07T20:11:04.9561249Z libc10_cuda.so => not found 2025-05-07T20:11:04.9561361Z libtorch_cpu.so => not found 2025-05-07T20:11:04.9561473Z libtorch_cuda.so => not found 2025-05-07T20:11:04.9561906Z libcudart.so.12 => not found 2025-05-07T20:11:04.9562108Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007ff32359c000) 2025-05-07T20:11:04.9562274Z libm.so.6 => /lib64/libm.so.6 (0x00007ff3234c1000) 2025-05-07T20:11:04.9562442Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007ff3249a5000) 2025-05-07T20:11:04.9562570Z libc.so.6 => /lib64/libc.so.6 (0x00007ff3232b9000) 2025-05-07T20:11:04.9562705Z /lib64/ld-linux-x86-64.so.2 (0x00007ff3249d9000) 2025-05-07T20:11:04.9562726Z 2025-05-07T20:11:04.9562846Z [CHECK] Displaying ELF information: 2025-05-07T20:11:04.9563091Z + readelf -d ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:11:04.9563097Z 2025-05-07T20:11:04.9592649Z 2025-05-07T20:11:04.9593181Z Dynamic section at offset 0x106d2d0 contains 37 entries: 2025-05-07T20:11:04.9593317Z Tag Type Name/Value 2025-05-07T20:11:04.9593533Z 0x0000000000000001 
(NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:04.9593750Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:04.9593990Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:04.9594199Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:04.9594406Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:04.9594637Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:11:04.9594850Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:04.9595047Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:11:04.9595267Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:04.9595463Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:04.9595687Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_utils.so] 2025-05-07T20:11:04.9595819Z 0x000000000000000c (INIT) 0x12000 2025-05-07T20:11:04.9595935Z 0x000000000000000d (FINI) 0xa2d3c 2025-05-07T20:11:04.9596066Z 0x0000000000000019 (INIT_ARRAY) 0x106de30 2025-05-07T20:11:04.9596191Z 0x000000000000001b (INIT_ARRAYSZ) 96 (bytes) 2025-05-07T20:11:04.9596348Z 0x000000000000001a (FINI_ARRAY) 0x106de90 2025-05-07T20:11:04.9596473Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:04.9596578Z 0x0000000000000004 (HASH) 0x200 2025-05-07T20:11:04.9596708Z 0x000000006ffffef5 (GNU_HASH) 0x1640 2025-05-07T20:11:04.9596822Z 0x0000000000000005 (STRTAB) 0x51f0 2025-05-07T20:11:04.9596931Z 0x0000000000000006 (SYMTAB) 0x25e0 2025-05-07T20:11:04.9597078Z 0x000000000000000a (STRSZ) 38759 (bytes) 2025-05-07T20:11:04.9597207Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:04.9597329Z 0x0000000000000003 (PLTGOT) 0x106e570 2025-05-07T20:11:04.9597461Z 0x0000000000000002 (PLTRELSZ) 5376 (bytes) 2025-05-07T20:11:04.9597700Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:04.9597952Z 0x0000000000000017 
(JMPREL) 0x10600 2025-05-07T20:11:04.9598100Z 0x0000000000000007 (RELA) 0xee18 2025-05-07T20:11:04.9598242Z 0x0000000000000008 (RELASZ) 6120 (bytes) 2025-05-07T20:11:04.9598361Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:04.9598456Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:04.9598595Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:04.9598749Z 0x000000006ffffffe (VERNEED) 0xed08 2025-05-07T20:11:04.9598854Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:11:04.9598954Z 0x000000006ffffff0 (VERSYM) 0xe958 2025-05-07T20:11:04.9599078Z 0x000000006ffffff9 (RELACOUNT) 26 2025-05-07T20:11:04.9599172Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:04.9599177Z 2025-05-07T20:11:04.9600890Z ################################################################################ 2025-05-07T20:11:04.9600895Z 2025-05-07T20:11:04.9600902Z 2025-05-07T20:11:04.9601026Z ################################################################################ 2025-05-07T20:11:04.9601324Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:11:04.9601429Z [CHECK] Listing out library size: 2025-05-07T20:11:04.9601725Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:11:04.9601730Z 2025-05-07T20:11:04.9609818Z 2 ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:11:04.9610086Z 2025-05-07T20:11:04.9610587Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:11:04.9611142Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_sparse_async_cumsum.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:04.9611161Z 2025-05-07T20:11:04.9661718Z GLIBC_2.2.5 2025-05-07T20:11:04.9661852Z GLIBC_2.3 2025-05-07T20:11:04.9661941Z GLIBC_2.14 2025-05-07T20:11:04.9662240Z 
2025-05-07T20:11:04.9662445Z 2025-05-07T20:11:04.9663369Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:11:04.9663972Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_sparse_async_cumsum.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:04.9663991Z 2025-05-07T20:11:04.9720537Z GLIBCXX_3.4 2025-05-07T20:11:04.9720830Z GLIBCXX_3.4.9 2025-05-07T20:11:04.9721062Z GLIBCXX_3.4.21 2025-05-07T20:11:04.9721337Z GLIBCXX_3.4.29 2025-05-07T20:11:04.9721343Z 2025-05-07T20:11:04.9721348Z 2025-05-07T20:11:04.9742384Z + nm -gDC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_sparse_async_cumsum.so > /tmp/tmp.R2MPKd6302.symbols.txt 2025-05-07T20:11:04.9742434Z 2025-05-07T20:11:04.9764394Z 2025-05-07T20:11:04.9789561Z [CHECK] Total Number of symbols: 326 2025-05-07T20:11:04.9804537Z [CHECK] Number of fbgemm symbols: 56 2025-05-07T20:11:04.9820329Z + nm -gDCu ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_sparse_async_cumsum.so > /tmp/tmp.YQ1JWNenRW.usymbols.txt 2025-05-07T20:11:04.9820370Z 2025-05-07T20:11:04.9837486Z 2025-05-07T20:11:04.9862401Z [CHECK] Listing out undefined symbols (143 total): 2025-05-07T20:11:04.9883445Z U GOMP_parallel 2025-05-07T20:11:04.9883887Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:04.9883998Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:04.9884179Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:11:04.9884330Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:11:04.9884466Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:11:04.9884635Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:11:04.9885014Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:11:04.9885201Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:11:04.9885345Z U __cudaUnregisterFatBinary@libcudart.so.12 
2025-05-07T20:11:04.9885469Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:04.9885578Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:04.9885686Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:04.9885861Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:11:04.9885970Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:04.9886074Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:04.9886218Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:04.9886315Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:11:04.9886555Z U at::_ops::contiguous::call(at::Tensor const&, c10::MemoryFormat) 2025-05-07T20:11:04.9887196Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:04.9887872Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:04.9888058Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:11:04.9888193Z U at::_ops::zero_::call(at::Tensor&) 2025-05-07T20:11:04.9888689Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:04.9888795Z U at::get_num_threads() 2025-05-07T20:11:04.9888918Z U at::get_thread_num() 2025-05-07T20:11:04.9889030Z U at::in_parallel_region() 2025-05-07T20:11:04.9889137Z U at::init_num_threads() 2025-05-07T20:11:04.9889281Z U at::internal::set_thread_num(int) 2025-05-07T20:11:04.9889886Z U at::native::empty_like(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:04.9890302Z U at::native::resize_(at::Tensor const&, c10::ArrayRef, std::optional) 2025-05-07T20:11:04.9890502Z U c10::BFloat16* at::TensorBase::data_ptr() const 2025-05-07T20:11:04.9890779Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:04.9890929Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:11:04.9891086Z U 
c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:04.9891241Z U c10::ParallelGuard::ParallelGuard(bool) 2025-05-07T20:11:04.9891395Z U c10::ParallelGuard::~ParallelGuard() 2025-05-07T20:11:04.9891545Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:04.9891681Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:04.9891850Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:11:04.9892002Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:04.9892106Z U c10::TensorType::get() 2025-05-07T20:11:04.9892224Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:04.9892940Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:11:04.9893097Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:11:04.9893230Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:11:04.9893384Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:11:04.9893498Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:11:04.9893610Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:11:04.9893742Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:11:04.9894013Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:11:04.9894115Z U c10::cuda::device_count() 2025-05-07T20:11:04.9894260Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:11:04.9894398Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:11:04.9894540Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:11:04.9894712Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:11:04.9894866Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:11:04.9894978Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:11:04.9895491Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:04.9895733Z U 
c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:04.9896214Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:04.9896561Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:04.9896679Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:11:04.9896782Z U c10::impl::GPUTrace::haveState 2025-05-07T20:11:04.9896948Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:11:04.9897100Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:11:04.9897214Z U c10::impl::device_guard_impl_registry 2025-05-07T20:11:04.9897374Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:11:04.9897499Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:11:04.9897609Z U c10::throwNullDataPtrError() 2025-05-07T20:11:04.9897735Z U c10::warn(c10::Warning const&) 2025-05-07T20:11:04.9897840Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:11:04.9898026Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:04.9898155Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:11:04.9898286Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:11:04.9898405Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:11:04.9898552Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:11:04.9898662Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:11:04.9898786Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:11:04.9898888Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:11:04.9899018Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:11:04.9899136Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:11:04.9899255Z U cudaFuncGetAttributes@libcudart.so.12 2025-05-07T20:11:04.9899486Z U cudaGetDevice@libcudart.so.12 2025-05-07T20:11:04.9899603Z U cudaGetDeviceCount@libcudart.so.12 
2025-05-07T20:11:04.9899718Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:11:04.9899997Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:11:04.9900161Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:11:04.9900485Z U cudaOccupancyMaxActiveBlocksPerMultiprocessorWithFlags@libcudart.so.12 2025-05-07T20:11:04.9900604Z U cudaPeekAtLastError@libcudart.so.12 2025-05-07T20:11:04.9900815Z U cudaSetDevice@libcudart.so.12 2025-05-07T20:11:04.9900929Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:11:04.9901059Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:11:04.9901229Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:11:04.9901378Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:11:04.9901511Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:11:04.9901708Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:04.9901919Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:11:04.9902020Z U memcpy@GLIBC_2.14 2025-05-07T20:11:04.9902129Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:04.9902246Z U memset@GLIBC_2.2.5 2025-05-07T20:11:04.9902346Z U omp_get_num_threads 2025-05-07T20:11:04.9902447Z U omp_get_thread_num 2025-05-07T20:11:04.9902621Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:04.9902755Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:04.9903118Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:04.9903552Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:04.9903903Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const &@GLIBCXX_3.4.29 2025-05-07T20:11:04.9904057Z U std::__exception_ptr::exception_ptr::_M_addref() 2025-05-07T20:11:04.9904230Z U std::__exception_ptr::exception_ptr::_M_release() 2025-05-07T20:11:04.9904351Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:04.9904496Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 
2025-05-07T20:11:04.9904662Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:04.9904912Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:04.9905275Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:11:04.9905903Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:04.9906038Z U std::current_exception()@CXXABI_1.3.3 2025-05-07T20:11:04.9906164Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:04.9906314Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:04.9906432Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:11:04.9906558Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:04.9906697Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:11:04.9906822Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:04.9907017Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:04.9907267Z U std::rethrow_exception(std::__exception_ptr::exception_ptr)@CXXABI_1.3.3 2025-05-07T20:11:04.9907370Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:04.9907502Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:04.9908164Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:04.9908662Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:04.9908933Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:04.9909353Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:04.9909512Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:04.9909869Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:04.9910050Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 
2025-05-07T20:11:04.9910435Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:04.9910776Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:04.9911006Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:11:04.9911241Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:04.9911358Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:04.9911482Z w _ITM_registerTMCloneTable 2025-05-07T20:11:04.9911593Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:04.9911688Z w __gmon_start__ 2025-05-07T20:11:04.9911810Z w __pthread_key_create 2025-05-07T20:11:04.9911965Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:04.9912325Z + ldd ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:11:04.9912332Z 2025-05-07T20:11:04.9924569Z linux-vdso.so.1 (0x00007ffe17dfe000) 2025-05-07T20:11:04.9925002Z libc10.so => not found 2025-05-07T20:11:04.9925124Z libc10_cuda.so => not found 2025-05-07T20:11:04.9925235Z libtorch.so => not found 2025-05-07T20:11:04.9925409Z libtorch_cpu.so => not found 2025-05-07T20:11:04.9925843Z libtorch_cuda.so => not found 2025-05-07T20:11:04.9925969Z libcudart.so.12 => not found 2025-05-07T20:11:04.9926165Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007fa6bf380000) 2025-05-07T20:11:04.9926375Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fa6bf352000) 2025-05-07T20:11:04.9926517Z libc.so.6 => /lib64/libc.so.6 (0x00007fa6bf14a000) 2025-05-07T20:11:04.9926656Z /lib64/ld-linux-x86-64.so.2 (0x00007fa6bf79a000) 2025-05-07T20:11:04.9926814Z libm.so.6 => /lib64/libm.so.6 (0x00007fa6bf06f000) 2025-05-07T20:11:04.9926830Z 2025-05-07T20:11:04.9927309Z [CHECK] Displaying ELF information: 2025-05-07T20:11:04.9927662Z + readelf -d ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:11:04.9927669Z 2025-05-07T20:11:04.9959686Z 2025-05-07T20:11:04.9960424Z Dynamic section at offset 
0x179670 contains 38 entries: 2025-05-07T20:11:04.9960849Z Tag Type Name/Value 2025-05-07T20:11:04.9961460Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:04.9962056Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:04.9962656Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:04.9963284Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:04.9963903Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:04.9964499Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:11:04.9965114Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:04.9965901Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:04.9966624Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:04.9966861Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:11:04.9967125Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_sparse_async_cumsum.so] 2025-05-07T20:11:04.9967318Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:11:04.9967468Z 0x000000000000000c (INIT) 0xc000 2025-05-07T20:11:04.9967601Z 0x000000000000000d (FINI) 0x237dc 2025-05-07T20:11:04.9967725Z 0x0000000000000019 (INIT_ARRAY) 0x1792c0 2025-05-07T20:11:04.9967852Z 0x000000000000001b (INIT_ARRAYSZ) 32 (bytes) 2025-05-07T20:11:04.9968001Z 0x000000000000001a (FINI_ARRAY) 0x1792e0 2025-05-07T20:11:04.9968156Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:04.9968267Z 0x0000000000000004 (HASH) 0x238 2025-05-07T20:11:04.9968405Z 0x000000006ffffef5 (GNU_HASH) 0x10f8 2025-05-07T20:11:04.9968521Z 0x0000000000000005 (STRTAB) 0x38a8 2025-05-07T20:11:04.9968628Z 0x0000000000000006 (SYMTAB) 0x1a00 2025-05-07T20:11:04.9968764Z 0x000000000000000a (STRSZ) 24404 (bytes) 2025-05-07T20:11:04.9968905Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:04.9969022Z 
0x0000000000000003 (PLTGOT) 0x179910 2025-05-07T20:11:04.9969161Z 0x0000000000000002 (PLTRELSZ) 3864 (bytes) 2025-05-07T20:11:04.9969286Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:04.9969395Z 0x0000000000000017 (JMPREL) 0xaba8 2025-05-07T20:11:04.9969500Z 0x0000000000000007 (RELA) 0x9ba0 2025-05-07T20:11:04.9969625Z 0x0000000000000008 (RELASZ) 4104 (bytes) 2025-05-07T20:11:04.9969764Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:04.9969879Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:04.9970009Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:04.9970158Z 0x000000006ffffffe (VERNEED) 0x9a90 2025-05-07T20:11:04.9970273Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:11:04.9970381Z 0x000000006ffffff0 (VERSYM) 0x97fc 2025-05-07T20:11:04.9970503Z 0x000000006ffffff9 (RELACOUNT) 7 2025-05-07T20:11:04.9970607Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:04.9970612Z 2025-05-07T20:11:04.9970733Z ################################################################################ 2025-05-07T20:11:04.9970738Z 2025-05-07T20:11:04.9970742Z 2025-05-07T20:11:04.9970867Z ################################################################################ 2025-05-07T20:11:04.9971207Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:11:04.9971321Z [CHECK] Listing out library size: 2025-05-07T20:11:04.9971664Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:11:04.9971672Z 2025-05-07T20:11:04.9974184Z 8 ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:11:04.9974189Z 2025-05-07T20:11:04.9974981Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:11:04.9975561Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so | grep GLIBC_ | sed 
's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:04.9975566Z 2025-05-07T20:11:05.0409521Z GLIBC_2.2.5 2025-05-07T20:11:05.0409759Z GLIBC_2.3 2025-05-07T20:11:05.0410598Z GLIBC_2.14 2025-05-07T20:11:05.0410632Z 2025-05-07T20:11:05.0410647Z 2025-05-07T20:11:05.0412125Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:11:05.0414211Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:05.0414339Z 2025-05-07T20:11:05.0834829Z GLIBCXX_3.4 2025-05-07T20:11:05.0835036Z GLIBCXX_3.4.9 2025-05-07T20:11:05.0835653Z GLIBCXX_3.4.11 2025-05-07T20:11:05.0835809Z GLIBCXX_3.4.15 2025-05-07T20:11:05.0835905Z GLIBCXX_3.4.18 2025-05-07T20:11:05.0836023Z GLIBCXX_3.4.20 2025-05-07T20:11:05.0836126Z GLIBCXX_3.4.21 2025-05-07T20:11:05.0836431Z GLIBCXX_3.4.29 2025-05-07T20:11:05.0836439Z 2025-05-07T20:11:05.0836443Z 2025-05-07T20:11:05.0857348Z + nm -gDC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so > /tmp/tmp.tlPhG4bEGq.symbols.txt 2025-05-07T20:11:05.0857379Z 2025-05-07T20:11:05.1240230Z 2025-05-07T20:11:05.1266769Z [CHECK] Total Number of symbols: 4265 2025-05-07T20:11:05.1297391Z [CHECK] Number of fbgemm symbols: 4 2025-05-07T20:11:05.1313057Z + nm -gDCu ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so > /tmp/tmp.G73Pncw9K9.usymbols.txt 2025-05-07T20:11:05.1314716Z 2025-05-07T20:11:05.1341372Z 2025-05-07T20:11:05.1367074Z [CHECK] Listing out undefined symbols (190 total): 2025-05-07T20:11:05.1384563Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:05.1385199Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:05.1385556Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:11:05.1385944Z U __cxa_atexit@GLIBC_2.2.5 
2025-05-07T20:11:05.1386268Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:05.1386658Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:05.1387020Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:11:05.1387388Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:11:05.1387720Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:05.1388085Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:05.1388415Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:05.1388763Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:11:05.1389091Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:05.1389456Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:11:05.1389856Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:11:05.1390314Z U at::RecordFunction::currentThreadId() 2025-05-07T20:11:05.1390704Z U at::RecordFunction::end() 2025-05-07T20:11:05.1391030Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:11:05.1391427Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:11:05.1392008Z U at::Tensor fbgemm_gpu::reshape_vbe_offsets(at::Tensor const&, at::Tensor const&, long, int) 2025-05-07T20:11:05.1392756Z U at::Tensor fbgemm_gpu::reshape_vbe_offsets(at::Tensor const&, at::Tensor const&, long, int) 2025-05-07T20:11:05.1393528Z U at::_ops::clamp::call(at::Tensor const&, std::optional const&, std::optional const&) 2025-05-07T20:11:05.1394198Z U at::_ops::clone::call(at::Tensor const&, std::optional) 2025-05-07T20:11:05.1395238Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:05.1396327Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:11:05.1396813Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:11:05.1397274Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:11:05.1397704Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:11:05.1398514Z U at::sequence_number::get_and_increment() 2025-05-07T20:11:05.1398900Z 
U c10::AnyType::get() 2025-05-07T20:11:05.1399176Z U c10::BoolType::get() 2025-05-07T20:11:05.1399529Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:05.1399957Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:11:05.1400370Z U c10::Dispatcher::realSingleton() 2025-05-07T20:11:05.1401296Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:11:05.1402513Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:11:05.1403666Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:05.1404260Z U c10::Error::what() const 2025-05-07T20:11:05.1404568Z U c10::FloatType::get() 2025-05-07T20:11:05.1404893Z U c10::GradMode::is_enabled() 2025-05-07T20:11:05.1405208Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:11:05.1405594Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:11:05.1405998Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:11:05.1406320Z U c10::IValue::isBoolList() const 2025-05-07T20:11:05.1406671Z U c10::IValue::isDoubleList() const 2025-05-07T20:11:05.1406983Z U c10::IValue::isIntList() const 2025-05-07T20:11:05.1407330Z U c10::IValue::isSymIntList() const 2025-05-07T20:11:05.1407654Z U c10::IValue::isTensorList() const 2025-05-07T20:11:05.1408036Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:05.1408410Z U c10::IntType::get() 2025-05-07T20:11:05.1409065Z U c10::ListType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:05.1409824Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:11:05.1410224Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:11:05.1410594Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:11:05.1410969Z U 
c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:11:05.1411407Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:05.1412034Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:11:05.1412517Z U c10::StringType::get() 2025-05-07T20:11:05.1412884Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:11:05.1413295Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:05.1413690Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:11:05.1414114Z U c10::SymFloat::operator/(c10::SymFloat const&) const 2025-05-07T20:11:05.1414770Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:05.1415428Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:05.1415824Z U c10::SymInt::operator c10::SymFloat() const 2025-05-07T20:11:05.1416197Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:11:05.1416579Z U c10::SymInt::promote_to_negative() 2025-05-07T20:11:05.1416928Z U c10::SymInt::sym_le(c10::SymInt const&) const 2025-05-07T20:11:05.1428980Z U c10::SymInt::toSymNode() const 2025-05-07T20:11:05.1429360Z U c10::SymIntType::get() 2025-05-07T20:11:05.1429757Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:11:05.1430111Z U c10::TensorType::get() 2025-05-07T20:11:05.1430464Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:05.1431214Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:05.1432326Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:05.1433252Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:05.1434218Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:05.1435207Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 
2025-05-07T20:11:05.1436298Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:05.1437373Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:11:05.1438011Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:11:05.1438475Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:11:05.1438870Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:11:05.1439560Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:05.1440219Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:11:05.1440629Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:11:05.1441092Z U c10::operator<<(std::ostream&, c10::SymFloat const&) 2025-05-07T20:11:05.1441510Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:11:05.1441998Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:05.1442465Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:11:05.1442967Z U fbgemm_gpu::config::is_feature_enabled(fbgemm_gpu::config::FeatureGateName const&) 2025-05-07T20:11:05.1443671Z U fbgemm_gpu::reshape_vbe_output(at::Tensor const&, long, at::Tensor const&, at::Tensor const&) 2025-05-07T20:11:05.1444285Z U free@GLIBC_2.2.5 2025-05-07T20:11:05.1444661Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:05.1445030Z U memcmp@GLIBC_2.2.5 2025-05-07T20:11:05.1445317Z U memcpy@GLIBC_2.14 2025-05-07T20:11:05.1445605Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:05.1445869Z U memset@GLIBC_2.2.5 2025-05-07T20:11:05.1446211Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:05.1446576Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:05.1446907Z U realloc@GLIBC_2.2.5 2025-05-07T20:11:05.1447299Z U std::_Hash_bytes(void const*, unsigned long, unsigned 
long)@CXXABI_1.3.5 2025-05-07T20:11:05.1447958Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:05.1448786Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:05.1449630Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const &@GLIBCXX_3.4.29 2025-05-07T20:11:05.1450440Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:11:05.1451259Z U std::__cxx11::basic_stringstream, std::allocator >::~basic_stringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:05.1452108Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:11:05.1452714Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:05.1453040Z U std::__throw_bad_array_new_length() 2025-05-07T20:11:05.1453420Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:05.1453819Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:05.1454252Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:11:05.1454679Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:11:05.1455045Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:11:05.1455549Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:05.1456225Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:11:05.1457250Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:05.1458424Z U std::basic_ostream >& std::operator<< >(std::basic_ostream >&, char const*)@GLIBCXX_3.4 2025-05-07T20:11:05.1459171Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:11:05.1459656Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:05.1460222Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:05.1460591Z U std::ios_base::ios_base()@GLIBCXX_3.4 
2025-05-07T20:11:05.1461039Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:05.1461398Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:11:05.1461771Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:05.1462202Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:05.1462772Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:05.1463284Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:05.1463705Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:11:05.1464154Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:11:05.1464872Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:11:05.1465588Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:11:05.1465971Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:05.1466309Z U strcmp@GLIBC_2.2.5 2025-05-07T20:11:05.1466618Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:05.1466931Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:05.1467817Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:05.1469105Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:05.1470019Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:05.1470564Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:11:05.1471123Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:11:05.1471771Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:11:05.1472316Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:11:05.1472867Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:11:05.1473535Z U 
torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:11:05.1474225Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:11:05.1474692Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:11:05.1475226Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:11:05.1475681Z U torch::autograd::Node::assign_parent() 2025-05-07T20:11:05.1476026Z U torch::autograd::Node::metadata() 2025-05-07T20:11:05.1476416Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:11:05.1476929Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:11:05.1477602Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:11:05.1478168Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:11:05.1478653Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:11:05.1479245Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:11:05.1482826Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:11:05.1486112Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:11:05.1486572Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:11:05.1487025Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:11:05.1488155Z U torch::autograd::profiler::record_function_enter_new(std::__cxx11::basic_string, std::allocator > const&, std::optional, std::allocator > > const&) 2025-05-07T20:11:05.1489267Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:11:05.1489979Z U 
torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:11:05.1490916Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:05.1491528Z U typeinfo for c10::Error 2025-05-07T20:11:05.1491982Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:11:05.1492446Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:11:05.1492828Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:11:05.1493226Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:11:05.1493603Z U typeinfo for torch::autograd::Node 2025-05-07T20:11:05.1494063Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:05.1494502Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:05.1494959Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:11:05.1495415Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:05.1495789Z U vtable for c10::Error 2025-05-07T20:11:05.1496416Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:05.1497242Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:05.1497856Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:11:05.1498330Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:11:05.1498884Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:05.1499457Z U vtable for torch::autograd::Node 2025-05-07T20:11:05.1499876Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:11:05.1500307Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:05.1500665Z w _ITM_registerTMCloneTable 2025-05-07T20:11:05.1500985Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:05.1501317Z w __gmon_start__ 2025-05-07T20:11:05.1501598Z w __pthread_key_create 2025-05-07T20:11:05.1501935Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:11:05.1502269Z w pthread_mutex_unlock@GLIBC_2.2.5 
2025-05-07T20:11:05.1502676Z [CHECK] Listing out external shared libraries linked:
2025-05-07T20:11:05.1503201Z + ldd ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so
2025-05-07T20:11:05.1503606Z
2025-05-07T20:11:05.1503760Z linux-vdso.so.1 (0x00007ffd25b32000)
2025-05-07T20:11:05.1504074Z libc10.so => not found
2025-05-07T20:11:05.1504735Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_common.so (0x00007fdad7c00000)
2025-05-07T20:11:05.1505442Z libtorch.so => not found
2025-05-07T20:11:05.1506081Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_config.so (0x00007fdad876c000)
2025-05-07T20:11:05.1506757Z libtorch_cpu.so => not found
2025-05-07T20:11:05.1507074Z libtorch_cuda.so => not found
2025-05-07T20:11:05.1507424Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007fdad799c000)
2025-05-07T20:11:05.1507875Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fdad873c000)
2025-05-07T20:11:05.1508276Z libc.so.6 => /lib64/libc.so.6 (0x00007fdad7794000)
2025-05-07T20:11:05.1508673Z /lib64/ld-linux-x86-64.so.2 (0x00007fdad877c000)
2025-05-07T20:11:05.1509017Z libc10.so => not found
2025-05-07T20:11:05.1509277Z libc10_cuda.so => not found
2025-05-07T20:11:05.1509837Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm.so (0x00007fdad7200000)
2025-05-07T20:11:05.1510408Z libtorch.so => not found
2025-05-07T20:11:05.1510699Z libtorch_cpu.so => not found
2025-05-07T20:11:05.1510970Z libtorch_cuda.so => not found
2025-05-07T20:11:05.1511270Z libcudart.so.12 => not found
2025-05-07T20:11:05.1511545Z libc10.so => not found
2025-05-07T20:11:05.1511819Z libtorch_cpu.so => not found
2025-05-07T20:11:05.1512271Z libtorch_cuda.so => not found
2025-05-07T20:11:05.1512549Z libtorch.so => not found
2025-05-07T20:11:05.1512853Z libm.so.6 => /lib64/libm.so.6 (0x00007fdad7125000)
2025-05-07T20:11:05.1513159Z libc10.so => not found
2025-05-07T20:11:05.1513669Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/asmjit.so (0x00007fdad7f85000)
2025-05-07T20:11:05.1514203Z libtorch.so => not found
2025-05-07T20:11:05.1514480Z libtorch_cpu.so => not found
2025-05-07T20:11:05.1514767Z libtorch_cuda.so => not found
2025-05-07T20:11:05.1515040Z libtorch_cpu.so => not found
2025-05-07T20:11:05.1515323Z libtorch_cuda.so => not found
2025-05-07T20:11:05.1515577Z libtorch.so => not found
2025-05-07T20:11:05.1515734Z
2025-05-07T20:11:05.1515865Z [CHECK] Displaying ELF information:
2025-05-07T20:11:05.1516327Z + readelf -d ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so
2025-05-07T20:11:05.1516757Z
2025-05-07T20:11:05.1516761Z
2025-05-07T20:11:05.1516909Z Dynamic section at offset 0x701230 contains 38 entries:
2025-05-07T20:11:05.1517270Z Tag Type Name/Value
2025-05-07T20:11:05.1517685Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so]
2025-05-07T20:11:05.1518209Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_common.so]
2025-05-07T20:11:05.1518728Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so]
2025-05-07T20:11:05.1519248Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_config.so]
2025-05-07T20:11:05.1519756Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so]
2025-05-07T20:11:05.1520277Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so]
2025-05-07T20:11:05.1520773Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6]
2025-05-07T20:11:05.1521286Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1]
2025-05-07T20:11:05.1521789Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6]
2025-05-07T20:11:05.1522284Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2]
2025-05-07T20:11:05.1522884Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward_pt2.so]
2025-05-07T20:11:05.1523433Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN]
2025-05-07T20:11:05.1523833Z 0x000000000000000c (INIT) 0x178000
2025-05-07T20:11:05.1524168Z 0x000000000000000d (FINI) 0x65b3d8
2025-05-07T20:11:05.1524492Z 0x0000000000000019 (INIT_ARRAY) 0x6fcd78
2025-05-07T20:11:05.1524853Z 0x000000000000001b (INIT_ARRAYSZ) 256 (bytes)
2025-05-07T20:11:05.1525178Z 0x000000000000001a (FINI_ARRAY) 0x6fce78
2025-05-07T20:11:05.1525523Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes)
2025-05-07T20:11:05.1525845Z 0x0000000000000004 (HASH) 0x200
2025-05-07T20:11:05.1526185Z 0x000000006ffffef5 (GNU_HASH) 0x6490
2025-05-07T20:11:05.1526510Z 0x0000000000000005 (STRTAB) 0x25438
2025-05-07T20:11:05.1526854Z 0x0000000000000006 (SYMTAB) 0xc448
2025-05-07T20:11:05.1527208Z 0x000000000000000a (STRSZ) 1180638 (bytes)
2025-05-07T20:11:05.1527563Z 0x000000000000000b (SYMENT) 24 (bytes)
2025-05-07T20:11:05.1527916Z 0x0000000000000003 (PLTGOT) 0x7024d0
2025-05-07T20:11:05.1528256Z 0x0000000000000002 (PLTRELSZ) 20976 (bytes)
2025-05-07T20:11:05.1528615Z 0x0000000000000014 (PLTREL) RELA
2025-05-07T20:11:05.1528939Z 0x0000000000000017 (JMPREL) 0x171f98
2025-05-07T20:11:05.1529277Z 0x0000000000000007 (RELA) 0x147aa0
2025-05-07T20:11:05.1529616Z 0x0000000000000008 (RELASZ) 173304 (bytes)
2025-05-07T20:11:05.1529989Z 0x0000000000000009 (RELAENT) 24 (bytes)
2025-05-07T20:11:05.1530323Z 0x0000000000000018 (BIND_NOW)
2025-05-07T20:11:05.1530632Z 0x000000006ffffffb (FLAGS_1) Flags: NOW
2025-05-07T20:11:05.1531021Z 0x000000006ffffffe (VERNEED) 0x147970
2025-05-07T20:11:05.1531378Z 0x000000006fffffff (VERNEEDNUM) 4
2025-05-07T20:11:05.1531721Z 0x000000006ffffff0 (VERSYM) 0x145816
2025-05-07T20:11:05.1532042Z 0x000000006ffffff9 (RELACOUNT) 34
2025-05-07T20:11:05.1532364Z 0x0000000000000000 (NULL) 0x0
2025-05-07T20:11:05.1532558Z
2025-05-07T20:11:05.1532715Z ################################################################################
2025-05-07T20:11:05.1532930Z
2025-05-07T20:11:05.1532934Z
2025-05-07T20:11:05.1533041Z ################################################################################
2025-05-07T20:11:05.1533535Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_inference.so
2025-05-07T20:11:05.1533997Z [CHECK] Listing out library size:
2025-05-07T20:11:05.1534466Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_inference.so
2025-05-07T20:11:05.1534829Z
2025-05-07T20:11:05.1535044Z 432 ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_inference.so
2025-05-07T20:11:05.1535344Z
2025-05-07T20:11:05.1535720Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_inference.so
2025-05-07T20:11:05.1536672Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_inference.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat
2025-05-07T20:11:05.1537233Z
2025-05-07T20:11:05.1868960Z GLIBC_2.2.5
2025-05-07T20:11:05.1869599Z GLIBC_2.3
2025-05-07T20:11:05.1870210Z GLIBC_2.14
2025-05-07T20:11:05.1870543Z
2025-05-07T20:11:05.1870555Z
2025-05-07T20:11:05.1871828Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_inference.so
2025-05-07T20:11:05.1874974Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_inference.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat
2025-05-07T20:11:05.1876883Z
2025-05-07T20:11:05.2257059Z GLIBCXX_3.4
2025-05-07T20:11:05.2257731Z GLIBCXX_3.4.9
2025-05-07T20:11:05.2258381Z GLIBCXX_3.4.11
2025-05-07T20:11:05.2258978Z GLIBCXX_3.4.14
2025-05-07T20:11:05.2259843Z GLIBCXX_3.4.18
2025-05-07T20:11:05.2260413Z GLIBCXX_3.4.20
2025-05-07T20:11:05.2260998Z GLIBCXX_3.4.21
2025-05-07T20:11:05.2261560Z GLIBCXX_3.4.29
2025-05-07T20:11:05.2261932Z
2025-05-07T20:11:05.2261947Z
2025-05-07T20:11:05.2277854Z + nm -gDC
./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_inference.so > /tmp/tmp.6d5EZ9ghAY.symbols.txt 2025-05-07T20:11:05.2279366Z 2025-05-07T20:11:05.2629028Z 2025-05-07T20:11:05.2657236Z [CHECK] Total Number of symbols: 4997 2025-05-07T20:11:05.2683206Z [CHECK] Number of fbgemm symbols: 3788 2025-05-07T20:11:05.2700285Z + nm -gDCu ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_inference.so > /tmp/tmp.0njsX1MnfM.usymbols.txt 2025-05-07T20:11:05.2700864Z 2025-05-07T20:11:05.2732983Z 2025-05-07T20:11:05.2760048Z [CHECK] Listing out undefined symbols (258 total): 2025-05-07T20:11:05.2776535Z U GOMP_parallel 2025-05-07T20:11:05.2778251Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:05.2781100Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:05.2783456Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:05.2783849Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:11:05.2784271Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:11:05.2784668Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:11:05.2785064Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:11:05.2785465Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:11:05.2785845Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:11:05.2786414Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:11:05.2786857Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:11:05.2787203Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:05.2787546Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:05.2787890Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:05.2788340Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:11:05.2788707Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:11:05.2789020Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:05.2789349Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:05.2789654Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:05.2789972Z U __cxa_throw@CXXABI_1.3 
2025-05-07T20:11:05.2790270Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:05.2790643Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:11:05.2791038Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:11:05.2791861Z U at::_ops::arange_start::call(c10::Scalar const&, c10::Scalar const&, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:05.2793076Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:05.2794383Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:05.2795292Z U at::_ops::fill__Scalar::call(at::Tensor&, c10::Scalar const&) 2025-05-07T20:11:05.2796047Z U at::_ops::scalar_tensor::call(c10::Scalar const&, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:05.2796806Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:11:05.2797389Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:11:05.2798414Z U at::_ops::to_dtype_layout::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, bool, bool, std::optional) 2025-05-07T20:11:05.2799829Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:05.2800527Z U at::detail::getCUDAHooks() 2025-05-07T20:11:05.2800866Z U at::detail::getHIPHooks() 2025-05-07T20:11:05.2801188Z U at::get_num_threads() 2025-05-07T20:11:05.2801507Z U at::get_thread_num() 2025-05-07T20:11:05.2801825Z U at::globalContext() 2025-05-07T20:11:05.2802120Z U at::in_parallel_region() 2025-05-07T20:11:05.2802443Z U at::init_num_threads() 2025-05-07T20:11:05.2802772Z U at::internal::set_thread_num(int) 2025-05-07T20:11:05.2803179Z U c10::AutogradMetaInterface::~AutogradMetaInterface() 2025-05-07T20:11:05.2803651Z U 
c10::BFloat16* at::TensorBase::data_ptr() const 2025-05-07T20:11:05.2804154Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:05.2804632Z U c10::ClassType::addMethod(torch::jit::Function*) 2025-05-07T20:11:05.2805365Z U c10::ClassType::getMethod(std::__cxx11::basic_string, std::allocator > const&) const 2025-05-07T20:11:05.2806038Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:05.2807013Z U c10::DictType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:05.2808104Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:05.2808728Z U c10::Error::what() const 2025-05-07T20:11:05.2809034Z U c10::GradMode::is_enabled() 2025-05-07T20:11:05.2809366Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:11:05.2809749Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:11:05.2810165Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:05.2810650Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:11:05.2811022Z U c10::IValue::is(c10::IValue const&) const 2025-05-07T20:11:05.2811383Z U c10::IValue::isTensorList() const 2025-05-07T20:11:05.2811728Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:05.2812105Z U c10::IntType::get() 2025-05-07T20:11:05.2812781Z U c10::ListType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:05.2813501Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:11:05.2813903Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:11:05.2814198Z U c10::NoneType::get() 2025-05-07T20:11:05.2814609Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:05.2815063Z U c10::ParallelGuard::ParallelGuard(bool) 2025-05-07T20:11:05.2815398Z U c10::ParallelGuard::~ParallelGuard() 2025-05-07T20:11:05.2815773Z U 
c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:05.2816123Z U c10::StringType::get() 2025-05-07T20:11:05.2816458Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:05.2817087Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:05.2817709Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:05.2818063Z U c10::SymInt::promote_to_negative() 2025-05-07T20:11:05.2818417Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:11:05.2819094Z U c10::TensorImpl::set_autograd_meta(std::unique_ptr >) 2025-05-07T20:11:05.2820016Z U c10::TensorType::get() 2025-05-07T20:11:05.2821035Z U c10::TupleType::TupleType(std::vector, std::allocator > >, std::optional, std::shared_ptr) 2025-05-07T20:11:05.2822099Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:05.2823080Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:11:05.2824147Z U c10::_fastEqualsForContainer(c10::IValue const&, c10::IValue const&) 2025-05-07T20:11:05.2824623Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:11:05.2824996Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:11:05.2825356Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:11:05.2825768Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:11:05.2826136Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:11:05.2826490Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:11:05.2826977Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:11:05.2827478Z U c10::cuda::device_count() 2025-05-07T20:11:05.2827874Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:11:05.2828268Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:11:05.2828680Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:11:05.2829085Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:11:05.2829511Z U 
c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:11:05.2829941Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:11:05.2830735Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:05.2831754Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:05.2833354Z U c10::detail::infer_schema::make_function_schema(std::__cxx11::basic_string, std::allocator >&&, std::__cxx11::basic_string, std::allocator >&&, c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:05.2834674Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:05.2835512Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:05.2836420Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:05.2837398Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:05.2838227Z U c10::getCustomClassTypeImpl(std::type_index const&) 2025-05-07T20:11:05.2838577Z U c10::get_default_dtype() 2025-05-07T20:11:05.2839031Z U c10::impl::ExcludeDispatchKeyGuard::ExcludeDispatchKeyGuard(c10::DispatchKeySet) 2025-05-07T20:11:05.2839594Z U c10::impl::ExcludeDispatchKeyGuard::~ExcludeDispatchKeyGuard() 2025-05-07T20:11:05.2839993Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:11:05.2840319Z U c10::impl::GPUTrace::haveState 2025-05-07T20:11:05.2840641Z U c10::impl::PyObjectSlot::~PyObjectSlot() 2025-05-07T20:11:05.2840985Z U c10::impl::device_guard_impl_registry 2025-05-07T20:11:05.2841560Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:05.2842167Z U c10::ivalue::Future::extractStorages(c10::IValue const&) 2025-05-07T20:11:05.2842578Z U 
c10::ivalue::Object::resizeObject(unsigned long) 2025-05-07T20:11:05.2843038Z U c10::ivalue::checkCustomClassType(c10::ClassType const*, c10::Type const*) 2025-05-07T20:11:05.2843515Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:11:05.2843898Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:11:05.2844279Z U c10::operator<<(std::ostream&, c10::FunctionSchema const&) 2025-05-07T20:11:05.2844653Z U c10::warn(c10::Warning const&) 2025-05-07T20:11:05.2845059Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:05.2845510Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:11:05.2845858Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:11:05.2846208Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:11:05.2846568Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:11:05.2846924Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:11:05.2847272Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:11:05.2847591Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:11:05.2847935Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:11:05.2848269Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:11:05.2849913Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:11:05.2850244Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:11:05.2850564Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:11:05.2850908Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:11:05.2851244Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:11:05.2852223Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:05.2853863Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:05.2855536Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, 
int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:05.2857192Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:05.2858857Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:05.2860958Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:05.2862689Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMFP8WithStrides(long, bool, bool, bool, long, long, int, int, bool) 2025-05-07T20:11:05.2864381Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:11:05.2866184Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:05.2868054Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:11:05.2870004Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:05.2871893Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMFP8WithStrides(long, bool, bool, bool, long, long, int, int, bool) 2025-05-07T20:11:05.2873773Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:11:05.2875733Z U fbgemm::EmbeddingSpMDMKernelSignature::Type 
fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:05.2877696Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMFP8WithStrides(long, bool, bool, bool, long, long, int, int, bool) 2025-05-07T20:11:05.2879582Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:11:05.2881428Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:05.2883521Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:11:05.2885501Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:05.2887384Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMFP8WithStrides(long, bool, bool, bool, long, long, int, int, bool) 2025-05-07T20:11:05.2889288Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:11:05.2891292Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:05.2893231Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:05.2895249Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 
2025-05-07T20:11:05.2897194Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:05.2899184Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:05.2901488Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:05.2903532Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:05.2904795Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:11:05.2905252Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:05.2905683Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:11:05.2906073Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:05.2906775Z U linearize_cache_indices_cuda(at::Tensor const&, at::Tensor const&, at::Tensor const&, std::optional const&, long, long) 2025-05-07T20:11:05.2907483Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:05.2907917Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:11:05.2908320Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:05.2909174Z U lru_cache_populate_byte_cuda(at::Tensor, at::Tensor, long, at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor, long, at::Tensor, long, bool, std::optional) 2025-05-07T20:11:05.2910324Z U lxu_cache_lookup_cuda(at::Tensor, at::Tensor, long, bool, std::optional, std::optional, std::optional) 2025-05-07T20:11:05.2910981Z U memchr@GLIBC_2.2.5 2025-05-07T20:11:05.2911259Z U memcpy@GLIBC_2.14 2025-05-07T20:11:05.2911546Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:05.2911831Z U memset@GLIBC_2.2.5 
2025-05-07T20:11:05.2912237Z U omp_get_num_threads 2025-05-07T20:11:05.2912501Z U omp_get_thread_num 2025-05-07T20:11:05.2912821Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:05.2913177Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:05.2913594Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:11:05.2914239Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:05.2915031Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:05.2915916Z U std::__cxx11::basic_stringbuf, std::allocator >::_M_sync(char*, unsigned long, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:11:05.2916760Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const &@GLIBCXX_3.4.29 2025-05-07T20:11:05.2917491Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:11:05.2918422Z U std::__cxx11::basic_stringstream, std::allocator >::~basic_stringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:05.2919241Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:11:05.2919858Z U std::__exception_ptr::exception_ptr::_M_addref() 2025-05-07T20:11:05.2920251Z U std::__exception_ptr::exception_ptr::_M_release() 2025-05-07T20:11:05.2920594Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:05.2920914Z U std::__throw_bad_array_new_length() 2025-05-07T20:11:05.2921254Z U std::__throw_bad_function_call()@GLIBCXX_3.4.14 2025-05-07T20:11:05.2921630Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:05.2921995Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:05.2922379Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:11:05.2922778Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:11:05.2923225Z U std::basic_ios 
>::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:05.2923904Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:11:05.2924897Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:05.2926050Z U std::basic_ostream >& std::operator<< >(std::basic_ostream >&, char const*)@GLIBCXX_3.4 2025-05-07T20:11:05.2926810Z U std::condition_variable::condition_variable()@GLIBCXX_3.4.11 2025-05-07T20:11:05.2927247Z U std::condition_variable::notify_all()@GLIBCXX_3.4.11 2025-05-07T20:11:05.2927660Z U std::condition_variable::~condition_variable()@GLIBCXX_3.4.11 2025-05-07T20:11:05.2928057Z U std::current_exception()@CXXABI_1.3.3 2025-05-07T20:11:05.2928395Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:11:05.2928717Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:05.2929047Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:05.2929367Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:11:05.2929688Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:05.2929866Z U std::istream& std::istream::_M_extract(long&)@GLIBCXX_3.4.9 2025-05-07T20:11:05.2929979Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:11:05.2930081Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:05.2930480Z U std::logic_error::logic_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:11:05.2930614Z U std::logic_error::~logic_error()@GLIBCXX_3.4 2025-05-07T20:11:05.2930788Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:05.2931013Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:05.2931140Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:05.2931292Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:11:05.2931417Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:11:05.2931639Z U 
std::rethrow_exception(std::__exception_ptr::exception_ptr)@CXXABI_1.3.3 2025-05-07T20:11:05.2931737Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:05.2931828Z U strcmp@GLIBC_2.2.5 2025-05-07T20:11:05.2931910Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:05.2932030Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:05.2932627Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:05.2933098Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:05.2933339Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:05.2934389Z U torch::detail::class_base::class_base(std::__cxx11::basic_string, std::allocator > const&, std::__cxx11::basic_string, std::allocator > const&, std::__cxx11::basic_string, std::allocator >, std::type_info const&, std::type_info const&) 2025-05-07T20:11:05.2934736Z U torch::detail::class_base::withNewArguments(c10::FunctionSchema const&, std::initializer_list) 2025-05-07T20:11:05.2935103Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:05.2935486Z U torch::registerCustomClassMethod(std::unique_ptr >) 2025-05-07T20:11:05.2935620Z U torch::serialize::InputArchive::InputArchive() 2025-05-07T20:11:05.2935915Z U torch::serialize::InputArchive::load_from(char const*, unsigned long, std::optional) 2025-05-07T20:11:05.2936367Z U torch::serialize::InputArchive::read(std::__cxx11::basic_string, std::allocator > const&, at::Tensor&, bool) 2025-05-07T20:11:05.2936663Z U torch::serialize::OutputArchive::OutputArchive(std::shared_ptr) 2025-05-07T20:11:05.2936817Z U torch::serialize::OutputArchive::save_to(std::ostream&) 2025-05-07T20:11:05.2937297Z U torch::serialize::OutputArchive::write(std::__cxx11::basic_string, std::allocator > const&, at::Tensor const&, bool) 
2025-05-07T20:11:05.2937394Z U typeinfo for c10::Error 2025-05-07T20:11:05.2937512Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:11:05.2937643Z U typeinfo for std::logic_error@GLIBCXX_3.4 2025-05-07T20:11:05.2937760Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:11:05.2937937Z U unsigned char* at::TensorBase::data_ptr() const 2025-05-07T20:11:05.2938145Z U unsigned char* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:05.2938285Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:05.2938433Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:05.2938592Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:05.2938686Z U vtable for c10::Error 2025-05-07T20:11:05.2939017Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:05.2939424Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:05.2939943Z U vtable for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:05.2940152Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:11:05.2940401Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:05.2940530Z U vtable for torch::autograd::AutogradMeta 2025-05-07T20:11:05.2940642Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:05.2940772Z w _ITM_registerTMCloneTable 2025-05-07T20:11:05.2940902Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:05.2940994Z w __gmon_start__ 2025-05-07T20:11:05.2941121Z w __pthread_key_create 2025-05-07T20:11:05.2941242Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:11:05.2941348Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:11:05.2941493Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:05.2941751Z + ldd ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:11:05.2941759Z 2025-05-07T20:11:05.2941904Z linux-vdso.so.1 (0x00007fff745f7000) 2025-05-07T20:11:05.2942010Z 
libc10.so => not found 2025-05-07T20:11:05.2942105Z libc10_cuda.so => not found 2025-05-07T20:11:05.2942474Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm.so (0x00007fa660a00000) 2025-05-07T20:11:05.2942997Z fbgemm_gpu_tbe_cache.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_cache.so (0x00007fa65f200000) 2025-05-07T20:11:05.2943103Z libtorch.so => not found 2025-05-07T20:11:05.2943206Z libtorch_cpu.so => not found 2025-05-07T20:11:05.2943309Z libtorch_cuda.so => not found 2025-05-07T20:11:05.2943426Z libcudart.so.12 => not found 2025-05-07T20:11:05.2943592Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007fa65ef9c000) 2025-05-07T20:11:05.2943745Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fa67c4c0000) 2025-05-07T20:11:05.2943897Z libc.so.6 => /lib64/libc.so.6 (0x00007fa65ed94000) 2025-05-07T20:11:05.2944030Z /lib64/ld-linux-x86-64.so.2 (0x00007fa67c4f4000) 2025-05-07T20:11:05.2944120Z libc10.so => not found 2025-05-07T20:11:05.2944481Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/asmjit.so (0x00007fa67c443000) 2025-05-07T20:11:05.2944593Z libtorch.so => not found 2025-05-07T20:11:05.2944699Z libtorch_cpu.so => not found 2025-05-07T20:11:05.2944801Z libtorch_cuda.so => not found 2025-05-07T20:11:05.2944945Z libm.so.6 => /lib64/libm.so.6 (0x00007fa660925000) 2025-05-07T20:11:05.2945036Z libtorch.so => not found 2025-05-07T20:11:05.2945124Z libc10.so => not found 2025-05-07T20:11:05.2945228Z libc10_cuda.so => not found 2025-05-07T20:11:05.2945336Z libtorch_cpu.so => not found 2025-05-07T20:11:05.2945429Z libtorch_cuda.so => not found 2025-05-07T20:11:05.2945526Z libcudart.so.12 => not found 2025-05-07T20:11:05.2945639Z libtorch_cpu.so => not found 2025-05-07T20:11:05.2945730Z libtorch_cuda.so => not found 2025-05-07T20:11:05.2945822Z libtorch.so => not found 2025-05-07T20:11:05.2945827Z 2025-05-07T20:11:05.2945941Z [CHECK] Displaying ELF 
information: 2025-05-07T20:11:05.2946202Z + readelf -d ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:11:05.2946208Z 2025-05-07T20:11:05.2946212Z 2025-05-07T20:11:05.2946374Z Dynamic section at offset 0x1af13978 contains 40 entries: 2025-05-07T20:11:05.2946492Z Tag Type Name/Value 2025-05-07T20:11:05.2946701Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:05.2946902Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:05.2947093Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm.so] 2025-05-07T20:11:05.2947333Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_cache.so] 2025-05-07T20:11:05.2947525Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:05.2947726Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:05.2947950Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:05.2948165Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:11:05.2948377Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:05.2948623Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:05.2948844Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:05.2949092Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:11:05.2949333Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_inference.so] 2025-05-07T20:11:05.2949548Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:11:05.2949668Z 0x000000000000000c (INIT) 0x19a000 2025-05-07T20:11:05.2949822Z 0x000000000000000d (FINI) 0x7e3f4c 2025-05-07T20:11:05.2949978Z 0x0000000000000019 (INIT_ARRAY) 0x1af13d58 2025-05-07T20:11:05.2950113Z 0x000000000000001b (INIT_ARRAYSZ) 392 (bytes) 2025-05-07T20:11:05.2950246Z 0x000000000000001a (FINI_ARRAY) 0x1af13ee0 2025-05-07T20:11:05.2950399Z 0x000000000000001c (FINI_ARRAYSZ) 8 
(bytes) 2025-05-07T20:11:05.2950542Z 0x0000000000000004 (HASH) 0x238 2025-05-07T20:11:05.2950668Z 0x000000006ffffef5 (GNU_HASH) 0x7048 2025-05-07T20:11:05.2950793Z 0x0000000000000005 (STRTAB) 0x2bee8 2025-05-07T20:11:05.2950939Z 0x0000000000000006 (SYMTAB) 0xea58 2025-05-07T20:11:05.2951092Z 0x000000000000000a (STRSZ) 1363139 (bytes) 2025-05-07T20:11:05.2951225Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:05.2951377Z 0x0000000000000003 (PLTGOT) 0x1af14c38 2025-05-07T20:11:05.2951529Z 0x0000000000000002 (PLTRELSZ) 15648 (bytes) 2025-05-07T20:11:05.2951661Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:05.2951817Z 0x0000000000000017 (JMPREL) 0x195ff8 2025-05-07T20:11:05.2951942Z 0x0000000000000007 (RELA) 0x17b418 2025-05-07T20:11:05.2952213Z 0x0000000000000008 (RELASZ) 109536 (bytes) 2025-05-07T20:11:05.2952334Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:05.2952469Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:05.2952599Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:05.2952724Z 0x000000006ffffffe (VERNEED) 0x17b2b8 2025-05-07T20:11:05.2952868Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:11:05.2952995Z 0x000000006ffffff0 (VERSYM) 0x178bac 2025-05-07T20:11:05.2953109Z 0x000000006ffffff9 (RELACOUNT) 79 2025-05-07T20:11:05.2953218Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:05.2953245Z 2025-05-07T20:11:05.2953368Z ################################################################################ 2025-05-07T20:11:05.2953373Z 2025-05-07T20:11:05.2953377Z 2025-05-07T20:11:05.2953494Z ################################################################################ 2025-05-07T20:11:05.2953867Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:11:05.2953979Z [CHECK] Listing out library size: 2025-05-07T20:11:05.2954315Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 
2025-05-07T20:11:05.2954321Z 2025-05-07T20:11:05.2954590Z 4 ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:11:05.2954620Z 2025-05-07T20:11:05.2955080Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:11:05.2955626Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:05.2955630Z 2025-05-07T20:11:05.3101543Z GLIBC_2.2.5 2025-05-07T20:11:05.3101663Z GLIBC_2.3 2025-05-07T20:11:05.3101747Z GLIBC_2.14 2025-05-07T20:11:05.3101753Z 2025-05-07T20:11:05.3101757Z 2025-05-07T20:11:05.3102307Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:11:05.3103270Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:05.3103341Z 2025-05-07T20:11:05.3322536Z GLIBCXX_3.4 2025-05-07T20:11:05.3323044Z GLIBCXX_3.4.9 2025-05-07T20:11:05.3323359Z GLIBCXX_3.4.11 2025-05-07T20:11:05.3323472Z GLIBCXX_3.4.15 2025-05-07T20:11:05.3323590Z GLIBCXX_3.4.18 2025-05-07T20:11:05.3323682Z GLIBCXX_3.4.20 2025-05-07T20:11:05.3323769Z GLIBCXX_3.4.21 2025-05-07T20:11:05.3323855Z GLIBCXX_3.4.29 2025-05-07T20:11:05.3324087Z 2025-05-07T20:11:05.3324094Z 2025-05-07T20:11:05.3343226Z + nm -gDC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so > /tmp/tmp.ojy3inSDYI.symbols.txt 2025-05-07T20:11:05.3343256Z 2025-05-07T20:11:05.3527171Z 2025-05-07T20:11:05.3552480Z [CHECK] Total Number of symbols: 2654 2025-05-07T20:11:05.3581293Z [CHECK] Number of fbgemm symbols: 1 2025-05-07T20:11:05.3603581Z + nm -gDCu 
./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so > /tmp/tmp.Q2k5kD5891.usymbols.txt 2025-05-07T20:11:05.3603610Z 2025-05-07T20:11:05.3636108Z 2025-05-07T20:11:05.3664812Z [CHECK] Listing out undefined symbols (194 total): 2025-05-07T20:11:05.3682678Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:05.3682826Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:05.3683012Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:11:05.3683243Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:05.3683366Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:05.3683518Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:05.3683885Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:11:05.3684002Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:11:05.3684139Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:05.3684254Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:05.3684363Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:05.3684471Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:11:05.3684612Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:05.3684720Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:11:05.3684845Z U adjust_info_B_num_bits(int, int) 2025-05-07T20:11:05.3685060Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:11:05.3685204Z U at::RecordFunction::currentThreadId() 2025-05-07T20:11:05.3685320Z U at::RecordFunction::end() 2025-05-07T20:11:05.3685481Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:11:05.3685636Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:11:05.3686419Z U at::_ops::_sparse_coo_tensor_unsafe::call(at::Tensor const&, at::Tensor const&, c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:05.3686790Z U at::_ops::clamp::call(at::Tensor const&, std::optional const&, std::optional const&) 2025-05-07T20:11:05.3687396Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, 
std::optional, std::optional) 2025-05-07T20:11:05.3688099Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:05.3688278Z U at::_ops::fill__Scalar::call(at::Tensor&, c10::Scalar const&) 2025-05-07T20:11:05.3688481Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:11:05.3688881Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:11:05.3689085Z U at::_ops::unsqueeze::call(at::Tensor const&, long) 2025-05-07T20:11:05.3689255Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:11:05.3689420Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:11:05.3689589Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:11:05.3689769Z U at::sequence_number::get_and_increment() 2025-05-07T20:11:05.3689883Z U c10::AnyType::get() 2025-05-07T20:11:05.3690017Z U c10::BoolType::get() 2025-05-07T20:11:05.3690215Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:11:05.3690343Z U c10::Dispatcher::realSingleton() 2025-05-07T20:11:05.3690956Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:11:05.3691627Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:11:05.3692045Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:05.3692164Z U c10::Error::what() const 2025-05-07T20:11:05.3692271Z U c10::FloatType::get() 2025-05-07T20:11:05.3692400Z U c10::GradMode::is_enabled() 2025-05-07T20:11:05.3692524Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:11:05.3692699Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:11:05.3692827Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:11:05.3692977Z U c10::IValue::isBoolList() const 
2025-05-07T20:11:05.3693100Z U c10::IValue::isIntList() const 2025-05-07T20:11:05.3693228Z U c10::IValue::isSymIntList() const 2025-05-07T20:11:05.3693381Z U c10::IValue::isTensorList() const 2025-05-07T20:11:05.3693537Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:05.3693649Z U c10::IntType::get() 2025-05-07T20:11:05.3694182Z U c10::ListType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:05.3694433Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:11:05.3694571Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:11:05.3694712Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:11:05.3694878Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:11:05.3695115Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:05.3695417Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:11:05.3695569Z U c10::StringType::get() 2025-05-07T20:11:05.3695732Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:11:05.3695891Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:05.3696080Z U c10::SymBool::guard_size_oblivious(char const*, long) const 2025-05-07T20:11:05.3696364Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:11:05.3696516Z U c10::SymFloat::operator/(c10::SymFloat const&) const 2025-05-07T20:11:05.3696917Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:05.3697129Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:05.3697252Z U c10::SymInt::operator c10::SymFloat() const 2025-05-07T20:11:05.3697417Z U c10::SymInt::operator*(c10::SymInt const&) const 2025-05-07T20:11:05.3697576Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:11:05.3697699Z U c10::SymInt::promote_to_negative() 2025-05-07T20:11:05.3697831Z U c10::SymInt::sym_eq(c10::SymInt const&) const 2025-05-07T20:11:05.3698018Z U 
c10::SymInt::sym_gt(c10::SymInt const&) const 2025-05-07T20:11:05.3698148Z U c10::SymInt::sym_le(c10::SymInt const&) const 2025-05-07T20:11:05.3698258Z U c10::SymInt::toSymNode() const 2025-05-07T20:11:05.3698378Z U c10::SymIntType::get() 2025-05-07T20:11:05.3698500Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:11:05.3698632Z U c10::TensorType::get() 2025-05-07T20:11:05.3698791Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:05.3699214Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:05.3700012Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:05.3700326Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:05.3700853Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:05.3701215Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:05.3701859Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:05.3702205Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:11:05.3702431Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:11:05.3702565Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:11:05.3702734Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:11:05.3703152Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:05.3703288Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:11:05.3703464Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:11:05.3703631Z U c10::operator<<(std::ostream&, c10::SymFloat const&) 2025-05-07T20:11:05.3703809Z U 
c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:11:05.3704019Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:05.3704150Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:11:05.3704433Z U fbgemm_gpu::config::is_feature_enabled(fbgemm_gpu::config::FeatureGateName const&) 2025-05-07T20:11:05.3704538Z U free@GLIBC_2.2.5 2025-05-07T20:11:05.3704725Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:05.3704843Z U memcmp@GLIBC_2.2.5 2025-05-07T20:11:05.3704947Z U memcpy@GLIBC_2.14 2025-05-07T20:11:05.3705042Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:05.3705170Z U memset@GLIBC_2.2.5 2025-05-07T20:11:05.3705325Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:05.3705458Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:05.3705597Z U realloc@GLIBC_2.2.5 2025-05-07T20:11:05.3705872Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:11:05.3706226Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:05.3706706Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:05.3707062Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const &@GLIBCXX_3.4.29 2025-05-07T20:11:05.3707412Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:11:05.3707831Z U std::__cxx11::basic_stringstream, std::allocator >::~basic_stringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:05.3708251Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:11:05.3708381Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:05.3708528Z U std::__throw_bad_array_new_length() 2025-05-07T20:11:05.3708676Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:05.3708827Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 
2025-05-07T20:11:05.3709039Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:11:05.3709176Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:11:05.3709327Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:11:05.3709610Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:05.3709983Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:11:05.3710589Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:05.3711156Z U std::basic_ostream >& std::operator<< >(std::basic_ostream >&, char const*)@GLIBCXX_3.4 2025-05-07T20:11:05.3711298Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:11:05.3711429Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:05.3711579Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:05.3711707Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:11:05.3711840Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:05.3711970Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:11:05.3712209Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:05.3712396Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:05.3712647Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:05.3712771Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:05.3712937Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:11:05.3713086Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:11:05.3713501Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:11:05.3713643Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:11:05.3713764Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:05.3713901Z U strcmp@GLIBC_2.2.5 2025-05-07T20:11:05.3714017Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:05.3714147Z U 
torch::CppFunction::~CppFunction() 2025-05-07T20:11:05.3714723Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:05.3715200Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:05.3715466Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:05.3715590Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:11:05.3715902Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:11:05.3716091Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:11:05.3716290Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:11:05.3716473Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:11:05.3716827Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:11:05.3716974Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:11:05.3717160Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:11:05.3717348Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:11:05.3717464Z U torch::autograd::Node::assign_parent() 2025-05-07T20:11:05.3717582Z U torch::autograd::Node::metadata() 2025-05-07T20:11:05.3717729Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:11:05.3717967Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:11:05.3718229Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:11:05.3718382Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:11:05.3718577Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:11:05.3718791Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 
2025-05-07T20:11:05.3721397Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:11:05.3721739Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:11:05.3721892Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:11:05.3722083Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:11:05.3722937Z U torch::autograd::profiler::record_function_enter_new(std::__cxx11::basic_string, std::allocator > const&, std::optional, std::allocator > > const&) 2025-05-07T20:11:05.3724690Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:11:05.3725120Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:11:05.3725602Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:05.3725741Z U typeinfo for c10::Error 2025-05-07T20:11:05.3725896Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:11:05.3726046Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:11:05.3726186Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:11:05.3726321Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:11:05.3726487Z U typeinfo for torch::autograd::Node 2025-05-07T20:11:05.3726648Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:05.3726816Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:05.3726996Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:11:05.3727166Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:05.3727271Z U vtable for c10::Error 2025-05-07T20:11:05.3727650Z U vtable for std::__cxx11::basic_ostringstream, 
std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:05.3727993Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:05.3728134Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:11:05.3728359Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:11:05.3728595Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:05.3728711Z U vtable for torch::autograd::Node 2025-05-07T20:11:05.3728907Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:11:05.3729021Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:05.3729130Z w _ITM_registerTMCloneTable 2025-05-07T20:11:05.3729232Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:05.3729346Z w __gmon_start__ 2025-05-07T20:11:05.3729446Z w __pthread_key_create 2025-05-07T20:11:05.3729560Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:11:05.3729689Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:11:05.3729842Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:05.3730137Z + ldd ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:11:05.3730145Z 2025-05-07T20:11:05.3730296Z linux-vdso.so.1 (0x00007ffc2afdb000) 2025-05-07T20:11:05.3730388Z libc10.so => not found 2025-05-07T20:11:05.3730849Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_config.so (0x00007f9d10722000) 2025-05-07T20:11:05.3731337Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007f9d0f000000) 2025-05-07T20:11:05.3731441Z libtorch.so => not found 2025-05-07T20:11:05.3731542Z libtorch_cpu.so => not found 2025-05-07T20:11:05.3731648Z libtorch_cuda.so => not found 2025-05-07T20:11:05.3731834Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f9d0ed9c000) 2025-05-07T20:11:05.3731987Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f9d106f2000) 2025-05-07T20:11:05.3732111Z libc.so.6 => 
/lib64/libc.so.6 (0x00007f9d0eb94000) 2025-05-07T20:11:05.3732271Z /lib64/ld-linux-x86-64.so.2 (0x00007f9d10732000) 2025-05-07T20:11:05.3732395Z libc10.so => not found 2025-05-07T20:11:05.3732495Z libtorch_cpu.so => not found 2025-05-07T20:11:05.3732645Z libtorch_cuda.so => not found 2025-05-07T20:11:05.3732743Z libtorch.so => not found 2025-05-07T20:11:05.3732834Z libtorch.so => not found 2025-05-07T20:11:05.3732920Z libc10.so => not found 2025-05-07T20:11:05.3733041Z libc10_cuda.so => not found 2025-05-07T20:11:05.3733139Z libtorch_cpu.so => not found 2025-05-07T20:11:05.3733244Z libtorch_cuda.so => not found 2025-05-07T20:11:05.3733369Z libcudart.so.12 => not found 2025-05-07T20:11:05.3733509Z libm.so.6 => /lib64/libm.so.6 (0x00007f9d10613000) 2025-05-07T20:11:05.3733514Z 2025-05-07T20:11:05.3733621Z [CHECK] Displaying ELF information: 2025-05-07T20:11:05.3733942Z + readelf -d ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:11:05.3733948Z 2025-05-07T20:11:05.3762375Z 2025-05-07T20:11:05.3762908Z Dynamic section at offset 0x39abb0 contains 38 entries: 2025-05-07T20:11:05.3763065Z Tag Type Name/Value 2025-05-07T20:11:05.3763303Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:05.3763571Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_config.so] 2025-05-07T20:11:05.3763846Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_utils.so] 2025-05-07T20:11:05.3764057Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:05.3764262Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:05.3764472Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:05.3764678Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:05.3764891Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:05.3765083Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 
2025-05-07T20:11:05.3765328Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:11:05.3765653Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward_split_host.so] 2025-05-07T20:11:05.3765837Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:11:05.3765961Z 0x000000000000000c (INIT) 0xb9000 2025-05-07T20:11:05.3766098Z 0x000000000000000d (FINI) 0x33effc 2025-05-07T20:11:05.3766220Z 0x0000000000000019 (INIT_ARRAY) 0x397b28 2025-05-07T20:11:05.3766355Z 0x000000000000001b (INIT_ARRAYSZ) 304 (bytes) 2025-05-07T20:11:05.3766493Z 0x000000000000001a (FINI_ARRAY) 0x397c58 2025-05-07T20:11:05.3766617Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:05.3766726Z 0x0000000000000004 (HASH) 0x200 2025-05-07T20:11:05.3766864Z 0x000000006ffffef5 (GNU_HASH) 0x3b08 2025-05-07T20:11:05.3766977Z 0x0000000000000005 (STRTAB) 0x17258 2025-05-07T20:11:05.3767087Z 0x0000000000000006 (SYMTAB) 0x7970 2025-05-07T20:11:05.3767227Z 0x000000000000000a (STRSZ) 529940 (bytes) 2025-05-07T20:11:05.3767468Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:05.3767586Z 0x0000000000000003 (PLTGOT) 0x39ae50 2025-05-07T20:11:05.3767719Z 0x0000000000000002 (PLTRELSZ) 14112 (bytes) 2025-05-07T20:11:05.3767842Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:05.3767954Z 0x0000000000000017 (JMPREL) 0xb52c8 2025-05-07T20:11:05.3768059Z 0x0000000000000007 (RELA) 0x99e60 2025-05-07T20:11:05.3768196Z 0x0000000000000008 (RELASZ) 111720 (bytes) 2025-05-07T20:11:05.3768321Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:05.3768418Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:05.3768542Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:05.3768673Z 0x000000006ffffffe (VERNEED) 0x99d30 2025-05-07T20:11:05.3768904Z 0x000000006fffffff (VERNEEDNUM) 4 2025-05-07T20:11:05.3769084Z 0x000000006ffffff0 (VERSYM) 0x9886c 2025-05-07T20:11:05.3769205Z 0x000000006ffffff9 (RELACOUNT) 40 2025-05-07T20:11:05.3769306Z 
0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:05.3769311Z 2025-05-07T20:11:05.3769432Z ################################################################################ 2025-05-07T20:11:05.3769437Z 2025-05-07T20:11:05.3769441Z 2025-05-07T20:11:05.3769599Z ################################################################################ 2025-05-07T20:11:05.3769906Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:11:05.3770009Z [CHECK] Listing out library size: 2025-05-07T20:11:05.3770308Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:11:05.3770354Z 2025-05-07T20:11:05.3773922Z 343 ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:11:05.3774952Z 2025-05-07T20:11:05.3775892Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:11:05.3776462Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_forward.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:05.3776468Z 2025-05-07T20:11:05.4742147Z GLIBC_2.2.5 2025-05-07T20:11:05.4742386Z GLIBC_2.3 2025-05-07T20:11:05.4742756Z GLIBC_2.14 2025-05-07T20:11:05.4742763Z 2025-05-07T20:11:05.4742775Z 2025-05-07T20:11:05.4743244Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:11:05.4743808Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_forward.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:05.4743836Z 2025-05-07T20:11:05.5703527Z GLIBCXX_3.4 2025-05-07T20:11:05.5703635Z GLIBCXX_3.4.9 2025-05-07T20:11:05.5703746Z GLIBCXX_3.4.20 2025-05-07T20:11:05.5703863Z GLIBCXX_3.4.21 2025-05-07T20:11:05.5703953Z GLIBCXX_3.4.29 
2025-05-07T20:11:05.5704267Z 2025-05-07T20:11:05.5704403Z 2025-05-07T20:11:05.5728614Z + nm -gDC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_forward.so > /tmp/tmp.yEcwdy0veH.symbols.txt 2025-05-07T20:11:05.5728656Z 2025-05-07T20:11:05.6658192Z 2025-05-07T20:11:05.6702651Z [CHECK] Total Number of symbols: 12731 2025-05-07T20:11:05.6753869Z [CHECK] Number of fbgemm symbols: 5268 2025-05-07T20:11:05.6778440Z + nm -gDCu ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_forward.so > /tmp/tmp.7h0iC3qknE.usymbols.txt 2025-05-07T20:11:05.6778996Z 2025-05-07T20:11:05.6844495Z 2025-05-07T20:11:05.6880405Z [CHECK] Listing out undefined symbols (178 total): 2025-05-07T20:11:05.6903859Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:05.6904513Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:05.6904917Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:11:05.6905321Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:11:05.6905720Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:11:05.6906109Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:11:05.6906504Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:11:05.6906994Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:11:05.6907383Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:11:05.6907786Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:11:05.6908111Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:05.6908461Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:05.6908859Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:05.6909382Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:11:05.6909755Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:11:05.6910086Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:05.6910431Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:05.6910747Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:05.6911065Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:11:05.6911411Z U 
__gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:05.6911747Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:11:05.6912185Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:11:05.6912626Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:11:05.6913179Z U at::Tensor fbgemm_gpu::reshape_vbe_offsets(at::Tensor const&, at::Tensor const&, long, int) 2025-05-07T20:11:05.6913952Z U at::Tensor fbgemm_gpu::reshape_vbe_offsets(at::Tensor const&, at::Tensor const&, long, int) 2025-05-07T20:11:05.6914609Z U at::Tensor::index(std::initializer_list) const 2025-05-07T20:11:05.6915238Z U at::Tensor::index_put_(std::initializer_list, at::Tensor const&) 2025-05-07T20:11:05.6916326Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:05.6917520Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:11:05.6918052Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:11:05.6918550Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:11:05.6919028Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:11:05.6919477Z U c10::BFloat16* at::TensorBase::data_ptr() const 2025-05-07T20:11:05.6920001Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:05.6920455Z U c10::BoolType::get() 2025-05-07T20:11:05.6920828Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:05.6921301Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:11:05.6921729Z U c10::Dispatcher::realSingleton() 2025-05-07T20:11:05.6922512Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:11:05.6923839Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:11:05.6924998Z U 
c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:05.6925620Z U c10::Error::what() const 2025-05-07T20:11:05.6925932Z U c10::FloatType::get() 2025-05-07T20:11:05.6926311Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:11:05.6926779Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:05.6927214Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:05.6927594Z U c10::IntType::get() 2025-05-07T20:11:05.6927969Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:11:05.6928404Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:11:05.6928784Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:11:05.6929151Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:11:05.6929608Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:11:05.6930046Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:05.6930484Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:11:05.6931274Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:05.6931983Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:05.6932389Z U c10::SymInt::operator+=(c10::SymInt const&) 2025-05-07T20:11:05.6932777Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:11:05.6933178Z U c10::SymInt::promote_to_negative() 2025-05-07T20:11:05.6933575Z U c10::SymInt::sym_eq(c10::SymInt const&) const 2025-05-07T20:11:05.6933970Z U c10::SymInt::sym_ge(c10::SymInt const&) const 2025-05-07T20:11:05.6934360Z U c10::SymInt::sym_gt(c10::SymInt const&) const 2025-05-07T20:11:05.6934736Z U c10::SymInt::sym_le(c10::SymInt const&) const 2025-05-07T20:11:05.6935127Z U c10::SymInt::sym_ne(c10::SymInt const&) const 2025-05-07T20:11:05.6935486Z U c10::SymInt::toSymNode() const 2025-05-07T20:11:05.6935828Z U c10::SymIntType::get() 2025-05-07T20:11:05.6936162Z U c10::TensorType::get() 2025-05-07T20:11:05.6936495Z U 
c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:05.6937495Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:11:05.6938499Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:11:05.6938953Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:11:05.6939408Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:11:05.6939784Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:11:05.6940157Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:11:05.6940541Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:11:05.6941047Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:11:05.6941535Z U c10::cuda::device_count() 2025-05-07T20:11:05.6941907Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:11:05.6942334Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:11:05.6942728Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:11:05.6943146Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:11:05.6943562Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:11:05.6943977Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:11:05.6944829Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:05.6945739Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:05.6946647Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:05.6947651Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:05.6948732Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:05.6949633Z U c10::get_default_dtype() 2025-05-07T20:11:05.6950015Z U 
c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:11:05.6950389Z U c10::impl::GPUTrace::haveState 2025-05-07T20:11:05.6950970Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:11:05.6951611Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:11:05.6952216Z U c10::impl::device_guard_impl_registry 2025-05-07T20:11:05.6952564Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:11:05.6952962Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:11:05.6953367Z U c10::operator%(c10::SymInt const&, int) 2025-05-07T20:11:05.6953717Z U c10::operator*(c10::SymInt const&, long) 2025-05-07T20:11:05.6954313Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:11:05.6954821Z U c10::operator<(c10::SymInt const&, int) 2025-05-07T20:11:05.6955205Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:11:05.6955623Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:11:05.6956087Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:11:05.6956558Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:11:05.6956976Z U c10::warn(c10::Warning const&) 2025-05-07T20:11:05.6957410Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:05.6957898Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:11:05.6958299Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:11:05.6958707Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:11:05.6959151Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:11:05.6959523Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:11:05.6959909Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:11:05.6960279Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:11:05.6960679Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:11:05.6961049Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:11:05.6961436Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:11:05.6961803Z U 
cudaStreamQuery@libcudart.so.12 2025-05-07T20:11:05.6962213Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:11:05.6962629Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:11:05.6963180Z U fbgemm_gpu::reshape_vbe_output(at::Tensor const&, long, at::Tensor const&, at::Tensor const&) 2025-05-07T20:11:05.6963760Z U float at::Tensor::item() const 2025-05-07T20:11:05.6964148Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:11:05.6964604Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:05.6965008Z U free@GLIBC_2.2.5 2025-05-07T20:11:05.6965321Z U int at::Tensor::item() const 2025-05-07T20:11:05.6965719Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:11:05.6966118Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:05.6966592Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:05.6967212Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:11:05.6967643Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:05.6968037Z U memcpy@GLIBC_2.14 2025-05-07T20:11:05.6968394Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:05.6968717Z U memset@GLIBC_2.2.5 2025-05-07T20:11:05.6969111Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:05.6969527Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:05.6970167Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:05.6971122Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:05.6972070Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const &@GLIBCXX_3.4.29 2025-05-07T20:11:05.6972665Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:05.6973074Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:05.6973625Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:05.6974088Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:11:05.6974635Z U 
std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:05.6975350Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:11:05.6976425Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:05.6977694Z U std::basic_ostream >& std::operator<< >(std::basic_ostream >&, char const*)@GLIBCXX_3.4 2025-05-07T20:11:05.6978444Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:11:05.6978824Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:05.6979189Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:05.6979610Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:11:05.6980169Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:05.6980520Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:11:05.6980903Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:05.6981321Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:05.6982102Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:05.6982625Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:05.6982981Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:05.6983332Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:05.6983654Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:05.6984536Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:05.6985764Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:05.6986622Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:05.6987392Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:05.6988006Z U typeinfo for c10::Error 2025-05-07T20:11:05.6988379Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 
2025-05-07T20:11:05.6988831Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:05.6989281Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:05.6989671Z U vtable for c10::Error 2025-05-07T20:11:05.6990348Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:05.6991209Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:05.6991889Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:11:05.6992558Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:05.6993103Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:11:05.6993537Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:05.6993874Z w _ITM_registerTMCloneTable 2025-05-07T20:11:05.6994317Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:05.6994654Z w __gmon_start__ 2025-05-07T20:11:05.6994952Z w __pthread_key_create 2025-05-07T20:11:05.6995288Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:05.6995796Z + ldd ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:11:05.6996140Z 2025-05-07T20:11:05.6996301Z linux-vdso.so.1 (0x00007ffe34958000) 2025-05-07T20:11:05.6996578Z libc10.so => not found 2025-05-07T20:11:05.6996845Z libc10_cuda.so => not found 2025-05-07T20:11:05.6997489Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_common.so (0x00007ffbbac00000) 2025-05-07T20:11:05.6998175Z libtorch.so => not found 2025-05-07T20:11:05.6998498Z libtorch_cpu.so => not found 2025-05-07T20:11:05.6998776Z libtorch_cuda.so => not found 2025-05-07T20:11:05.6999058Z libcudart.so.12 => not found 2025-05-07T20:11:05.6999392Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007ffbba99c000) 2025-05-07T20:11:05.6999875Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007ffbd0f2f000) 2025-05-07T20:11:05.7000265Z libc.so.6 => /lib64/libc.so.6 
(0x00007ffbba794000) 2025-05-07T20:11:05.7000641Z /lib64/ld-linux-x86-64.so.2 (0x00007ffbd0f63000) 2025-05-07T20:11:05.7000980Z libc10.so => not found 2025-05-07T20:11:05.7001245Z libc10_cuda.so => not found 2025-05-07T20:11:05.7001775Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm.so (0x00007ffbba200000) 2025-05-07T20:11:05.7013479Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_config.so (0x00007ffbd0f21000) 2025-05-07T20:11:05.7014167Z libtorch.so => not found 2025-05-07T20:11:05.7014427Z libtorch_cpu.so => not found 2025-05-07T20:11:05.7014724Z libtorch_cuda.so => not found 2025-05-07T20:11:05.7014992Z libcudart.so.12 => not found 2025-05-07T20:11:05.7015308Z libm.so.6 => /lib64/libm.so.6 (0x00007ffbba125000) 2025-05-07T20:11:05.7015626Z libc10.so => not found 2025-05-07T20:11:05.7016155Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/asmjit.so (0x00007ffbbaf85000) 2025-05-07T20:11:05.7016724Z libtorch.so => not found 2025-05-07T20:11:05.7016991Z libtorch_cpu.so => not found 2025-05-07T20:11:05.7017268Z libtorch_cuda.so => not found 2025-05-07T20:11:05.7017519Z libc10.so => not found 2025-05-07T20:11:05.7017769Z libtorch_cpu.so => not found 2025-05-07T20:11:05.7018032Z libtorch_cuda.so => not found 2025-05-07T20:11:05.7018298Z libtorch.so => not found 2025-05-07T20:11:05.7018549Z libtorch_cpu.so => not found 2025-05-07T20:11:05.7018821Z libtorch_cuda.so => not found 2025-05-07T20:11:05.7019073Z libtorch.so => not found 2025-05-07T20:11:05.7019253Z 2025-05-07T20:11:05.7019493Z [CHECK] Displaying ELF information: 2025-05-07T20:11:05.7020272Z + readelf -d ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:11:05.7020723Z 2025-05-07T20:11:05.7027377Z 2025-05-07T20:11:05.7027917Z Dynamic section at offset 0x1569a110 contains 39 entries: 2025-05-07T20:11:05.7029293Z Tag Type Name/Value 
2025-05-07T20:11:05.7030211Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:05.7037114Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:05.7037684Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_common.so] 2025-05-07T20:11:05.7038227Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:05.7038809Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:05.7039327Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:05.7039862Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:11:05.7040471Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:05.7041023Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:05.7041548Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:05.7042061Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:11:05.7042668Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_forward.so] 2025-05-07T20:11:05.7043235Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:11:05.7043658Z 0x000000000000000c (INIT) 0x44b000 2025-05-07T20:11:05.7044016Z 0x000000000000000d (FINI) 0x22530cc 2025-05-07T20:11:05.7044365Z 0x0000000000000019 (INIT_ARRAY) 0x15698508 2025-05-07T20:11:05.7044742Z 0x000000000000001b (INIT_ARRAYSZ) 752 (bytes) 2025-05-07T20:11:05.7045220Z 0x000000000000001a (FINI_ARRAY) 0x156987f8 2025-05-07T20:11:05.7045771Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:05.7046109Z 0x0000000000000004 (HASH) 0x200 2025-05-07T20:11:05.7046463Z 0x000000006ffffef5 (GNU_HASH) 0x10898 2025-05-07T20:11:05.7046812Z 0x0000000000000005 (STRTAB) 0x6f610 2025-05-07T20:11:05.7047159Z 0x0000000000000006 (SYMTAB) 0x24c70 2025-05-07T20:11:05.7047523Z 0x000000000000000a (STRSZ) 3691715 (bytes) 2025-05-07T20:11:05.7047910Z 0x000000000000000b (SYMENT) 24 
(bytes) 2025-05-07T20:11:05.7048277Z 0x0000000000000003 (PLTGOT) 0x1569a3c0 2025-05-07T20:11:05.7048651Z 0x0000000000000002 (PLTRELSZ) 10920 (bytes) 2025-05-07T20:11:05.7049020Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:05.7049407Z 0x0000000000000017 (JMPREL) 0x4484b0 2025-05-07T20:11:05.7049770Z 0x0000000000000007 (RELA) 0x3faf60 2025-05-07T20:11:05.7050122Z 0x0000000000000008 (RELASZ) 316752 (bytes) 2025-05-07T20:11:05.7050518Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:05.7050847Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:05.7051202Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:05.7051584Z 0x000000006ffffffe (VERNEED) 0x3fae50 2025-05-07T20:11:05.7051927Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:11:05.7052285Z 0x000000006ffffff0 (VERSYM) 0x3f4ad4 2025-05-07T20:11:05.7052623Z 0x000000006ffffff9 (RELACOUNT) 136 2025-05-07T20:11:05.7052948Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:05.7053168Z 2025-05-07T20:11:05.7053307Z ################################################################################ 2025-05-07T20:11:05.7053540Z 2025-05-07T20:11:05.7053544Z 2025-05-07T20:11:05.7053654Z ################################################################################ 2025-05-07T20:11:05.7054211Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:11:05.7054741Z [CHECK] Listing out library size: 2025-05-07T20:11:05.7055259Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:11:05.7055677Z 2025-05-07T20:11:05.7057551Z 1 ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:11:05.7057962Z 2025-05-07T20:11:05.7058404Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:11:05.7059764Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 
| grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:05.7060468Z 2025-05-07T20:11:05.7110705Z GLIBC_2.2.5 2025-05-07T20:11:05.7110945Z GLIBC_2.3 2025-05-07T20:11:05.7111161Z GLIBC_2.14 2025-05-07T20:11:05.7115850Z 2025-05-07T20:11:05.7115854Z 2025-05-07T20:11:05.7116330Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:11:05.7117690Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_embedding_inplace_ops.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:05.7118358Z 2025-05-07T20:11:05.7182688Z GLIBCXX_3.4 2025-05-07T20:11:05.7182941Z GLIBCXX_3.4.9 2025-05-07T20:11:05.7183168Z GLIBCXX_3.4.18 2025-05-07T20:11:05.7183381Z GLIBCXX_3.4.20 2025-05-07T20:11:05.7183601Z GLIBCXX_3.4.21 2025-05-07T20:11:05.7183809Z GLIBCXX_3.4.29 2025-05-07T20:11:05.7184228Z 2025-05-07T20:11:05.7184248Z 2025-05-07T20:11:05.7205947Z + nm -gDC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_embedding_inplace_ops.so > /tmp/tmp.hQ93TO3kqH.symbols.txt 2025-05-07T20:11:05.7206583Z 2025-05-07T20:11:05.7237676Z 2025-05-07T20:11:05.7266072Z [CHECK] Total Number of symbols: 356 2025-05-07T20:11:05.7283213Z [CHECK] Number of fbgemm symbols: 56 2025-05-07T20:11:05.7302242Z + nm -gDCu ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_embedding_inplace_ops.so > /tmp/tmp.giee8tF7HN.usymbols.txt 2025-05-07T20:11:05.7303392Z 2025-05-07T20:11:05.7318894Z 2025-05-07T20:11:05.7342715Z [CHECK] Listing out undefined symbols (123 total): 2025-05-07T20:11:05.7360190Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:05.7361401Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:05.7361976Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:05.7362363Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:11:05.7362769Z U 
__cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:11:05.7363180Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:11:05.7363566Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:11:05.7363978Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:11:05.7364334Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:11:05.7364728Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:11:05.7365080Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:05.7365413Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:05.7365756Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:05.7366067Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:11:05.7366412Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:05.7366737Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:05.7367074Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:05.7367396Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:11:05.7367744Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:05.7368073Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:11:05.7368907Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:05.7370474Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:05.7371516Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:05.7371944Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:05.7372427Z U c10::IntType::get() 2025-05-07T20:11:05.7372840Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:11:05.7373267Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:11:05.7373710Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:05.7374480Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:05.7375197Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:05.7375546Z U c10::TensorType::get() 2025-05-07T20:11:05.7375887Z U 
c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:05.7376861Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:11:05.7377830Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:11:05.7378211Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:11:05.7378558Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:11:05.7378926Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:11:05.7379269Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:11:05.7379739Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:11:05.7380426Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:11:05.7380906Z U c10::cuda::device_count() 2025-05-07T20:11:05.7381283Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:11:05.7381674Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:11:05.7382296Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:11:05.7382721Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:11:05.7383136Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:11:05.7383546Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:11:05.7384311Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:05.7385246Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:05.7386160Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:05.7387140Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:05.7387741Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:11:05.7388103Z U c10::impl::GPUTrace::haveState 2025-05-07T20:11:05.7388453Z U c10::impl::device_guard_impl_registry 2025-05-07T20:11:05.7389013Z U c10::operator<<(std::ostream&, 
c10::Device const&) 2025-05-07T20:11:05.7389415Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:11:05.7389797Z U c10::warn(c10::Warning const&) 2025-05-07T20:11:05.7390213Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:05.7390731Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:11:05.7391232Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:11:05.7391598Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:11:05.7391973Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:11:05.7392325Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:11:05.7392681Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:11:05.7393105Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:11:05.7393509Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:11:05.7393915Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:11:05.7394278Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:11:05.7394646Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:11:05.7395028Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:11:05.7395405Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:11:05.7395769Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:11:05.7396171Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:05.7396613Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:05.7397083Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:05.7397463Z U memcpy@GLIBC_2.14 2025-05-07T20:11:05.7397752Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:05.7398074Z U memset@GLIBC_2.2.5 2025-05-07T20:11:05.7398413Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:05.7398829Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:05.7399422Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:05.7400315Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:05.7401199Z U 
std::__cxx11::basic_stringbuf, std::allocator >::str() const &@GLIBCXX_3.4.29 2025-05-07T20:11:05.7402051Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:11:05.7402697Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:05.7403074Z U std::__throw_bad_array_new_length() 2025-05-07T20:11:05.7403449Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:05.7403867Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:05.7404301Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:11:05.7404873Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:05.7405619Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:11:05.7406698Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:05.7407961Z U std::basic_ostream >& std::operator<< >(std::basic_ostream >&, char const*)@GLIBCXX_3.4 2025-05-07T20:11:05.7408753Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:05.7409115Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:05.7409499Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:11:05.7409847Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:05.7410244Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:11:05.7410638Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:05.7411062Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:05.7411637Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:05.7412131Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:05.7412541Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:05.7412869Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:05.7413200Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:05.7414076Z U torch::Library::Library(torch::Library::Kind, 
std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:05.7415436Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:05.7416297Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:05.7417058Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:05.7417751Z U unsigned char* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:05.7418257Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:05.7418711Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:05.7419314Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:05.7420068Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:05.7420896Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:05.7421731Z U vtable for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:05.7422439Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:11:05.7423009Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:05.7423494Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:05.7423829Z w _ITM_registerTMCloneTable 2025-05-07T20:11:05.7424178Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:05.7424503Z w __gmon_start__ 2025-05-07T20:11:05.7424786Z w __pthread_key_create 2025-05-07T20:11:05.7425160Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:05.7425664Z + ldd ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:11:05.7426055Z 2025-05-07T20:11:05.7426189Z linux-vdso.so.1 (0x00007fffb61f7000) 2025-05-07T20:11:05.7426495Z libtorch.so => not found 2025-05-07T20:11:05.7426773Z libc10.so => not found 2025-05-07T20:11:05.7427042Z libc10_cuda.so => not found 
2025-05-07T20:11:05.7427305Z libtorch_cpu.so => not found 2025-05-07T20:11:05.7427586Z libtorch_cuda.so => not found 2025-05-07T20:11:05.7427849Z libcudart.so.12 => not found 2025-05-07T20:11:05.7428184Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f0ef78a1000) 2025-05-07T20:11:05.7428602Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f0ef7873000) 2025-05-07T20:11:05.7429011Z libc.so.6 => /lib64/libc.so.6 (0x00007f0ef766b000) 2025-05-07T20:11:05.7429383Z /lib64/ld-linux-x86-64.so.2 (0x00007f0ef7b78000) 2025-05-07T20:11:05.7429758Z libm.so.6 => /lib64/libm.so.6 (0x00007f0ef7590000) 2025-05-07T20:11:05.7429990Z 2025-05-07T20:11:05.7430117Z [CHECK] Displaying ELF information: 2025-05-07T20:11:05.7430656Z + readelf -d ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:11:05.7431074Z 2025-05-07T20:11:05.7436076Z 2025-05-07T20:11:05.7436237Z Dynamic section at offset 0x6a540 contains 37 entries: 2025-05-07T20:11:05.7436620Z Tag Type Name/Value 2025-05-07T20:11:05.7437093Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:05.7437633Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:05.7438161Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:05.7438700Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:05.7439219Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:05.7439797Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:11:05.7440324Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:05.7440865Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:05.7441370Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:05.7441914Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:11:05.7442524Z 0x000000000000000e (SONAME) Library soname: 
[fbgemm_gpu_embedding_inplace_ops.so] 2025-05-07T20:11:05.7443024Z 0x000000000000000c (INIT) 0xf000 2025-05-07T20:11:05.7443380Z 0x000000000000000d (FINI) 0x2c63c 2025-05-07T20:11:05.7443707Z 0x0000000000000019 (INIT_ARRAY) 0x6b1f8 2025-05-07T20:11:05.7444072Z 0x000000000000001b (INIT_ARRAYSZ) 40 (bytes) 2025-05-07T20:11:05.7444416Z 0x000000000000001a (FINI_ARRAY) 0x6b220 2025-05-07T20:11:05.7444770Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:05.7445126Z 0x0000000000000004 (HASH) 0x238 2025-05-07T20:11:05.7445449Z 0x000000006ffffef5 (GNU_HASH) 0x12b0 2025-05-07T20:11:05.7445789Z 0x0000000000000005 (STRTAB) 0x3ff0 2025-05-07T20:11:05.7446116Z 0x0000000000000006 (SYMTAB) 0x1e78 2025-05-07T20:11:05.7446478Z 0x000000000000000a (STRSZ) 31424 (bytes) 2025-05-07T20:11:05.7446842Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:05.7447192Z 0x0000000000000003 (PLTGOT) 0x6b7e0 2025-05-07T20:11:05.7447547Z 0x0000000000000002 (PLTRELSZ) 4320 (bytes) 2025-05-07T20:11:05.7447916Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:05.7448245Z 0x0000000000000017 (JMPREL) 0xd0f8 2025-05-07T20:11:05.7448567Z 0x0000000000000007 (RELA) 0xbeb0 2025-05-07T20:11:05.7448918Z 0x0000000000000008 (RELASZ) 4680 (bytes) 2025-05-07T20:11:05.7449272Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:05.7449622Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:05.7449942Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:05.7450317Z 0x000000006ffffffe (VERNEED) 0xbd80 2025-05-07T20:11:05.7450653Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:11:05.7450973Z 0x000000006ffffff0 (VERSYM) 0xbab0 2025-05-07T20:11:05.7451313Z 0x000000006ffffff9 (RELACOUNT) 24 2025-05-07T20:11:05.7451623Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:05.7451828Z 2025-05-07T20:11:05.7451970Z ################################################################################ 2025-05-07T20:11:05.7452198Z 2025-05-07T20:11:05.7452203Z 2025-05-07T20:11:05.7452336Z 
################################################################################ 2025-05-07T20:11:05.7452852Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:11:05.7453385Z [CHECK] Listing out library size: 2025-05-07T20:11:05.7453878Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:11:05.7454320Z 2025-05-07T20:11:05.7454532Z 35 ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:11:05.7454858Z 2025-05-07T20:11:05.7455268Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:11:05.7456350Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_index_select.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:05.7456976Z 2025-05-07T20:11:05.7562762Z GLIBC_2.2.5 2025-05-07T20:11:05.7563799Z GLIBC_2.3 2025-05-07T20:11:05.7564080Z GLIBC_2.14 2025-05-07T20:11:05.7564227Z 2025-05-07T20:11:05.7564235Z 2025-05-07T20:11:05.7564731Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:11:05.7567582Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_index_select.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:05.7568252Z 2025-05-07T20:11:05.7667606Z GLIBCXX_3.4 2025-05-07T20:11:05.7667925Z GLIBCXX_3.4.9 2025-05-07T20:11:05.7668998Z GLIBCXX_3.4.11 2025-05-07T20:11:05.7669664Z GLIBCXX_3.4.15 2025-05-07T20:11:05.7670253Z GLIBCXX_3.4.18 2025-05-07T20:11:05.7670819Z GLIBCXX_3.4.20 2025-05-07T20:11:05.7671401Z GLIBCXX_3.4.21 2025-05-07T20:11:05.7671953Z GLIBCXX_3.4.29 2025-05-07T20:11:05.7672328Z 2025-05-07T20:11:05.7672364Z 2025-05-07T20:11:05.7687673Z + nm -gDC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_index_select.so > 
/tmp/tmp.o7RpTCVUIh.symbols.txt 2025-05-07T20:11:05.7689158Z 2025-05-07T20:11:05.7765430Z 2025-05-07T20:11:05.7811601Z [CHECK] Total Number of symbols: 1477 2025-05-07T20:11:05.7828540Z [CHECK] Number of fbgemm symbols: 213 2025-05-07T20:11:05.7848473Z + nm -gDCu ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_index_select.so > /tmp/tmp.xP4Uov8aNt.usymbols.txt 2025-05-07T20:11:05.7850044Z 2025-05-07T20:11:05.7867584Z 2025-05-07T20:11:05.7892089Z [CHECK] Listing out undefined symbols (270 total): 2025-05-07T20:11:05.7904849Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:05.7905723Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:05.7906288Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:05.7906658Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:11:05.7907062Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:11:05.7907464Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:11:05.7907845Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:11:05.7908248Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:11:05.7908601Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:11:05.7908961Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:11:05.7909358Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:11:05.7909687Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:05.7910025Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:05.7910336Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:05.7910467Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:11:05.7910572Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:11:05.7910676Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:05.7910801Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:05.7910906Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:05.7911016Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:11:05.7911115Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:11:05.7911393Z U __gxx_personality_v0@CXXABI_1.3 
2025-05-07T20:11:05.7911495Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:11:05.7911659Z U adjust_info_B_num_bits(int, int) 2025-05-07T20:11:05.7911825Z U at::Context::deterministicAlgorithms() const 2025-05-07T20:11:05.7911999Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:11:05.7912130Z U at::RecordFunction::currentThreadId() 2025-05-07T20:11:05.7912333Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:11:05.7912481Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:11:05.7912667Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:11:05.7912795Z U at::TensorMaker::make_tensor() 2025-05-07T20:11:05.7912911Z U at::_ops::all::call(at::Tensor const&) 2025-05-07T20:11:05.7913349Z U at::_ops::concat::call(c10::ArrayRef, long) 2025-05-07T20:11:05.7913506Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:11:05.7914117Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:05.7914773Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:05.7914952Z U at::_ops::eq_Scalar::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:11:05.7915115Z U at::_ops::eq_Tensor::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:11:05.7915296Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:11:05.7915477Z U at::_ops::gt_Scalar::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:11:05.7915779Z U at::_ops::index_add::call(at::Tensor const&, long, at::Tensor const&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:11:05.7915981Z U at::_ops::index_select::call(at::Tensor const&, long, at::Tensor const&) 2025-05-07T20:11:05.7916111Z U at::_ops::max::call(at::Tensor const&) 2025-05-07T20:11:05.7916278Z U at::_ops::mul_Tensor::call(at::Tensor const&, at::Tensor const&) 
2025-05-07T20:11:05.7916477Z U at::_ops::narrow::call(at::Tensor const&, long, c10::SymInt, c10::SymInt) 2025-05-07T20:11:05.7916664Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:11:05.7916902Z U at::_ops::split_with_sizes::call(at::Tensor const&, c10::ArrayRef, long) 2025-05-07T20:11:05.7917210Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:11:05.7917858Z U at::_ops::to_dtype_layout::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, bool, bool, std::optional) 2025-05-07T20:11:05.7918034Z U at::_ops::view::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:11:05.7918188Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:11:05.7918678Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:05.7919439Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:05.7919594Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:11:05.7919746Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:11:05.7919930Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:11:05.7920046Z U at::globalContext() 2025-05-07T20:11:05.7920189Z U at::has_internal_overlap(at::TensorBase const&) 2025-05-07T20:11:05.7920317Z U at::sequence_number::get_and_increment() 2025-05-07T20:11:05.7920435Z U bool at::Tensor::item() const 2025-05-07T20:11:05.7920568Z U c10::AnyType::get() 2025-05-07T20:11:05.7920744Z U c10::AutogradMetaInterface::~AutogradMetaInterface() 2025-05-07T20:11:05.7920955Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:05.7921074Z U c10::BoolType::get() 2025-05-07T20:11:05.7921257Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:05.7921444Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 
2025-05-07T20:11:05.7921573Z U c10::Dispatcher::realSingleton() 2025-05-07T20:11:05.7922105Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:11:05.7922762Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:11:05.7923172Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:05.7923274Z U c10::Error::what() const 2025-05-07T20:11:05.7923383Z U c10::GradMode::is_enabled() 2025-05-07T20:11:05.7923520Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:11:05.7923699Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:05.7923853Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:11:05.7924004Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:11:05.7924117Z U c10::IValue::isBoolList() const 2025-05-07T20:11:05.7924222Z U c10::IValue::isIntList() const 2025-05-07T20:11:05.7924356Z U c10::IValue::isSymIntList() const 2025-05-07T20:11:05.7924477Z U c10::IValue::isTensorList() const 2025-05-07T20:11:05.7924621Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:05.7924741Z U c10::IntType::get() 2025-05-07T20:11:05.7925307Z U c10::ListType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:05.7925479Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:11:05.7925602Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:11:05.7925751Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:11:05.7925872Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:11:05.7926161Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:11:05.7926327Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:05.7926454Z U c10::StringType::get() 2025-05-07T20:11:05.7926592Z U 
c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:05.7927017Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:05.7927176Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:05.7927297Z U c10::SymInt::promote_to_negative() 2025-05-07T20:11:05.7927424Z U c10::SymInt::toSymNode() const 2025-05-07T20:11:05.7927569Z U c10::SymIntType::get() 2025-05-07T20:11:05.7927727Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:11:05.7927842Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:11:05.7928313Z U c10::TensorImpl::set_autograd_meta(std::unique_ptr >) 2025-05-07T20:11:05.7928494Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:05.7928594Z U c10::TensorType::get() 2025-05-07T20:11:05.7928810Z U c10::Type::isSubtypeOfExt(c10::Type const&, std::ostream*) const 2025-05-07T20:11:05.7928921Z U c10::Type::is_module() const 2025-05-07T20:11:05.7929073Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:05.7929845Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:11:05.7929983Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:11:05.7930105Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:11:05.7930248Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:11:05.7930369Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:11:05.7930496Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:11:05.7930633Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:11:05.7930886Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:11:05.7930989Z U c10::cuda::device_count() 2025-05-07T20:11:05.7931146Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:11:05.7931282Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:11:05.7931432Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:11:05.7931587Z U 
c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:11:05.7931744Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:11:05.7931861Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:11:05.7932322Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:05.7932857Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:05.7933124Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:05.7933651Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:05.7934004Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:05.7934616Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:05.7934911Z U c10::impl::ExcludeDispatchKeyGuard::ExcludeDispatchKeyGuard(c10::DispatchKeySet) 2025-05-07T20:11:05.7935115Z U c10::impl::ExcludeDispatchKeyGuard::~ExcludeDispatchKeyGuard() 2025-05-07T20:11:05.7935239Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:11:05.7935362Z U c10::impl::GPUTrace::haveState 2025-05-07T20:11:05.7935722Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:11:05.7935938Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:11:05.7936102Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:11:05.7936276Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:11:05.7936399Z U c10::impl::device_guard_impl_registry 2025-05-07T20:11:05.7936558Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:11:05.7936720Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:11:05.7937101Z U 
c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:05.7937267Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:11:05.7937428Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:11:05.7937592Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:11:05.7937731Z U c10::throwNullDataPtrError() 2025-05-07T20:11:05.7937851Z U c10::typeKindToString(c10::TypeKind) 2025-05-07T20:11:05.7937957Z U c10::warn(c10::Warning const&) 2025-05-07T20:11:05.7938093Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:11:05.7938287Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:05.7938408Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:11:05.7938556Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:11:05.7938681Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:11:05.7938818Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:11:05.7938933Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:11:05.7939072Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:11:05.7939194Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:11:05.7939312Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:11:05.7939541Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:11:05.7939673Z U cudaFuncSetAttribute@libcudart.so.12 2025-05-07T20:11:05.7939811Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:11:05.7939958Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:11:05.7940084Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:11:05.7940197Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:11:05.7940314Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:11:05.7940548Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:11:05.7940668Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:11:05.7940887Z U fbgemm_gpu::asynchronous_complete_cumsum_cpu(at::Tensor const&) 2025-05-07T20:11:05.7941052Z U float* at::TensorBase::mutable_data_ptr() const 
2025-05-07T20:11:05.7941153Z U free@GLIBC_2.2.5 2025-05-07T20:11:05.7941302Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:05.7941405Z U log2f@GLIBC_2.2.5 2025-05-07T20:11:05.7941527Z U long at::Tensor::item() const 2025-05-07T20:11:05.7941715Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:05.7941865Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:11:05.7942025Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:05.7942125Z U memcmp@GLIBC_2.2.5 2025-05-07T20:11:05.7942230Z U memcpy@GLIBC_2.14 2025-05-07T20:11:05.7942326Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:05.7942427Z U memset@GLIBC_2.2.5 2025-05-07T20:11:05.7942626Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:05.7942784Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:05.7942877Z U realloc@GLIBC_2.2.5 2025-05-07T20:11:05.7943097Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:11:05.7943471Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:05.7943897Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:05.7944265Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const &@GLIBCXX_3.4.29 2025-05-07T20:11:05.7944644Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:11:05.7944788Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:05.7944927Z U std::__throw_bad_array_new_length() 2025-05-07T20:11:05.7945070Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:05.7945210Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:05.7945399Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:11:05.7945540Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:11:05.7945678Z U 
std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:11:05.7945929Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:05.7946316Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:11:05.7946935Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:05.7947475Z U std::basic_ostream >& std::operator<< >(std::basic_ostream >&, char const*)@GLIBCXX_3.4 2025-05-07T20:11:05.7947607Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:11:05.7947735Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:05.7947868Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:05.7947984Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:11:05.7948103Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:05.7948224Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:11:05.7948343Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:05.7948530Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:05.7948792Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:05.7948921Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:05.7949097Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:11:05.7949251Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:11:05.7949700Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:11:05.7949839Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:11:05.7949959Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:05.7950055Z U strcmp@GLIBC_2.2.5 2025-05-07T20:11:05.7950153Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:05.7950303Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:05.7950928Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 
2025-05-07T20:11:05.7951429Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:05.7951737Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:05.7951857Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:11:05.7952155Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:11:05.7952363Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:11:05.7952595Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:11:05.7952779Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:11:05.7953153Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:11:05.7953310Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:11:05.7953498Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:11:05.7953686Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:11:05.7953815Z U torch::autograd::Node::assign_parent() 2025-05-07T20:11:05.7953934Z U torch::autograd::Node::metadata() 2025-05-07T20:11:05.7954084Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:11:05.7954342Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:11:05.7954614Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:11:05.7954767Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:11:05.7954988Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:11:05.7955214Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:11:05.7958031Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, 
c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:11:05.7958193Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:11:05.7958355Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:11:05.7958519Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:11:05.7958674Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:11:05.7959110Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:11:05.7959481Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:05.7960087Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:11:05.7960215Z U typeinfo for c10::Error 2025-05-07T20:11:05.7960314Z U typeinfo for c10::Type 2025-05-07T20:11:05.7960456Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:11:05.7960620Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:11:05.7960754Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:11:05.7960867Z U typeinfo for torch::autograd::Node 2025-05-07T20:11:05.7961033Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:05.7961190Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:05.7961375Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:11:05.7961544Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:05.7961700Z U vtable for __cxxabiv1::__vmi_class_type_info@CXXABI_1.3 2025-05-07T20:11:05.7961793Z U vtable for c10::Error 2025-05-07T20:11:05.7961907Z U vtable for c10::ListType 2025-05-07T20:11:05.7962265Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:05.7962604Z U vtable for 
std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:05.7962971Z U vtable for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:05.7963108Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:11:05.7963316Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:11:05.7963669Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:05.7963800Z U vtable for torch::autograd::AutogradMeta 2025-05-07T20:11:05.7963906Z U vtable for torch::autograd::Node 2025-05-07T20:11:05.7964090Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:11:05.7964198Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:05.7964302Z w _ITM_registerTMCloneTable 2025-05-07T20:11:05.7964414Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:05.7964519Z w __cxa_pure_virtual@CXXABI_1.3 2025-05-07T20:11:05.7964610Z w __gmon_start__ 2025-05-07T20:11:05.7964702Z w __pthread_key_create 2025-05-07T20:11:05.7964824Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:11:05.7964940Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:11:05.7965081Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:05.7965312Z + ldd ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:11:05.7965320Z 2025-05-07T20:11:05.7965449Z linux-vdso.so.1 (0x00007fff93247000) 2025-05-07T20:11:05.7965539Z libc10.so => not found 2025-05-07T20:11:05.7965648Z libc10_cuda.so => not found 2025-05-07T20:11:05.7966215Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007f8b9f850000) 2025-05-07T20:11:05.7966671Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007f8b9e600000) 2025-05-07T20:11:05.7966782Z libtorch.so => not found 2025-05-07T20:11:05.7966880Z libtorch_cpu.so => not found 2025-05-07T20:11:05.7966984Z libtorch_cuda.so => not 
found 2025-05-07T20:11:05.7967076Z libcudart.so.12 => not found 2025-05-07T20:11:05.7967281Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f8b9e39c000) 2025-05-07T20:11:05.7967434Z libm.so.6 => /lib64/libm.so.6 (0x00007f8b9f775000) 2025-05-07T20:11:05.7967579Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f8ba1dbb000) 2025-05-07T20:11:05.7967717Z libc.so.6 => /lib64/libc.so.6 (0x00007f8b9e194000) 2025-05-07T20:11:05.7967846Z /lib64/ld-linux-x86-64.so.2 (0x00007f8ba1def000) 2025-05-07T20:11:05.7967928Z libc10.so => not found 2025-05-07T20:11:05.7968029Z libc10_cuda.so => not found 2025-05-07T20:11:05.7968144Z libtorch.so => not found 2025-05-07T20:11:05.7968238Z libtorch_cpu.so => not found 2025-05-07T20:11:05.7968323Z libtorch_cuda.so => not found 2025-05-07T20:11:05.7968431Z libcudart.so.12 => not found 2025-05-07T20:11:05.7968522Z libtorch.so => not found 2025-05-07T20:11:05.7968602Z libc10.so => not found 2025-05-07T20:11:05.7968706Z libc10_cuda.so => not found 2025-05-07T20:11:05.7968827Z libtorch_cpu.so => not found 2025-05-07T20:11:05.7968912Z libtorch_cuda.so => not found 2025-05-07T20:11:05.7969000Z libcudart.so.12 => not found 2025-05-07T20:11:05.7969007Z 2025-05-07T20:11:05.7969126Z [CHECK] Displaying ELF information: 2025-05-07T20:11:05.7969378Z + readelf -d ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:11:05.7969383Z 2025-05-07T20:11:05.7982421Z 2025-05-07T20:11:05.7982983Z Dynamic section at offset 0x2201930 contains 41 entries: 2025-05-07T20:11:05.7983335Z Tag Type Name/Value 2025-05-07T20:11:05.7983960Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:05.7984480Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:05.7984735Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_sparse_async_cumsum.so] 2025-05-07T20:11:05.7984957Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_utils.so] 2025-05-07T20:11:05.7985153Z 0x0000000000000001 (NEEDED) 
Shared library: [libtorch.so] 2025-05-07T20:11:05.7985374Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:05.7985585Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:05.7985788Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:11:05.7985997Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:05.7986187Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:11:05.7986382Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:05.7986579Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:05.7986794Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:11:05.7987043Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_index_select.so] 2025-05-07T20:11:05.7987237Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:11:05.7987351Z 0x000000000000000c (INIT) 0x51000 2025-05-07T20:11:05.7987468Z 0x000000000000000d (FINI) 0x14a27c 2025-05-07T20:11:05.7987583Z 0x0000000000000019 (INIT_ARRAY) 0x2201bc8 2025-05-07T20:11:05.7987718Z 0x000000000000001b (INIT_ARRAYSZ) 144 (bytes) 2025-05-07T20:11:05.7987834Z 0x000000000000001a (FINI_ARRAY) 0x2201c58 2025-05-07T20:11:05.7987951Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:05.7988067Z 0x0000000000000004 (HASH) 0x238 2025-05-07T20:11:05.7988185Z 0x000000006ffffef5 (GNU_HASH) 0x2900 2025-05-07T20:11:05.7988300Z 0x0000000000000005 (STRTAB) 0xda10 2025-05-07T20:11:05.7988418Z 0x0000000000000006 (SYMTAB) 0x4f80 2025-05-07T20:11:05.7988552Z 0x000000000000000a (STRSZ) 224745 (bytes) 2025-05-07T20:11:05.7988671Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:05.7988879Z 0x0000000000000003 (PLTGOT) 0x2202c00 2025-05-07T20:11:05.7989020Z 0x0000000000000002 (PLTRELSZ) 11784 (bytes) 2025-05-07T20:11:05.7989161Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:05.7989271Z 0x0000000000000017 
(JMPREL) 0x4da10 2025-05-07T20:11:05.7989388Z 0x0000000000000007 (RELA) 0x45508 2025-05-07T20:11:05.7989521Z 0x0000000000000008 (RELASZ) 34056 (bytes) 2025-05-07T20:11:05.7989640Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:05.7989774Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:05.7989907Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:05.7990019Z 0x000000006ffffffe (VERNEED) 0x45388 2025-05-07T20:11:05.7990127Z 0x000000006fffffff (VERNEEDNUM) 6 2025-05-07T20:11:05.7990264Z 0x000000006ffffff0 (VERSYM) 0x447fa 2025-05-07T20:11:05.7990410Z 0x000000006ffffff9 (RELACOUNT) 388 2025-05-07T20:11:05.7990528Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:05.7990535Z 2025-05-07T20:11:05.7990678Z ################################################################################ 2025-05-07T20:11:05.7990685Z 2025-05-07T20:11:05.7990689Z 2025-05-07T20:11:05.7990806Z ################################################################################ 2025-05-07T20:11:05.7991052Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:11:05.7991181Z [CHECK] Listing out library size: 2025-05-07T20:11:05.7991421Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:11:05.7991426Z 2025-05-07T20:11:05.7997100Z 74 ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:11:05.7997179Z 2025-05-07T20:11:05.7997915Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:11:05.7998393Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_py.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:05.7998399Z 2025-05-07T20:11:05.8364966Z GLIBC_2.2.5 2025-05-07T20:11:05.8365154Z GLIBC_2.3 2025-05-07T20:11:05.8365279Z GLIBC_2.14 2025-05-07T20:11:05.8365415Z 2025-05-07T20:11:05.8365424Z 2025-05-07T20:11:05.8365984Z [CHECK] Listing out the GLIBCXX versions 
referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:11:05.8366531Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_py.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:05.8366537Z 2025-05-07T20:11:05.8728859Z GLIBCXX_3.4 2025-05-07T20:11:05.8729157Z GLIBCXX_3.4.9 2025-05-07T20:11:05.8729430Z GLIBCXX_3.4.11 2025-05-07T20:11:05.8729661Z GLIBCXX_3.4.14 2025-05-07T20:11:05.8729890Z GLIBCXX_3.4.15 2025-05-07T20:11:05.8730146Z GLIBCXX_3.4.18 2025-05-07T20:11:05.8730400Z GLIBCXX_3.4.19 2025-05-07T20:11:05.8730622Z GLIBCXX_3.4.20 2025-05-07T20:11:05.8730877Z GLIBCXX_3.4.21 2025-05-07T20:11:05.8731100Z GLIBCXX_3.4.29 2025-05-07T20:11:05.8731124Z 2025-05-07T20:11:05.8731144Z 2025-05-07T20:11:05.8748494Z + nm -gDC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_py.so > /tmp/tmp.86fWVvw49d.symbols.txt 2025-05-07T20:11:05.8748524Z 2025-05-07T20:11:05.9051565Z 2025-05-07T20:11:05.9078379Z [CHECK] Total Number of symbols: 6350 2025-05-07T20:11:05.9114755Z [CHECK] Number of fbgemm symbols: 4411 2025-05-07T20:11:05.9133029Z + nm -gDCu ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_py.so > /tmp/tmp.mxz2wvzlg7.usymbols.txt 2025-05-07T20:11:05.9133048Z 2025-05-07T20:11:05.9177918Z 2025-05-07T20:11:05.9208070Z [CHECK] Listing out undefined symbols (483 total): 2025-05-07T20:11:05.9225101Z U GOMP_parallel 2025-05-07T20:11:05.9225550Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:05.9226107Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:05.9226374Z U VTT for std::basic_ifstream >@GLIBCXX_3.4 2025-05-07T20:11:05.9226652Z U VTT for std::basic_ofstream >@GLIBCXX_3.4 2025-05-07T20:11:05.9226769Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:05.9226913Z U __assert_fail@GLIBC_2.2.5 2025-05-07T20:11:05.9227071Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:11:05.9227270Z U 
__cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:11:05.9227416Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:11:05.9227589Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:11:05.9227726Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:11:05.9227909Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:11:05.9228090Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:11:05.9228224Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:11:05.9228337Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:05.9228472Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:05.9228585Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:05.9228704Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:11:05.9228818Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:11:05.9228962Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:05.9229075Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:05.9229185Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:05.9229323Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:11:05.9229430Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:11:05.9229546Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:05.9229661Z U __once_proxy@GLIBCXX_3.4.11 2025-05-07T20:11:05.9229791Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:11:05.9229981Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:11:05.9230117Z U at::RecordFunction::currentThreadId() 2025-05-07T20:11:05.9230271Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:11:05.9230429Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:11:05.9230554Z U at::SplitUntil32Bit::begin() const 2025-05-07T20:11:05.9230697Z U at::SplitUntil32Bit::end() const 2025-05-07T20:11:05.9230849Z U at::SplitUntil32Bit::iterator::operator*() const 2025-05-07T20:11:05.9230990Z U at::SplitUntil32Bit::iterator::operator++() 2025-05-07T20:11:05.9231245Z U at::Tensor::index(std::initializer_list) const 2025-05-07T20:11:05.9231447Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 
2025-05-07T20:11:05.9231638Z U at::TensorIteratorBase::build(at::TensorIteratorConfig&) 2025-05-07T20:11:05.9231836Z U at::TensorIteratorBase::can_use_32bit_indexing() const 2025-05-07T20:11:05.9231987Z U at::TensorIteratorBase::data_ptr(long) const 2025-05-07T20:11:05.9232130Z U at::TensorIteratorBase::is_contiguous() const 2025-05-07T20:11:05.9232286Z U at::TensorIteratorBase::numel() const 2025-05-07T20:11:05.9232450Z U at::TensorIteratorBase::with_32bit_indexing() const 2025-05-07T20:11:05.9232676Z U at::TensorIteratorConfig::add_borrowed_input(at::TensorBase const&) 2025-05-07T20:11:05.9232930Z U at::TensorIteratorConfig::add_borrowed_output(at::TensorBase const&) 2025-05-07T20:11:05.9233053Z U at::TensorMaker::make_tensor() 2025-05-07T20:11:05.9233201Z U at::_ops::_is_all_true::call(at::Tensor const&) 2025-05-07T20:11:05.9233397Z U at::_ops::_unique::call(at::Tensor const&, bool, bool) 2025-05-07T20:11:05.9233764Z U at::_ops::add_Tensor::call(at::Tensor const&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:11:05.9233995Z U at::_ops::add__Tensor::call(at::Tensor&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:11:05.9234129Z U at::_ops::all::call(at::Tensor const&) 2025-05-07T20:11:05.9234549Z U at::_ops::baddbmm::call(at::Tensor const&, at::Tensor const&, at::Tensor const&, c10::Scalar const&, c10::Scalar const&) 2025-05-07T20:11:05.9234776Z U at::_ops::broadcast_to::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:11:05.9234974Z U at::_ops::cat::call(c10::IListRef const&, long) 2025-05-07T20:11:05.9235198Z U at::_ops::cat_out::call(c10::IListRef const&, long, at::Tensor&) 2025-05-07T20:11:05.9235410Z U at::_ops::clamp_max::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:11:05.9235657Z U at::_ops::clone::call(at::Tensor const&, std::optional) 2025-05-07T20:11:05.9235852Z U at::_ops::contiguous::call(at::Tensor const&, c10::MemoryFormat) 2025-05-07T20:11:05.9236024Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 
2025-05-07T20:11:05.9236285Z U at::_ops::cumsum::call(at::Tensor const&, long, std::optional) 2025-05-07T20:11:05.9236644Z U at::_ops::diff::call(at::Tensor const&, long, long, std::optional const&, std::optional const&) 2025-05-07T20:11:05.9236831Z U at::_ops::div_Scalar::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:11:05.9237469Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:05.9238148Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:05.9238328Z U at::_ops::fill__Scalar::call(at::Tensor&, c10::Scalar const&) 2025-05-07T20:11:05.9238549Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:11:05.9238688Z U at::_ops::floor::call(at::Tensor const&) 2025-05-07T20:11:05.9239248Z U at::_ops::full::call(c10::ArrayRef, c10::Scalar const&, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:05.9239450Z U at::_ops::ge_Scalar::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:11:05.9239785Z U at::_ops::index_put_::call(at::Tensor&, c10::List > const&, at::Tensor const&, bool) 2025-05-07T20:11:05.9240025Z U at::_ops::index_select::call(at::Tensor const&, long, at::Tensor const&) 2025-05-07T20:11:05.9240160Z U at::_ops::item::call(at::Tensor const&) 2025-05-07T20:11:05.9240340Z U at::_ops::le_Tensor::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:11:05.9240472Z U at::_ops::max::call(at::Tensor const&) 2025-05-07T20:11:05.9240678Z U at::_ops::mul_Tensor::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:11:05.9241381Z U at::_ops::ones_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:05.9241587Z U at::_ops::permute::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:11:05.9242136Z U at::_ops::range::call(c10::Scalar const&, 
c10::Scalar const&, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:05.9242355Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:11:05.9242686Z U at::_ops::resize_::call(at::Tensor const&, c10::ArrayRef, std::optional) 2025-05-07T20:11:05.9242864Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:11:05.9243341Z U at::_ops::set__source_Storage_storage_offset::call(at::Tensor&, c10::Storage, c10::SymInt, c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:05.9243736Z U at::_ops::slice_Tensor::call(at::Tensor const&, long, std::optional, std::optional, c10::SymInt) 2025-05-07T20:11:05.9243888Z U at::_ops::sort::call(at::Tensor const&, long, bool) 2025-05-07T20:11:05.9244150Z U at::_ops::split_sizes::call(at::Tensor const&, c10::ArrayRef, long) 2025-05-07T20:11:05.9244328Z U at::_ops::squeeze_dim::call(at::Tensor const&, long) 2025-05-07T20:11:05.9244570Z U at::_ops::sub_Tensor::call(at::Tensor const&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:11:05.9244766Z U at::_ops::sum::call(at::Tensor const&, std::optional) 2025-05-07T20:11:05.9245053Z U at::_ops::tensor_split_indices::call(at::Tensor const&, c10::ArrayRef, long) 2025-05-07T20:11:05.9245367Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:11:05.9246038Z U at::_ops::to_dtype_layout::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, bool, bool, std::optional) 2025-05-07T20:11:05.9246217Z U at::_ops::transpose_int::call(at::Tensor const&, long, long) 2025-05-07T20:11:05.9246480Z U at::_ops::unique_consecutive::call(at::Tensor const&, bool, bool, std::optional) 2025-05-07T20:11:05.9246657Z U at::_ops::unsqueeze::call(at::Tensor const&, long) 2025-05-07T20:11:05.9246840Z U at::_ops::view::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:11:05.9247009Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 
2025-05-07T20:11:05.9247152Z U at::_ops::zero_::call(at::Tensor&) 2025-05-07T20:11:05.9247644Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:05.9248237Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:05.9248545Z U at::checkScalarTypes(char const*, at::TensorArg const&, c10::ArrayRef) 2025-05-07T20:11:05.9248678Z U at::cuda::getCurrentCUDABlasHandle() 2025-05-07T20:11:05.9248816Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:11:05.9248984Z U at::cuda::getDeviceProperties(signed char) 2025-05-07T20:11:05.9249226Z U at::cuda::get_p2p_access(signed char, signed char) 2025-05-07T20:11:05.9249583Z U at::detail::computeStorageNbytes(c10::ArrayRef, c10::ArrayRef, unsigned long, unsigned long) 2025-05-07T20:11:05.9249738Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:11:05.9249903Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:11:05.9250010Z U at::get_num_threads() 2025-05-07T20:11:05.9250138Z U at::get_thread_num() 2025-05-07T20:11:05.9250272Z U at::in_parallel_region() 2025-05-07T20:11:05.9250377Z U at::init_num_threads() 2025-05-07T20:11:05.9250649Z U at::internal::OpaqueOptionalTensorRef::~OpaqueOptionalTensorRef() 2025-05-07T20:11:05.9250767Z U at::internal::set_thread_num(int) 2025-05-07T20:11:05.9251014Z U at::native::_rowwise_prune(at::Tensor const&, at::Tensor const&, c10::ScalarType) 2025-05-07T20:11:05.9251655Z U at::native::empty_like(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:05.9252300Z U at::native::empty_meta_symint(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:05.9252606Z U at::native::resize_(at::Tensor const&, c10::ArrayRef, std::optional) 2025-05-07T20:11:05.9252780Z U at::print(std::ostream&, at::Tensor const&, long) 
2025-05-07T20:11:05.9252916Z U at::sequence_number::get_and_increment() 2025-05-07T20:11:05.9253085Z U at::tensor(c10::ArrayRef, c10::TensorOptions const&) 2025-05-07T20:11:05.9253223Z U bool at::Tensor::item() const 2025-05-07T20:11:05.9253363Z U bool* at::TensorBase::data_ptr() const 2025-05-07T20:11:05.9253521Z U bool* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:05.9253650Z U c10::AnyType::get() 2025-05-07T20:11:05.9253820Z U c10::AutogradMetaInterface::~AutogradMetaInterface() 2025-05-07T20:11:05.9254004Z U c10::BFloat16* at::TensorBase::data_ptr() const 2025-05-07T20:11:05.9254234Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:05.9254341Z U c10::BoolType::get() 2025-05-07T20:11:05.9254456Z U c10::DeviceObjType::get() 2025-05-07T20:11:05.9254641Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:05.9254830Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:11:05.9254950Z U c10::Dispatcher::realSingleton() 2025-05-07T20:11:05.9255497Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:11:05.9256142Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:11:05.9256702Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:05.9256848Z U c10::Error::what() const 2025-05-07T20:11:05.9256956Z U c10::FloatType::get() 2025-05-07T20:11:05.9257071Z U c10::GradMode::is_enabled() 2025-05-07T20:11:05.9257212Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:11:05.9257370Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:11:05.9257552Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:05.9257745Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:11:05.9257876Z U c10::IValue::hash(c10::IValue const&) 
2025-05-07T20:11:05.9257995Z U c10::IValue::isBoolList() const 2025-05-07T20:11:05.9258128Z U c10::IValue::isIntList() const 2025-05-07T20:11:05.9258249Z U c10::IValue::isSymIntList() const 2025-05-07T20:11:05.9258370Z U c10::IValue::isTensorList() const 2025-05-07T20:11:05.9258549Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:05.9258717Z U c10::InferenceMode::is_enabled() 2025-05-07T20:11:05.9258822Z U c10::IntType::get() 2025-05-07T20:11:05.9259408Z U c10::ListType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:05.9259658Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:11:05.9259787Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:11:05.9259928Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:11:05.9260088Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:11:05.9260322Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:05.9260493Z U c10::ParallelGuard::ParallelGuard(bool) 2025-05-07T20:11:05.9260651Z U c10::ParallelGuard::~ParallelGuard() 2025-05-07T20:11:05.9260784Z U c10::ScalarTypeType::get() 2025-05-07T20:11:05.9261078Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:11:05.9261434Z U c10::SmallVectorBase::mallocForGrow(unsigned long, unsigned long, unsigned long&) 2025-05-07T20:11:05.9261604Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:05.9261721Z U c10::StringType::get() 2025-05-07T20:11:05.9261893Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:11:05.9262047Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:05.9262209Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:11:05.9262659Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:05.9262802Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:05.9262949Z U c10::SymInt::operator%(c10::SymInt const&) 
const 2025-05-07T20:11:05.9263117Z U c10::SymInt::operator*(c10::SymInt const&) const 2025-05-07T20:11:05.9263255Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:11:05.9263378Z U c10::SymInt::promote_to_negative() 2025-05-07T20:11:05.9263534Z U c10::SymInt::sym_eq(c10::SymInt const&) const 2025-05-07T20:11:05.9263670Z U c10::SymInt::sym_ne(c10::SymInt const&) const 2025-05-07T20:11:05.9263787Z U c10::SymInt::toSymNode() const 2025-05-07T20:11:05.9263899Z U c10::SymIntType::get() 2025-05-07T20:11:05.9264141Z U c10::SymbolicShapeMeta::init_is_channels_last_3d_contiguous() const 2025-05-07T20:11:05.9264359Z U c10::SymbolicShapeMeta::init_is_channels_last_contiguous() const 2025-05-07T20:11:05.9264520Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:11:05.9264664Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:11:05.9265128Z U c10::TensorImpl::set_autograd_meta(std::unique_ptr >) 2025-05-07T20:11:05.9265288Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:05.9265469Z U c10::TensorImpl::throw_storage_access_error() const 2025-05-07T20:11:05.9265576Z U c10::TensorType::get() 2025-05-07T20:11:05.9266411Z U c10::TupleType::TupleType(std::vector, std::allocator > >, std::optional, std::shared_ptr) 2025-05-07T20:11:05.9267670Z U c10::Type::isSubtypeOfExt(c10::Type const&, std::ostream*) const 2025-05-07T20:11:05.9267801Z U c10::Type::is_module() const 2025-05-07T20:11:05.9267971Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:05.9268747Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:11:05.9268921Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:11:05.9269127Z U c10::cuda::CUDAKernelLaunchRegistry::get_singleton_ref() 2025-05-07T20:11:05.9269407Z U c10::cuda::CUDAKernelLaunchRegistry::get_uvm_assertions_ptr_for_current_device() 2025-05-07T20:11:05.9269765Z U 
c10::cuda::CUDAKernelLaunchRegistry::insert(char const*, char const*, unsigned int, char const*, int) 2025-05-07T20:11:05.9269947Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:11:05.9270078Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:11:05.9270202Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:11:05.9270361Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:11:05.9270480Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:11:05.9270742Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:11:05.9270859Z U c10::cuda::current_device() 2025-05-07T20:11:05.9270997Z U c10::cuda::device_count() 2025-05-07T20:11:05.9271142Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:11:05.9271287Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:11:05.9271458Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:11:05.9271606Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:11:05.9271773Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:11:05.9271916Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:11:05.9272476Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:05.9273187Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:05.9273471Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:05.9273983Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:05.9274358Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:05.9274967Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:05.9275255Z U 
c10::impl::ExcludeDispatchKeyGuard::ExcludeDispatchKeyGuard(c10::DispatchKeySet) 2025-05-07T20:11:05.9275481Z U c10::impl::ExcludeDispatchKeyGuard::~ExcludeDispatchKeyGuard() 2025-05-07T20:11:05.9275609Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:11:05.9275725Z U c10::impl::GPUTrace::haveState 2025-05-07T20:11:05.9276081Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:11:05.9276275Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:11:05.9276411Z U c10::impl::PyObjectSlot::PyObjectSlot() 2025-05-07T20:11:05.9276606Z U c10::impl::PyObjectSlot::~PyObjectSlot() 2025-05-07T20:11:05.9276760Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:11:05.9276965Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:11:05.9277114Z U c10::impl::device_guard_impl_registry 2025-05-07T20:11:05.9277240Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:11:05.9277443Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:11:05.9277892Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:05.9278025Z U c10::operator*(c10::SymInt const&, int) 2025-05-07T20:11:05.9278153Z U c10::operator+(c10::SymInt const&, int) 2025-05-07T20:11:05.9278304Z U c10::operator+(c10::SymInt const&, unsigned long) 2025-05-07T20:11:05.9278482Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:11:05.9278633Z U c10::operator-(c10::SymInt const&, unsigned long) 2025-05-07T20:11:05.9278762Z U c10::operator/(c10::SymInt const&, int) 2025-05-07T20:11:05.9278917Z U c10::operator<(c10::SymInt const&, int) 2025-05-07T20:11:05.9279072Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:11:05.9279221Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:11:05.9279421Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:11:05.9279573Z U c10::operator<<(std::ostream&, 
c10::SymInt const&) 2025-05-07T20:11:05.9279704Z U c10::operator==(c10::SymInt const&, int) 2025-05-07T20:11:05.9279845Z U c10::operator>(c10::SymInt const&, int) 2025-05-07T20:11:05.9279969Z U c10::operator>=(c10::SymInt const&, int) 2025-05-07T20:11:05.9280089Z U c10::report_overflow(char const*) 2025-05-07T20:11:05.9280228Z U c10::throwNullDataPtrError() 2025-05-07T20:11:05.9280357Z U c10::typeKindToString(c10::TypeKind) 2025-05-07T20:11:05.9280467Z U c10::warn(c10::Warning const&) 2025-05-07T20:11:05.9280607Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:11:05.9280807Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:05.9280927Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:11:05.9281048Z U cublasGemmStridedBatchedEx 2025-05-07T20:11:05.9281168Z U cublasSetStream_v2 2025-05-07T20:11:05.9281301Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:11:05.9281434Z U cudaDeviceGetByPCIBusId@libcudart.so.12 2025-05-07T20:11:05.9281581Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:11:05.9281723Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:11:05.9281848Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:11:05.9282180Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:11:05.9282302Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:11:05.9282425Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:11:05.9282558Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:11:05.9282697Z U cudaFree@libcudart.so.12 2025-05-07T20:11:05.9282836Z U cudaFuncGetAttributes@libcudart.so.12 2025-05-07T20:11:05.9282964Z U cudaFuncSetAttribute@libcudart.so.12 2025-05-07T20:11:05.9283125Z U cudaGetDevice@libcudart.so.12 2025-05-07T20:11:05.9283254Z U cudaGetDeviceCount@libcudart.so.12 2025-05-07T20:11:05.9283408Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:11:05.9283561Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:11:05.9283732Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:11:05.9283878Z U 
cudaHostGetDevicePointer@libcudart.so.12 2025-05-07T20:11:05.9284045Z U cudaHostRegister@libcudart.so.12 2025-05-07T20:11:05.9284199Z U cudaHostUnregister@libcudart.so.12 2025-05-07T20:11:05.9284320Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:11:05.9284446Z U cudaMallocManaged@libcudart.so.12 2025-05-07T20:11:05.9284617Z U cudaMemAdvise@libcudart.so.12 2025-05-07T20:11:05.9284746Z U cudaMemPrefetchAsync@libcudart.so.12 2025-05-07T20:11:05.9284863Z U cudaMemcpy2DAsync@libcudart.so.12 2025-05-07T20:11:05.9284984Z U cudaMemcpyAsync@libcudart.so.12 2025-05-07T20:11:05.9285123Z U cudaMemsetAsync@libcudart.so.12 2025-05-07T20:11:05.9285455Z U cudaOccupancyMaxActiveBlocksPerMultiprocessorWithFlags@libcudart.so.12 2025-05-07T20:11:05.9285588Z U cudaPeekAtLastError@libcudart.so.12 2025-05-07T20:11:05.9285727Z U cudaSetDevice@libcudart.so.12 2025-05-07T20:11:05.9285850Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:11:05.9285985Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:11:05.9286134Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:11:05.9286289Z U double* at::TensorBase::data_ptr() const 2025-05-07T20:11:05.9286466Z U double* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:05.9286591Z U exit@GLIBC_2.2.5 2025-05-07T20:11:05.9286691Z U exp10@GLIBC_2.2.5 2025-05-07T20:11:05.9286791Z U exp@GLIBC_2.2.5 2025-05-07T20:11:05.9286891Z U expf@GLIBC_2.2.5 2025-05-07T20:11:05.9287113Z U fbgemm_gpu::asynchronous_complete_cumsum_cpu(at::Tensor const&) 2025-05-07T20:11:05.9287325Z U fbgemm_gpu::asynchronous_complete_cumsum_gpu(at::Tensor const&) 2025-05-07T20:11:05.9287532Z U fbgemm_gpu::asynchronous_exclusive_cumsum_cpu(at::Tensor const&) 2025-05-07T20:11:05.9287752Z U fbgemm_gpu::asynchronous_exclusive_cumsum_gpu(at::Tensor const&) 2025-05-07T20:11:05.9287956Z U fbgemm_gpu::asynchronous_inclusive_cumsum_gpu(at::Tensor const&) 2025-05-07T20:11:05.9288107Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:11:05.9288298Z U float* 
at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:05.9288405Z U fminf@GLIBC_2.2.5 2025-05-07T20:11:05.9288505Z U fmod@GLIBC_2.2.5 2025-05-07T20:11:05.9288630Z U free@GLIBC_2.2.5 2025-05-07T20:11:05.9288745Z U get_info_B_num_bits_from_T(int, int) 2025-05-07T20:11:05.9288867Z U int at::Tensor::item() const 2025-05-07T20:11:05.9289042Z U int const* at::TensorBase::const_data_ptr() const 2025-05-07T20:11:05.9289182Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:11:05.9289321Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:05.9289418Z U lgamma@GLIBC_2.2.5 2025-05-07T20:11:05.9289524Z U llrint@GLIBC_2.2.5 2025-05-07T20:11:05.9289617Z U log10@GLIBC_2.2.5 2025-05-07T20:11:05.9289709Z U log2@GLIBC_2.2.5 2025-05-07T20:11:05.9289823Z U log@GLIBC_2.2.5 2025-05-07T20:11:05.9289937Z U long at::Tensor::item() const 2025-05-07T20:11:05.9290116Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:05.9290300Z U long const* at::TensorBase::const_data_ptr() const 2025-05-07T20:11:05.9290437Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:11:05.9290595Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:05.9290723Z U lrint@GLIBC_2.2.5 2025-05-07T20:11:05.9290869Z U madvise@GLIBC_2.2.5 2025-05-07T20:11:05.9290965Z U malloc@GLIBC_2.2.5 2025-05-07T20:11:05.9291062Z U memcmp@GLIBC_2.2.5 2025-05-07T20:11:05.9291173Z U memcpy@GLIBC_2.14 2025-05-07T20:11:05.9291275Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:05.9291370Z U memset@GLIBC_2.2.5 2025-05-07T20:11:05.9291526Z U nvmlDeviceGetCount_v2 2025-05-07T20:11:05.9291650Z U nvmlDeviceGetHandleByIndex_v2 2025-05-07T20:11:05.9291785Z U nvmlDeviceGetNvLinkRemotePciInfo_v2 2025-05-07T20:11:05.9291898Z U nvmlDeviceGetNvLinkState 2025-05-07T20:11:05.9292036Z U nvmlDeviceGetPciInfo_v3 2025-05-07T20:11:05.9292132Z U nvmlInit_v2 2025-05-07T20:11:05.9292281Z U omp_get_num_threads 2025-05-07T20:11:05.9292402Z U omp_get_thread_num 2025-05-07T20:11:05.9292562Z U operator 
delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:05.9292696Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:05.9292837Z U operator new[](unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:05.9292965Z U pow@GLIBC_2.2.5 2025-05-07T20:11:05.9293068Z U realloc@GLIBC_2.2.5 2025-05-07T20:11:05.9293240Z U short* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:05.9293468Z U signed char* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:05.9293566Z U sin@GLIBC_2.2.5 2025-05-07T20:11:05.9293789Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:11:05.9293989Z U std::_Rb_tree_decrement(std::_Rb_tree_node_base*)@GLIBCXX_3.4 2025-05-07T20:11:05.9294189Z U std::_Rb_tree_increment(std::_Rb_tree_node_base const*)@GLIBCXX_3.4 2025-05-07T20:11:05.9294356Z U std::_Rb_tree_increment(std::_Rb_tree_node_base*)@GLIBCXX_3.4 2025-05-07T20:11:05.9294770Z U std::_Rb_tree_insert_and_rebalance(bool, std::_Rb_tree_node_base*, std::_Rb_tree_node_base*, std::_Rb_tree_node_base&)@GLIBCXX_3.4 2025-05-07T20:11:05.9295035Z U std::__basic_file::~__basic_file()@GLIBCXX_3.4 2025-05-07T20:11:05.9295383Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:05.9295798Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:05.9296136Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const &@GLIBCXX_3.4.29 2025-05-07T20:11:05.9296703Z U std::__cxx11::basic_stringstream, std::allocator >::~basic_stringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:05.9297110Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:11:05.9297250Z U std::__exception_ptr::exception_ptr::_M_addref() 2025-05-07T20:11:05.9297415Z U std::__exception_ptr::exception_ptr::_M_release() 2025-05-07T20:11:05.9297533Z U std::__once_call@GLIBCXX_3.4.11 2025-05-07T20:11:05.9297650Z U 
std::__once_callable@GLIBCXX_3.4.11 2025-05-07T20:11:05.9297767Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:05.9297905Z U std::__throw_bad_array_new_length() 2025-05-07T20:11:05.9298013Z U std::__throw_bad_cast()@GLIBCXX_3.4 2025-05-07T20:11:05.9298151Z U std::__throw_bad_function_call()@GLIBCXX_3.4.14 2025-05-07T20:11:05.9298316Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:05.9298492Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:05.9298656Z U std::__throw_out_of_range(char const*)@GLIBCXX_3.4 2025-05-07T20:11:05.9298852Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:11:05.9298997Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:11:05.9299142Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:11:05.9299544Z U std::basic_filebuf >::basic_filebuf()@GLIBCXX_3.4 2025-05-07T20:11:05.9299765Z U std::basic_filebuf >::close()@GLIBCXX_3.4 2025-05-07T20:11:05.9300078Z U std::basic_filebuf >::open(char const*, std::_Ios_Openmode)@GLIBCXX_3.4 2025-05-07T20:11:05.9300358Z U std::basic_filebuf >::~basic_filebuf()@GLIBCXX_3.4 2025-05-07T20:11:05.9300635Z U std::basic_ifstream >::~basic_ifstream()@GLIBCXX_3.4 2025-05-07T20:11:05.9300886Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:05.9301273Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:11:05.9301523Z U std::basic_ofstream >::~basic_ofstream()@GLIBCXX_3.4 2025-05-07T20:11:05.9302126Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:05.9302677Z U std::basic_ostream >& std::operator<< >(std::basic_ostream >&, char const*)@GLIBCXX_3.4 2025-05-07T20:11:05.9302837Z U std::chrono::_V2::system_clock::now()@GLIBCXX_3.4.19 2025-05-07T20:11:05.9302960Z U std::cout@GLIBCXX_3.4 2025-05-07T20:11:05.9303125Z U std::ctype::_M_widen_init() const@GLIBCXX_3.4.11 2025-05-07T20:11:05.9303252Z U 
std::current_exception()@CXXABI_1.3.3 2025-05-07T20:11:05.9303384Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:11:05.9303532Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:05.9303659Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:05.9303781Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:11:05.9303919Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:05.9304042Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:11:05.9304157Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:05.9304373Z U std::ostream& std::ostream::_M_insert(double)@GLIBCXX_3.4.9 2025-05-07T20:11:05.9304564Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:05.9304808Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:05.9304936Z U std::ostream::flush()@GLIBCXX_3.4 2025-05-07T20:11:05.9305070Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:05.9305196Z U std::ostream::put(char)@GLIBCXX_3.4 2025-05-07T20:11:05.9305354Z U std::ostream::write(char const*, long)@GLIBCXX_3.4 2025-05-07T20:11:05.9305529Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:11:05.9305664Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:11:05.9305906Z U std::rethrow_exception(std::__exception_ptr::exception_ptr)@CXXABI_1.3.3 2025-05-07T20:11:05.9306386Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:11:05.9306560Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:11:05.9306680Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:05.9306786Z U strcmp@GLIBC_2.2.5 2025-05-07T20:11:05.9306883Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:05.9306995Z U sysconf@GLIBC_2.2.5 2025-05-07T20:11:05.9307145Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:05.9307768Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char 
const*, unsigned int) 2025-05-07T20:11:05.9308260Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:05.9308812Z U torch::Library::_def(std::variant&&, torch::CppFunction&&, std::vector > const&) & 2025-05-07T20:11:05.9309086Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:05.9309225Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:11:05.9309536Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:11:05.9309728Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:11:05.9309944Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:11:05.9310138Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:11:05.9310508Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:11:05.9310678Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:11:05.9310878Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:11:05.9311065Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:11:05.9311205Z U torch::autograd::Node::assign_parent() 2025-05-07T20:11:05.9311321Z U torch::autograd::Node::metadata() 2025-05-07T20:11:05.9311464Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:11:05.9311736Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:11:05.9312119Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:11:05.9312261Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:11:05.9312497Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:11:05.9312712Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:11:05.9315482Z U torch::autograd::_wrap_outputs(std::vector > 
const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:11:05.9315637Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:11:05.9315830Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:11:05.9316002Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:11:05.9316159Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:11:05.9316797Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:11:05.9317172Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:05.9317575Z U torch::jit::parseSchemaOrName(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:05.9317830Z U torch::pickle_load(std::vector > const&) 2025-05-07T20:11:05.9317955Z U torch::pickle_save(c10::IValue const&) 2025-05-07T20:11:05.9318702Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:11:05.9318822Z U typeinfo for c10::Error 2025-05-07T20:11:05.9318929Z U typeinfo for c10::Type 2025-05-07T20:11:05.9319080Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:11:05.9319227Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:11:05.9319358Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:11:05.9319494Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:11:05.9319654Z U typeinfo for torch::autograd::Node 2025-05-07T20:11:05.9319853Z U unsigned char* at::TensorBase::data_ptr() const 2025-05-07T20:11:05.9320073Z U unsigned char* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:05.9320558Z U void 
fbgemm::FloatOrHalfToFused8BitRowwiseQuantizedSBFloat(float const*, unsigned long, int, unsigned char*) 2025-05-07T20:11:05.9321094Z U void fbgemm::FloatOrHalfToFused8BitRowwiseQuantizedSBFloat(unsigned short const*, unsigned long, int, unsigned char*) 2025-05-07T20:11:05.9321559Z U void fbgemm::FloatOrHalfToFusedNBitRowwiseQuantizedSBHalf(int, float const*, unsigned long, int, unsigned char*) 2025-05-07T20:11:05.9322123Z U void fbgemm::FloatOrHalfToFusedNBitRowwiseQuantizedSBHalf(int, unsigned short const*, unsigned long, int, unsigned char*) 2025-05-07T20:11:05.9322581Z U void fbgemm::Fused8BitRowwiseQuantizedSBFloatToFloatOrHalf(unsigned char const*, unsigned long, int, float*) 2025-05-07T20:11:05.9323119Z U void fbgemm::Fused8BitRowwiseQuantizedSBFloatToFloatOrHalf(unsigned char const*, unsigned long, int, unsigned short*) 2025-05-07T20:11:05.9323634Z U void fbgemm::FusedNBitRowwiseQuantizedSBHalfToFloatOrHalf(int, unsigned char const*, unsigned long, int, float*, bool) 2025-05-07T20:11:05.9324197Z U void fbgemm::FusedNBitRowwiseQuantizedSBHalfToFloatOrHalf(int, unsigned char const*, unsigned long, int, unsigned short*, bool) 2025-05-07T20:11:05.9324753Z U void fbgemm::FusedNBitRowwiseQuantizedSBHalfToFloatOrHalfRef(int, unsigned char const*, unsigned long, int, float*, bool) 2025-05-07T20:11:05.9325367Z U void fbgemm::FusedNBitRowwiseQuantizedSBHalfToFloatOrHalfRef(int, unsigned char const*, unsigned long, int, unsigned short*, bool) 2025-05-07T20:11:05.9326002Z U void fbgemm::FusedNBitRowwiseQuantizedSBHalfToFloatOrHalfRef(int, unsigned char const*, unsigned long, int, unsigned short*, bool) 2025-05-07T20:11:05.9326201Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:05.9326370Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:05.9326530Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:11:05.9326709Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:05.9326911Z U 
vtable for __cxxabiv1::__vmi_class_type_info@CXXABI_1.3 2025-05-07T20:11:05.9327028Z U vtable for at::TensorIterator 2025-05-07T20:11:05.9327171Z U vtable for at::TensorIteratorBase 2025-05-07T20:11:05.9327273Z U vtable for c10::Error 2025-05-07T20:11:05.9327377Z U vtable for c10::ListType 2025-05-07T20:11:05.9327778Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:05.9328123Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:05.9328481Z U vtable for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:05.9328629Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:11:05.9328855Z U vtable for std::basic_filebuf >@GLIBCXX_3.4 2025-05-07T20:11:05.9329080Z U vtable for std::basic_ifstream >@GLIBCXX_3.4 2025-05-07T20:11:05.9329297Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:11:05.9329523Z U vtable for std::basic_ofstream >@GLIBCXX_3.4 2025-05-07T20:11:05.9329752Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:05.9329902Z U vtable for torch::autograd::AutogradMeta 2025-05-07T20:11:05.9330021Z U vtable for torch::autograd::Node 2025-05-07T20:11:05.9330203Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:11:05.9330334Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:05.9330445Z w _ITM_registerTMCloneTable 2025-05-07T20:11:05.9330556Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:05.9330667Z w __cxa_pure_virtual@CXXABI_1.3 2025-05-07T20:11:05.9330761Z w __gmon_start__ 2025-05-07T20:11:05.9330858Z w __pthread_key_create 2025-05-07T20:11:05.9330971Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:11:05.9331101Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:11:05.9331199Z w pthread_once 2025-05-07T20:11:05.9331350Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:05.9331547Z + ldd ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:11:05.9331555Z 
2025-05-07T20:11:05.9331699Z linux-vdso.so.1 (0x00007ffdee9a8000) 2025-05-07T20:11:05.9331794Z libc10.so => not found 2025-05-07T20:11:05.9331917Z libc10_cuda.so => not found 2025-05-07T20:11:05.9332294Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm.so (0x00007ff01ce00000) 2025-05-07T20:11:05.9332399Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:05.9332517Z libtorch.so => not found 2025-05-07T20:11:05.9333082Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007ff01cc50000) 2025-05-07T20:11:05.9333548Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007ff01ba00000) 2025-05-07T20:11:05.9333667Z libtorch_cpu.so => not found 2025-05-07T20:11:05.9333795Z libtorch_cuda.so => not found 2025-05-07T20:11:05.9333915Z libcudart.so.12 => not found 2025-05-07T20:11:05.9334080Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007ff01b79c000) 2025-05-07T20:11:05.9334226Z libm.so.6 => /lib64/libm.so.6 (0x00007ff01cb75000) 2025-05-07T20:11:05.9334376Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007ff01d3d2000) 2025-05-07T20:11:05.9334497Z libc.so.6 => /lib64/libc.so.6 (0x00007ff01b594000) 2025-05-07T20:11:05.9334674Z /lib64/ld-linux-x86-64.so.2 (0x00007ff0221d0000) 2025-05-07T20:11:05.9334765Z libc10.so => not found 2025-05-07T20:11:05.9335134Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/asmjit.so (0x00007ff01d357000) 2025-05-07T20:11:05.9335243Z libtorch.so => not found 2025-05-07T20:11:05.9335343Z libtorch_cpu.so => not found 2025-05-07T20:11:05.9335441Z libtorch_cuda.so => not found 2025-05-07T20:11:05.9335570Z libc10.so => not found 2025-05-07T20:11:05.9335685Z libc10_cuda.so => not found 2025-05-07T20:11:05.9335775Z libtorch.so => not found 2025-05-07T20:11:05.9335869Z libtorch_cpu.so => not found 
2025-05-07T20:11:05.9335995Z libtorch_cuda.so => not found 2025-05-07T20:11:05.9336090Z libcudart.so.12 => not found 2025-05-07T20:11:05.9336181Z libtorch.so => not found 2025-05-07T20:11:05.9336277Z libc10.so => not found 2025-05-07T20:11:05.9336387Z libc10_cuda.so => not found 2025-05-07T20:11:05.9336481Z libtorch_cpu.so => not found 2025-05-07T20:11:05.9336581Z libtorch_cuda.so => not found 2025-05-07T20:11:05.9336693Z libcudart.so.12 => not found 2025-05-07T20:11:05.9336787Z libtorch_cpu.so => not found 2025-05-07T20:11:05.9336886Z libtorch_cuda.so => not found 2025-05-07T20:11:05.9336982Z libtorch.so => not found 2025-05-07T20:11:05.9336987Z 2025-05-07T20:11:05.9337114Z [CHECK] Displaying ELF information: 2025-05-07T20:11:05.9337317Z + readelf -d ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:11:05.9337324Z 2025-05-07T20:11:05.9337328Z 2025-05-07T20:11:05.9337497Z Dynamic section at offset 0x4953578 contains 43 entries: 2025-05-07T20:11:05.9337633Z Tag Type Name/Value 2025-05-07T20:11:05.9337829Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:05.9338036Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:05.9338256Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm.so] 2025-05-07T20:11:05.9338468Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:11:05.9338672Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:05.9338943Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_sparse_async_cumsum.so] 2025-05-07T20:11:05.9339162Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_utils.so] 2025-05-07T20:11:05.9339452Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:05.9339669Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:05.9339898Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:11:05.9340102Z 0x0000000000000001 (NEEDED) Shared 
library: [libstdc++.so.6] 2025-05-07T20:11:05.9340294Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:11:05.9340505Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:05.9340700Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:05.9340919Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:11:05.9341140Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_py.so] 2025-05-07T20:11:05.9341323Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:11:05.9341444Z 0x000000000000000c (INIT) 0x18e000 2025-05-07T20:11:05.9341596Z 0x000000000000000d (FINI) 0x7e464c 2025-05-07T20:11:05.9341753Z 0x0000000000000019 (INIT_ARRAY) 0x494d470 2025-05-07T20:11:05.9341917Z 0x000000000000001b (INIT_ARRAYSZ) 1160 (bytes) 2025-05-07T20:11:05.9342041Z 0x000000000000001a (FINI_ARRAY) 0x494d8f8 2025-05-07T20:11:05.9342181Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:05.9342295Z 0x0000000000000004 (HASH) 0x238 2025-05-07T20:11:05.9342419Z 0x000000006ffffef5 (GNU_HASH) 0x8530 2025-05-07T20:11:05.9342584Z 0x0000000000000005 (STRTAB) 0x363a0 2025-05-07T20:11:05.9342703Z 0x0000000000000006 (SYMTAB) 0x11038 2025-05-07T20:11:05.9342848Z 0x000000000000000a (STRSZ) 1209140 (bytes) 2025-05-07T20:11:05.9342993Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:05.9343118Z 0x0000000000000003 (PLTGOT) 0x4954868 2025-05-07T20:11:05.9343285Z 0x0000000000000002 (PLTRELSZ) 42168 (bytes) 2025-05-07T20:11:05.9343394Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:05.9343535Z 0x0000000000000017 (JMPREL) 0x183378 2025-05-07T20:11:05.9343660Z 0x0000000000000007 (RELA) 0x160a28 2025-05-07T20:11:05.9343805Z 0x0000000000000008 (RELASZ) 141648 (bytes) 2025-05-07T20:11:05.9343948Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:05.9344055Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:05.9344189Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:05.9344312Z 
0x000000006ffffffe (VERNEED) 0x160878 2025-05-07T20:11:05.9344446Z 0x000000006fffffff (VERNEEDNUM) 6 2025-05-07T20:11:05.9344571Z 0x000000006ffffff0 (VERSYM) 0x15d6d4 2025-05-07T20:11:05.9344683Z 0x000000006ffffff9 (RELACOUNT) 516 2025-05-07T20:11:05.9344802Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:05.9344808Z 2025-05-07T20:11:05.9344931Z ################################################################################ 2025-05-07T20:11:05.9344937Z 2025-05-07T20:11:05.9344941Z 2025-05-07T20:11:05.9345059Z ################################################################################ 2025-05-07T20:11:05.9345396Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:11:05.9345509Z [CHECK] Listing out library size: 2025-05-07T20:11:05.9345819Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:11:05.9345826Z 2025-05-07T20:11:05.9346098Z 908 ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:11:05.9346102Z 2025-05-07T20:11:05.9346535Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:11:05.9347079Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:05.9347109Z 2025-05-07T20:11:06.1141801Z GLIBC_2.2.5 2025-05-07T20:11:06.1142049Z GLIBC_2.3 2025-05-07T20:11:06.1142276Z GLIBC_2.14 2025-05-07T20:11:06.1142292Z 2025-05-07T20:11:06.1142331Z 2025-05-07T20:11:06.1143719Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:11:06.1145082Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | 
cat 2025-05-07T20:11:06.1145087Z 2025-05-07T20:11:06.2942008Z GLIBCXX_3.4 2025-05-07T20:11:06.2942659Z GLIBCXX_3.4.9 2025-05-07T20:11:06.2943295Z GLIBCXX_3.4.11 2025-05-07T20:11:06.2943909Z GLIBCXX_3.4.14 2025-05-07T20:11:06.2944476Z GLIBCXX_3.4.15 2025-05-07T20:11:06.2945078Z GLIBCXX_3.4.18 2025-05-07T20:11:06.2945640Z GLIBCXX_3.4.20 2025-05-07T20:11:06.2946250Z GLIBCXX_3.4.21 2025-05-07T20:11:06.2946805Z GLIBCXX_3.4.29 2025-05-07T20:11:06.2947192Z 2025-05-07T20:11:06.2947205Z 2025-05-07T20:11:06.2965042Z + nm -gDC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward.so > /tmp/tmp.fxa1013JL2.symbols.txt 2025-05-07T20:11:06.2965676Z 2025-05-07T20:11:06.4740801Z 2025-05-07T20:11:06.4815412Z [CHECK] Total Number of symbols: 12349 2025-05-07T20:11:06.4902212Z [CHECK] Number of fbgemm symbols: 2031 2025-05-07T20:11:06.4921184Z + nm -gDCu ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward.so > /tmp/tmp.9rJABooqAP.usymbols.txt 2025-05-07T20:11:06.4922132Z 2025-05-07T20:11:06.4979813Z 2025-05-07T20:11:06.5008524Z [CHECK] Listing out undefined symbols (289 total): 2025-05-07T20:11:06.5022303Z U GOMP_parallel 2025-05-07T20:11:06.5024277Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:06.5025275Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:06.5025886Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:06.5026310Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:11:06.5026727Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:11:06.5027149Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:11:06.5027546Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:11:06.5027964Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:11:06.5028335Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:11:06.5028744Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:11:06.5029144Z U __cxa_allocate_exception@CXXABI_1.3 
2025-05-07T20:11:06.5029488Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:06.5029844Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:06.5030180Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:06.5030538Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:11:06.5030884Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:11:06.5031241Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:06.5031602Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:06.5031937Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:06.5032294Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:11:06.5032631Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:11:06.5032985Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:06.5033323Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:11:06.5033703Z U at::CUDAGeneratorImpl::device_type() 2025-05-07T20:11:06.5034140Z U at::CUDAGeneratorImpl::philox_cuda_state(unsigned long) 2025-05-07T20:11:06.5034824Z U at::Context::deterministicAlgorithms() const 2025-05-07T20:11:06.5035399Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:11:06.5036010Z U at::RecordFunction::currentThreadId() 2025-05-07T20:11:06.5036408Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:11:06.5036795Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:11:06.5037271Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:11:06.5037939Z U at::_ops::clamp::call(at::Tensor const&, std::optional const&, std::optional const&) 2025-05-07T20:11:06.5038546Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:11:06.5039453Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:06.5040924Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:06.5042073Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:11:06.5043177Z U 
at::_ops::sparse_coo_tensor_indices_size::call(at::Tensor const&, at::Tensor const&, c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:06.5044338Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:11:06.5044895Z U at::_ops::unsqueeze::call(at::Tensor const&, long) 2025-05-07T20:11:06.5045338Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:11:06.5046146Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:06.5047287Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:06.5060224Z U at::cuda::detail::getDefaultCUDAGenerator(signed char) 2025-05-07T20:11:06.5060828Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:11:06.5061256Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:11:06.5061671Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:11:06.5062077Z U at::get_num_threads() 2025-05-07T20:11:06.5062388Z U at::get_thread_num() 2025-05-07T20:11:06.5062725Z U at::globalContext() 2025-05-07T20:11:06.5063045Z U at::in_parallel_region() 2025-05-07T20:11:06.5063380Z U at::init_num_threads() 2025-05-07T20:11:06.5063725Z U at::internal::set_thread_num(int) 2025-05-07T20:11:06.5064088Z U at::sequence_number::get_and_increment() 2025-05-07T20:11:06.5064538Z U at::tensor(c10::ArrayRef, c10::TensorOptions const&) 2025-05-07T20:11:06.5064975Z U at::toAccumulateType(c10::ScalarType, bool) 2025-05-07T20:11:06.5065359Z U c10::AnyType::get() 2025-05-07T20:11:06.5065795Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:06.5066320Z U c10::BoolType::get() 2025-05-07T20:11:06.5066690Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:06.5067123Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 
2025-05-07T20:11:06.5067542Z U c10::Dispatcher::realSingleton() 2025-05-07T20:11:06.5068280Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:11:06.5069490Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:11:06.5070574Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:06.5071310Z U c10::Error::what() const 2025-05-07T20:11:06.5071681Z U c10::FloatType::get() 2025-05-07T20:11:06.5072020Z U c10::GeneratorImpl::device() const 2025-05-07T20:11:06.5072360Z U c10::GradMode::is_enabled() 2025-05-07T20:11:06.5072702Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:11:06.5073083Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:11:06.5073822Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:06.5074335Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:11:06.5074738Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:11:06.5075115Z U c10::IValue::isBoolList() const 2025-05-07T20:11:06.5075461Z U c10::IValue::isIntList() const 2025-05-07T20:11:06.5077651Z U c10::IValue::isSymIntList() const 2025-05-07T20:11:06.5078026Z U c10::IValue::isTensorList() const 2025-05-07T20:11:06.5078427Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:06.5078822Z U c10::IntType::get() 2025-05-07T20:11:06.5079199Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:11:06.5079690Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:11:06.5080062Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:11:06.5080439Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:11:06.5080922Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:06.5081436Z U c10::ParallelGuard::ParallelGuard(bool) 2025-05-07T20:11:06.5081803Z U 
c10::ParallelGuard::~ParallelGuard() 2025-05-07T20:11:06.5082581Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:11:06.5083182Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:06.5083574Z U c10::StringType::get() 2025-05-07T20:11:06.5083959Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:06.5084369Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:11:06.5085095Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:06.5085800Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:06.5086185Z U c10::SymInt::promote_to_negative() 2025-05-07T20:11:06.5086563Z U c10::SymInt::toSymNode() const 2025-05-07T20:11:06.5086898Z U c10::SymIntType::get() 2025-05-07T20:11:06.5087300Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:11:06.5087710Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:11:06.5088108Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:06.5088618Z U c10::TensorType::get() 2025-05-07T20:11:06.5088930Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:06.5089861Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:11:06.5090814Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:11:06.5091162Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:11:06.5091513Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:11:06.5091856Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:11:06.5092216Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:11:06.5092558Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:11:06.5093054Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:11:06.5093546Z U c10::cuda::device_count() 2025-05-07T20:11:06.5094071Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:11:06.5094482Z U 
c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:11:06.5094964Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:11:06.5095395Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:11:06.5095880Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:11:06.5096268Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:11:06.5096960Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:06.5098077Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:06.5098989Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:06.5100171Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:06.5101212Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:06.5102322Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:06.5103188Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:11:06.5103528Z U c10::impl::GPUTrace::haveState 2025-05-07T20:11:06.5104102Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:11:06.5104759Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:11:06.5105220Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:11:06.5105677Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:11:06.5106192Z U c10::impl::device_guard_impl_registry 2025-05-07T20:11:06.5106535Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:11:06.5106892Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:11:06.5107512Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 
2025-05-07T20:11:06.5108115Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:11:06.5108474Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:11:06.5108863Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:11:06.5109413Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:11:06.5109853Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:11:06.5110246Z U c10::operator<=(c10::SymInt const&, int) 2025-05-07T20:11:06.5110605Z U c10::operator>(c10::SymInt const&, int) 2025-05-07T20:11:06.5110980Z U c10::operator>=(c10::SymInt const&, int) 2025-05-07T20:11:06.5111325Z U c10::throwNullDataPtrError() 2025-05-07T20:11:06.5111672Z U c10::warn(c10::Warning const&) 2025-05-07T20:11:06.5112006Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:11:06.5112419Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:06.5112865Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:11:06.5113218Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:11:06.5113600Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:11:06.5113961Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:11:06.5114337Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:11:06.5114707Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:11:06.5115085Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:11:06.5115456Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:11:06.5115795Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:11:06.5116169Z U cudaFuncSetAttribute@libcudart.so.12 2025-05-07T20:11:06.5116536Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:11:06.5116929Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:11:06.5117297Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:11:06.5117651Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:11:06.5118007Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:11:06.5118354Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:11:06.5118727Z U 
cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:11:06.5119762Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:06.5121005Z U fbgemm::SparseAdaGradSignature::Type fbgemm::GenerateSparseAdaGrad(int, bool, int, bool) 2025-05-07T20:11:06.5121693Z U fbgemm::fbgemmAlignedFree(void*) 2025-05-07T20:11:06.5122096Z U fbgemm_gpu::asynchronous_complete_cumsum_gpu(at::Tensor const&) 2025-05-07T20:11:06.5122525Z U float at::Tensor::item() const 2025-05-07T20:11:06.5122879Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:11:06.5123277Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:06.5123649Z U free@GLIBC_2.2.5 2025-05-07T20:11:06.5123939Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:11:06.5124481Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:06.5124892Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:06.5125301Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:11:06.5125691Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:06.5126030Z U memcmp@GLIBC_2.2.5 2025-05-07T20:11:06.5126306Z U memcpy@GLIBC_2.14 2025-05-07T20:11:06.5126576Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:06.5126869Z U memset@GLIBC_2.2.5 2025-05-07T20:11:06.5127146Z U omp_get_num_threads 2025-05-07T20:11:06.5127425Z U omp_get_thread_num 2025-05-07T20:11:06.5127745Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:06.5128127Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:06.5128673Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:11:06.5129391Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:11:06.5130116Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, long const*, long*, int, int, int, 
CUstream_st*) 2025-05-07T20:11:06.5130839Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:11:06.5131577Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:11:06.5132317Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, long const*, long*, int, int, int, CUstream_st*) 2025-05-07T20:11:06.5132829Z U realloc@GLIBC_2.2.5 2025-05-07T20:11:06.5133493Z U split_embedding_codegen_forward_cpu(at::Tensor, at::Tensor, at::Tensor, c10::SymInt, at::Tensor, at::Tensor, at::Tensor, long, at::Tensor, long) 2025-05-07T20:11:06.5134430Z U split_embedding_codegen_grad_indice_weights_cpu(at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor) 2025-05-07T20:11:06.5135243Z U sqrt@GLIBC_2.2.5 2025-05-07T20:11:06.5135558Z U sqrtf@GLIBC_2.2.5 2025-05-07T20:11:06.5135958Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:11:06.5136696Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:06.5137563Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:06.5138404Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const &@GLIBCXX_3.4.29 2025-05-07T20:11:06.5139288Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:11:06.5140191Z U std::__exception_ptr::exception_ptr::_M_addref() 2025-05-07T20:11:06.5140616Z U std::__exception_ptr::exception_ptr::_M_release() 2025-05-07T20:11:06.5141086Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:06.5141440Z U std::__throw_bad_array_new_length() 2025-05-07T20:11:06.5141804Z U std::__throw_bad_function_call()@GLIBCXX_3.4.14 2025-05-07T20:11:06.5142226Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:06.5142635Z U 
std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:06.5143072Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:11:06.5143524Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:11:06.5143915Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:11:06.5144441Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:06.5145195Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:11:06.5146278Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:06.5147524Z U std::basic_ostream >& std::operator<< >(std::basic_ostream >&, char const*)@GLIBCXX_3.4 2025-05-07T20:11:06.5148305Z U std::current_exception()@CXXABI_1.3.3 2025-05-07T20:11:06.5148674Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:11:06.5149056Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:06.5149407Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:06.5149775Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:11:06.5150123Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:06.5150483Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:11:06.5150842Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:06.5151260Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:06.5151835Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:06.5152325Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:06.5152765Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:11:06.5153212Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:11:06.5153727Z U std::rethrow_exception(std::__exception_ptr::exception_ptr)@CXXABI_1.3.3 2025-05-07T20:11:06.5154595Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:11:06.5155288Z U 
std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:11:06.5155676Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:06.5156032Z U strcmp@GLIBC_2.2.5 2025-05-07T20:11:06.5156325Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:06.5156660Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:06.5157514Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:06.5158767Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:06.5159645Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:06.5160161Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:11:06.5160721Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:11:06.5161328Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:11:06.5161867Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:11:06.5162404Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:11:06.5163070Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:11:06.5163721Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:11:06.5164206Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:11:06.5164711Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:11:06.5165159Z U torch::autograd::Node::assign_parent() 2025-05-07T20:11:06.5165515Z U torch::autograd::Node::metadata() 2025-05-07T20:11:06.5165899Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:11:06.5166408Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:11:06.5167074Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:11:06.5167635Z U 
torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:11:06.5168110Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:11:06.5168690Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:11:06.5171848Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:11:06.5175156Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:11:06.5175635Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:11:06.5176104Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:11:06.5176564Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:11:06.5177291Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:11:06.5178230Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:06.5179302Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:11:06.5180206Z U typeinfo for c10::Error 2025-05-07T20:11:06.5180586Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:11:06.5180996Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:11:06.5181369Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:11:06.5181773Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:11:06.5182326Z U typeinfo for torch::autograd::Node 2025-05-07T20:11:06.5183680Z U void internal::csr2csc(internal::HyperCompressedSparseColumn&, int, at::TensorAccessor const&, at::TensorAccessor const&, at::TensorAccessor const&, long, int const*, long) 
2025-05-07T20:11:06.5186021Z U void internal::csr2csc(internal::HyperCompressedSparseColumn&, int, at::TensorAccessor const&, at::TensorAccessor const&, at::TensorAccessor const&, long, int const*, long) 2025-05-07T20:11:06.5187402Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:06.5187863Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:06.5188308Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:11:06.5188772Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:06.5189183Z U vtable for c10::Error 2025-05-07T20:11:06.5189759Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:06.5190609Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:06.5191441Z U vtable for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:06.5192059Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:11:06.5192550Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:11:06.5193115Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:06.5193608Z U vtable for torch::autograd::Node 2025-05-07T20:11:06.5194151Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:11:06.5194557Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:06.5194905Z w _ITM_registerTMCloneTable 2025-05-07T20:11:06.5195221Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:06.5195547Z w __gmon_start__ 2025-05-07T20:11:06.5195828Z w __pthread_key_create 2025-05-07T20:11:06.5196157Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:11:06.5196494Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:11:06.5196956Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:06.5197508Z + ldd ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:11:06.5197868Z 2025-05-07T20:11:06.5197981Z linux-vdso.so.1 (0x00007ffcfe3db000) 
2025-05-07T20:11:06.5198298Z libc10.so => not found 2025-05-07T20:11:06.5198551Z libc10_cuda.so => not found 2025-05-07T20:11:06.5199261Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_common.so (0x00007f562f800000) 2025-05-07T20:11:06.5200399Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007f562f650000) 2025-05-07T20:11:06.5201184Z libtorch.so => not found 2025-05-07T20:11:06.5201722Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm.so (0x00007f562f000000) 2025-05-07T20:11:06.5202772Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007f562de00000) 2025-05-07T20:11:06.5203432Z libtorch_cpu.so => not found 2025-05-07T20:11:06.5203699Z libtorch_cuda.so => not found 2025-05-07T20:11:06.5203985Z libcudart.so.12 => not found 2025-05-07T20:11:06.5204313Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f562db9c000) 2025-05-07T20:11:06.5204722Z libm.so.6 => /lib64/libm.so.6 (0x00007f5669d9a000) 2025-05-07T20:11:06.5205102Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f5669d6c000) 2025-05-07T20:11:06.5205473Z libc.so.6 => /lib64/libc.so.6 (0x00007f562d994000) 2025-05-07T20:11:06.5205839Z /lib64/ld-linux-x86-64.so.2 (0x00007f5669e7d000) 2025-05-07T20:11:06.5206151Z libc10.so => not found 2025-05-07T20:11:06.5206410Z libc10_cuda.so => not found 2025-05-07T20:11:06.5206998Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_config.so (0x00007f5669d60000) 2025-05-07T20:11:06.5207642Z libtorch.so => not found 2025-05-07T20:11:06.5207890Z libtorch_cpu.so => not found 2025-05-07T20:11:06.5208161Z libtorch_cuda.so => not found 2025-05-07T20:11:06.5208455Z libcudart.so.12 => not found 2025-05-07T20:11:06.5208717Z libc10.so => not found 
2025-05-07T20:11:06.5208983Z libc10_cuda.so => not found 2025-05-07T20:11:06.5209247Z libtorch.so => not found 2025-05-07T20:11:06.5209509Z libtorch_cpu.so => not found 2025-05-07T20:11:06.5209755Z libtorch_cuda.so => not found 2025-05-07T20:11:06.5210014Z libcudart.so.12 => not found 2025-05-07T20:11:06.5210250Z libc10.so => not found 2025-05-07T20:11:06.5210733Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/asmjit.so (0x00007f562fb85000) 2025-05-07T20:11:06.5211271Z libtorch.so => not found 2025-05-07T20:11:06.5211509Z libtorch_cpu.so => not found 2025-05-07T20:11:06.5211764Z libtorch_cuda.so => not found 2025-05-07T20:11:06.5212011Z libtorch.so => not found 2025-05-07T20:11:06.5212247Z libc10.so => not found 2025-05-07T20:11:06.5212472Z libc10_cuda.so => not found 2025-05-07T20:11:06.5212730Z libtorch_cpu.so => not found 2025-05-07T20:11:06.5212968Z libtorch_cuda.so => not found 2025-05-07T20:11:06.5213222Z libcudart.so.12 => not found 2025-05-07T20:11:06.5213458Z libc10.so => not found 2025-05-07T20:11:06.5213694Z libtorch_cpu.so => not found 2025-05-07T20:11:06.5213951Z libtorch_cuda.so => not found 2025-05-07T20:11:06.5214188Z libtorch.so => not found 2025-05-07T20:11:06.5214612Z libtorch_cpu.so => not found 2025-05-07T20:11:06.5214864Z libtorch_cuda.so => not found 2025-05-07T20:11:06.5215130Z libtorch.so => not found 2025-05-07T20:11:06.5215286Z 2025-05-07T20:11:06.5215387Z [CHECK] Displaying ELF information: 2025-05-07T20:11:06.5215868Z + readelf -d ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:11:06.5216238Z 2025-05-07T20:11:06.5216274Z 2025-05-07T20:11:06.5216448Z Dynamic section at offset 0x38b44998 contains 43 entries: 2025-05-07T20:11:06.5216859Z Tag Type Name/Value 2025-05-07T20:11:06.5217275Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:06.5217823Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:06.5218372Z 
0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_common.so] 2025-05-07T20:11:06.5218974Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_sparse_async_cumsum.so] 2025-05-07T20:11:06.5219667Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:06.5220374Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm.so] 2025-05-07T20:11:06.5220950Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_utils.so] 2025-05-07T20:11:06.5221510Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:06.5222067Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:06.5222612Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:11:06.5223147Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:06.5223839Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:11:06.5224343Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:06.5224842Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:06.5225369Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:11:06.5225953Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward.so] 2025-05-07T20:11:06.5226521Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:11:06.5226937Z 0x000000000000000c (INIT) 0x611000 2025-05-07T20:11:06.5227272Z 0x000000000000000d (FINI) 0x32390cc 2025-05-07T20:11:06.5227621Z 0x0000000000000019 (INIT_ARRAY) 0x38b425f8 2025-05-07T20:11:06.5227980Z 0x000000000000001b (INIT_ARRAYSZ) 1824 (bytes) 2025-05-07T20:11:06.5228346Z 0x000000000000001a (FINI_ARRAY) 0x38b42d18 2025-05-07T20:11:06.5228690Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:06.5229029Z 0x0000000000000004 (HASH) 0x238 2025-05-07T20:11:06.5229361Z 0x000000006ffffef5 (GNU_HASH) 0x10330 2025-05-07T20:11:06.5229692Z 0x0000000000000005 (STRTAB) 0x69580 
2025-05-07T20:11:06.5230032Z 0x0000000000000006 (SYMTAB) 0x20fb0 2025-05-07T20:11:06.5230385Z 0x000000000000000a (STRSZ) 4919620 (bytes) 2025-05-07T20:11:06.5230770Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:06.5231116Z 0x0000000000000003 (PLTGOT) 0x38b44c88 2025-05-07T20:11:06.5231488Z 0x0000000000000002 (PLTRELSZ) 50064 (bytes) 2025-05-07T20:11:06.5231840Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:06.5232289Z 0x0000000000000017 (JMPREL) 0x603da0 2025-05-07T20:11:06.5232632Z 0x0000000000000007 (RELA) 0x5208e0 2025-05-07T20:11:06.5232984Z 0x0000000000000008 (RELASZ) 931008 (bytes) 2025-05-07T20:11:06.5233345Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:06.5233670Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:06.5234012Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:06.5234352Z 0x000000006ffffffe (VERNEED) 0x520740 2025-05-07T20:11:06.5234706Z 0x000000006fffffff (VERNEEDNUM) 6 2025-05-07T20:11:06.5235027Z 0x000000006ffffff0 (VERSYM) 0x51a6c4 2025-05-07T20:11:06.5235368Z 0x000000006ffffff9 (RELACOUNT) 26208 2025-05-07T20:11:06.5235685Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:06.5235891Z 2025-05-07T20:11:06.5236004Z ################################################################################ 2025-05-07T20:11:06.5236249Z 2025-05-07T20:11:06.5236253Z 2025-05-07T20:11:06.5236392Z ################################################################################ 2025-05-07T20:11:06.5237034Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:11:06.5237585Z [CHECK] Listing out library size: 2025-05-07T20:11:06.5238098Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:11:06.5238512Z 2025-05-07T20:11:06.5238782Z 142 ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:11:06.5239149Z 2025-05-07T20:11:06.5239588Z [CHECK] Listing out the GLIBC 
versions referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:11:06.5240755Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:06.5241393Z 2025-05-07T20:11:06.5423610Z GLIBC_2.2.5 2025-05-07T20:11:06.5424343Z GLIBC_2.3 2025-05-07T20:11:06.5424550Z GLIBC_2.14 2025-05-07T20:11:06.5424696Z 2025-05-07T20:11:06.5424700Z 2025-05-07T20:11:06.5425193Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:11:06.5426365Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:06.5427050Z 2025-05-07T20:11:06.5702197Z GLIBCXX_3.4 2025-05-07T20:11:06.5702868Z GLIBCXX_3.4.9 2025-05-07T20:11:06.5703460Z GLIBCXX_3.4.11 2025-05-07T20:11:06.5704046Z GLIBCXX_3.4.18 2025-05-07T20:11:06.5704609Z GLIBCXX_3.4.20 2025-05-07T20:11:06.5705192Z GLIBCXX_3.4.21 2025-05-07T20:11:06.5705764Z GLIBCXX_3.4.29 2025-05-07T20:11:06.5706148Z 2025-05-07T20:11:06.5706166Z 2025-05-07T20:11:06.5724863Z + nm -gDC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so > /tmp/tmp.TRjilPAqhW.symbols.txt 2025-05-07T20:11:06.5726539Z 2025-05-07T20:11:06.5960502Z 2025-05-07T20:11:06.5985395Z [CHECK] Total Number of symbols: 1624 2025-05-07T20:11:06.6007945Z [CHECK] Number of fbgemm symbols: 228 2025-05-07T20:11:06.6023547Z + nm -gDCu ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so > /tmp/tmp.II6G3lCmjD.usymbols.txt 2025-05-07T20:11:06.6024134Z 2025-05-07T20:11:06.6048675Z 2025-05-07T20:11:06.6076537Z [CHECK] Listing out undefined symbols (178 total): 2025-05-07T20:11:06.6097848Z U VTT for std::__cxx11::basic_ostringstream, std::allocator 
>@GLIBCXX_3.4.21 2025-05-07T20:11:06.6098767Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:06.6099457Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:06.6099867Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:11:06.6100300Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:11:06.6100698Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:11:06.6101106Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:11:06.6101492Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:11:06.6101866Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:11:06.6102263Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:11:06.6102617Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:06.6102946Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:06.6103272Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:06.6103611Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:06.6103932Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:11:06.6104302Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:06.6104801Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:11:06.6105163Z U at::CUDAGeneratorImpl::device_type() 2025-05-07T20:11:06.6105764Z U at::CUDAGeneratorImpl::philox_cuda_state(unsigned long) 2025-05-07T20:11:06.6106164Z U at::Context::deterministicAlgorithms() const 2025-05-07T20:11:06.6106598Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:11:06.6107087Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:11:06.6107931Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:06.6109427Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:06.6110462Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:11:06.6111097Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, 
bool, bool, std::optional) 2025-05-07T20:11:06.6112017Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:06.6113180Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:06.6114031Z U at::cuda::detail::getDefaultCUDAGenerator(signed char) 2025-05-07T20:11:06.6114444Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:11:06.6114885Z U at::globalContext() 2025-05-07T20:11:06.6115270Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:06.6115672Z U c10::BoolType::get() 2025-05-07T20:11:06.6116025Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:06.6116369Z U c10::FloatType::get() 2025-05-07T20:11:06.6116661Z U c10::GeneratorImpl::device() const 2025-05-07T20:11:06.6117045Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:06.6117454Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:06.6117798Z U c10::IntType::get() 2025-05-07T20:11:06.6118145Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:11:06.6118518Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:11:06.6118899Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:06.6119291Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:06.6119673Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:11:06.6120298Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:06.6120924Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:06.6121274Z U c10::SymInt::promote_to_negative() 2025-05-07T20:11:06.6121575Z U c10::SymIntType::get() 2025-05-07T20:11:06.6121919Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:11:06.6122300Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:06.6122650Z U c10::TensorType::get() 2025-05-07T20:11:06.6122955Z U 
c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:06.6123879Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:11:06.6124900Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:11:06.6125232Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:11:06.6125563Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:11:06.6125895Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:11:06.6126235Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:11:06.6126560Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:11:06.6126999Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:11:06.6127447Z U c10::cuda::device_count() 2025-05-07T20:11:06.6127796Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:11:06.6128149Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:11:06.6128515Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:11:06.6128876Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:11:06.6129263Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:11:06.6129610Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:11:06.6130522Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:06.6131422Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:06.6132288Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:06.6133241Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:06.6134291Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:06.6135103Z U c10::impl::GPUTrace::gpuTraceState 
2025-05-07T20:11:06.6135439Z U c10::impl::GPUTrace::haveState 2025-05-07T20:11:06.6135818Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:11:06.6136233Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:11:06.6136641Z U c10::impl::device_guard_impl_registry 2025-05-07T20:11:06.6137002Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:11:06.6137153Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:11:06.6137272Z U c10::throwNullDataPtrError() 2025-05-07T20:11:06.6137382Z U c10::warn(c10::Warning const&) 2025-05-07T20:11:06.6137510Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:11:06.6137708Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:06.6137827Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:11:06.6137957Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:11:06.6138097Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:11:06.6138228Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:11:06.6138342Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:11:06.6138483Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:11:06.6138597Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:11:06.6138711Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:11:06.6138845Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:11:06.6138988Z U cudaFuncSetAttribute@libcudart.so.12 2025-05-07T20:11:06.6139155Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:11:06.6139274Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:11:06.6139495Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:11:06.6139605Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:11:06.6139719Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:11:06.6140077Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:11:06.6140202Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:11:06.6142510Z U 
embedding_ops::split_embedding_backward_codegen_find_long_segments(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, int, bool) 2025-05-07T20:11:06.6142747Z U fbgemm_gpu::asynchronous_complete_cumsum_gpu(at::Tensor const&) 2025-05-07T20:11:06.6142865Z U float at::Tensor::item() const 2025-05-07T20:11:06.6143025Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:11:06.6143184Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:06.6143308Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:11:06.6143470Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:06.6143650Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:06.6143783Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:11:06.6143946Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:06.6144048Z U memcpy@GLIBC_2.14 2025-05-07T20:11:06.6144147Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:06.6144239Z U memset@GLIBC_2.2.5 2025-05-07T20:11:06.6144406Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:06.6144535Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:06.6144872Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:11:06.6145212Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:11:06.6145549Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:11:06.6145874Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:11:06.6146239Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:06.6146646Z U 
std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:06.6146992Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const &@GLIBCXX_3.4.29 2025-05-07T20:11:06.6147393Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:11:06.6147514Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:06.6147629Z U std::__throw_bad_array_new_length() 2025-05-07T20:11:06.6147808Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:06.6147946Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:06.6148143Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:11:06.6148287Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:11:06.6148533Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:06.6148916Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:11:06.6149535Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:06.6150097Z U std::basic_ostream >& std::operator<< >(std::basic_ostream >&, char const*)@GLIBCXX_3.4 2025-05-07T20:11:06.6150229Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:06.6150353Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:06.6150467Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:11:06.6150594Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:06.6150710Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:11:06.6150827Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:06.6151055Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:06.6151314Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:06.6151459Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:06.6151585Z U std::terminate()@GLIBCXX_3.4 
2025-05-07T20:11:06.6151718Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:06.6151857Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:06.6152494Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:06.6153016Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:06.6153297Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:06.6153716Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:06.6154305Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:11:06.6155846Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:11:06.6157357Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:11:06.6160376Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:11:06.6161892Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:11:06.6163338Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:11:06.6164802Z 
U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:11:06.6167040Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:11:06.6169168Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:11:06.6171102Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:11:06.6173086Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:11:06.6175091Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:11:06.6177083Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:11:06.6178886Z U void 
embedding_ops::split_embedding_backward_count_unique_indices_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int) 2025-05-07T20:11:06.6179057Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:06.6179263Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:06.6179540Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:06.6180103Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:06.6180511Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:06.6180882Z U vtable for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:06.6181106Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:11:06.6181379Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:06.6181509Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:06.6181632Z w _ITM_registerTMCloneTable 2025-05-07T20:11:06.6181781Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:06.6181887Z w __gmon_start__ 2025-05-07T20:11:06.6182398Z w __pthread_key_create 2025-05-07T20:11:06.6182523Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:11:06.6182682Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:11:06.6182839Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:06.6183118Z + ldd ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:11:06.6183126Z 2025-05-07T20:11:06.6183302Z linux-vdso.so.1 (0x00007ffc1051e000) 2025-05-07T20:11:06.6183401Z libc10.so => not found 2025-05-07T20:11:06.6183507Z libc10_cuda.so => not found 2025-05-07T20:11:06.6184128Z fbgemm_gpu_tbe_training_backward.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward.so (0x00007f335ca00000) 2025-05-07T20:11:06.6184234Z libtorch.so 
=> not found 2025-05-07T20:11:06.6184332Z libtorch_cpu.so => not found 2025-05-07T20:11:06.6184468Z libtorch_cuda.so => not found 2025-05-07T20:11:06.6184570Z libcudart.so.12 => not found 2025-05-07T20:11:06.6184744Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f335c79c000) 2025-05-07T20:11:06.6184921Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f339fc8c000) 2025-05-07T20:11:06.6185058Z libc.so.6 => /lib64/libc.so.6 (0x00007f335c594000) 2025-05-07T20:11:06.6185192Z /lib64/ld-linux-x86-64.so.2 (0x00007f339fcc0000) 2025-05-07T20:11:06.6185360Z libc10.so => not found 2025-05-07T20:11:06.6185526Z libc10_cuda.so => not found 2025-05-07T20:11:06.6186015Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_common.so (0x00007f335c200000) 2025-05-07T20:11:06.6186580Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007f335c050000) 2025-05-07T20:11:06.6186745Z libtorch.so => not found 2025-05-07T20:11:06.6187117Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm.so (0x00007f335ba00000) 2025-05-07T20:11:06.6187594Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007f335a800000) 2025-05-07T20:11:06.6187731Z libtorch_cpu.so => not found 2025-05-07T20:11:06.6187877Z libtorch_cuda.so => not found 2025-05-07T20:11:06.6187987Z libcudart.so.12 => not found 2025-05-07T20:11:06.6188146Z libm.so.6 => /lib64/libm.so.6 (0x00007f335bf75000) 2025-05-07T20:11:06.6188245Z libc10.so => not found 2025-05-07T20:11:06.6188349Z libc10_cuda.so => not found 2025-05-07T20:11:06.6188804Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_config.so (0x00007f339fc7a000) 2025-05-07T20:11:06.6188931Z libtorch.so => not found 2025-05-07T20:11:06.6189042Z libtorch_cpu.so => not 
found 2025-05-07T20:11:06.6189149Z libtorch_cuda.so => not found 2025-05-07T20:11:06.6189281Z libcudart.so.12 => not found 2025-05-07T20:11:06.6189383Z libc10.so => not found 2025-05-07T20:11:06.6189485Z libc10_cuda.so => not found 2025-05-07T20:11:06.6189581Z libtorch.so => not found 2025-05-07T20:11:06.6189715Z libtorch_cpu.so => not found 2025-05-07T20:11:06.6189818Z libtorch_cuda.so => not found 2025-05-07T20:11:06.6189926Z libcudart.so.12 => not found 2025-05-07T20:11:06.6190048Z libc10.so => not found 2025-05-07T20:11:06.6190416Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/asmjit.so (0x00007f3396b85000) 2025-05-07T20:11:06.6190522Z libtorch.so => not found 2025-05-07T20:11:06.6190627Z libtorch_cpu.so => not found 2025-05-07T20:11:06.6190759Z libtorch_cuda.so => not found 2025-05-07T20:11:06.6190860Z libtorch.so => not found 2025-05-07T20:11:06.6190956Z libc10.so => not found 2025-05-07T20:11:06.6191082Z libc10_cuda.so => not found 2025-05-07T20:11:06.6191186Z libtorch_cpu.so => not found 2025-05-07T20:11:06.6191295Z libtorch_cuda.so => not found 2025-05-07T20:11:06.6191395Z libcudart.so.12 => not found 2025-05-07T20:11:06.6191503Z libc10.so => not found 2025-05-07T20:11:06.6191613Z libtorch_cpu.so => not found 2025-05-07T20:11:06.6191717Z libtorch_cuda.so => not found 2025-05-07T20:11:06.6191841Z libtorch.so => not found 2025-05-07T20:11:06.6191945Z libtorch_cpu.so => not found 2025-05-07T20:11:06.6192053Z libtorch_cuda.so => not found 2025-05-07T20:11:06.6192153Z libtorch.so => not found 2025-05-07T20:11:06.6192185Z 2025-05-07T20:11:06.6192305Z [CHECK] Displaying ELF information: 2025-05-07T20:11:06.6192609Z + readelf -d ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:11:06.6192615Z 2025-05-07T20:11:06.6192642Z 2025-05-07T20:11:06.6192845Z Dynamic section at offset 0x8dbfdd8 contains 39 entries: 2025-05-07T20:11:06.6192979Z Tag Type Name/Value 2025-05-07T20:11:06.6193184Z 
0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:06.6193422Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:06.6193695Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_training_backward.so] 2025-05-07T20:11:06.6193896Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:06.6194133Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:06.6194382Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:06.6194792Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:11:06.6195008Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:06.6195239Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:06.6195436Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:06.6195687Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:11:06.6196006Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward_gwd.so] 2025-05-07T20:11:06.6196199Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:11:06.6196319Z 0x000000000000000c (INIT) 0xbf000 2025-05-07T20:11:06.6196633Z 0x000000000000000d (FINI) 0x62dd0c 2025-05-07T20:11:06.6196762Z 0x0000000000000019 (INIT_ARRAY) 0x8dbf998 2025-05-07T20:11:06.6196903Z 0x000000000000001b (INIT_ARRAYSZ) 200 (bytes) 2025-05-07T20:11:06.6197036Z 0x000000000000001a (FINI_ARRAY) 0x8dbfa60 2025-05-07T20:11:06.6197191Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:06.6197305Z 0x0000000000000004 (HASH) 0x238 2025-05-07T20:11:06.6197430Z 0x000000006ffffef5 (GNU_HASH) 0x2b38 2025-05-07T20:11:06.6197565Z 0x0000000000000005 (STRTAB) 0xedf0 2025-05-07T20:11:06.6197682Z 0x0000000000000006 (SYMTAB) 0x5598 2025-05-07T20:11:06.6197824Z 0x000000000000000a (STRSZ) 594745 (bytes) 2025-05-07T20:11:06.6197986Z 0x000000000000000b (SYMENT) 24 (bytes) 
2025-05-07T20:11:06.6198116Z 0x0000000000000003 (PLTGOT) 0x8dc0088 2025-05-07T20:11:06.6198261Z 0x0000000000000002 (PLTRELSZ) 11400 (bytes) 2025-05-07T20:11:06.6198382Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:06.6198522Z 0x0000000000000017 (JMPREL) 0xbb9f8 2025-05-07T20:11:06.6198644Z 0x0000000000000007 (RELA) 0xa0f20 2025-05-07T20:11:06.6198791Z 0x0000000000000008 (RELASZ) 109272 (bytes) 2025-05-07T20:11:06.6198941Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:06.6199050Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:06.6199183Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:06.6199306Z 0x000000006ffffffe (VERNEED) 0xa0de0 2025-05-07T20:11:06.6199448Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:11:06.6199572Z 0x000000006ffffff0 (VERSYM) 0xa012a 2025-05-07T20:11:06.6199692Z 0x000000006ffffff9 (RELACOUNT) 3126 2025-05-07T20:11:06.6199826Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:06.6199831Z 2025-05-07T20:11:06.6199956Z ################################################################################ 2025-05-07T20:11:06.6199962Z 2025-05-07T20:11:06.6199966Z 2025-05-07T20:11:06.6200087Z ################################################################################ 2025-05-07T20:11:06.6200458Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:11:06.6200571Z [CHECK] Listing out library size: 2025-05-07T20:11:06.6200898Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:11:06.6200902Z 2025-05-07T20:11:06.6204999Z 59 ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:11:06.6205431Z 2025-05-07T20:11:06.6205924Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:11:06.6207149Z + objdump -TC 
./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:06.6207870Z 2025-05-07T20:11:06.6357899Z GLIBC_2.2.5 2025-05-07T20:11:06.6358863Z GLIBC_2.3 2025-05-07T20:11:06.6359427Z GLIBC_2.14 2025-05-07T20:11:06.6359883Z 2025-05-07T20:11:06.6359897Z 2025-05-07T20:11:06.6361329Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:11:06.6363314Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:06.6364058Z 2025-05-07T20:11:06.6504943Z GLIBCXX_3.4 2025-05-07T20:11:06.6505207Z GLIBCXX_3.4.9 2025-05-07T20:11:06.6505422Z GLIBCXX_3.4.11 2025-05-07T20:11:06.6505649Z GLIBCXX_3.4.15 2025-05-07T20:11:06.6505859Z GLIBCXX_3.4.18 2025-05-07T20:11:06.6506076Z GLIBCXX_3.4.20 2025-05-07T20:11:06.6506279Z GLIBCXX_3.4.21 2025-05-07T20:11:06.6506501Z GLIBCXX_3.4.29 2025-05-07T20:11:06.6506828Z 2025-05-07T20:11:06.6506833Z 2025-05-07T20:11:06.6528237Z + nm -gDC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so > /tmp/tmp.XrWFpHuzia.symbols.txt 2025-05-07T20:11:06.6529909Z 2025-05-07T20:11:06.6643701Z 2025-05-07T20:11:06.6670115Z [CHECK] Total Number of symbols: 1791 2025-05-07T20:11:06.6684799Z [CHECK] Number of fbgemm symbols: 94 2025-05-07T20:11:06.6702292Z + nm -gDCu ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so > /tmp/tmp.1CskuWaFl6.usymbols.txt 2025-05-07T20:11:06.6702876Z 2025-05-07T20:11:06.6727282Z 2025-05-07T20:11:06.6757139Z [CHECK] Listing out undefined symbols (266 total): 2025-05-07T20:11:06.6780907Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:06.6781808Z U VTT for std::__cxx11::basic_stringstream, 
std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:06.6782691Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:06.6783054Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:11:06.6783487Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:11:06.6783896Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:11:06.6784278Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:11:06.6784678Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:11:06.6785029Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:11:06.6785422Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:11:06.6785794Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:11:06.6786144Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:06.6786476Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:06.6786785Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:06.6787128Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:11:06.6787455Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:11:06.6787795Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:06.6788128Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:06.6788453Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:06.6788775Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:11:06.6789239Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:11:06.6789562Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:06.6789875Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:11:06.6790193Z U adjust_info_B_num_bits(int, int) 2025-05-07T20:11:06.6790550Z U at::Context::deterministicAlgorithms() const 2025-05-07T20:11:06.6790988Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:11:06.6791488Z U at::RecordFunction::currentThreadId() 2025-05-07T20:11:06.6791829Z U at::RecordFunction::end() 2025-05-07T20:11:06.6792327Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:11:06.6792690Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:11:06.6793187Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:11:06.6793625Z U 
at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:11:06.6794511Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:06.6795810Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:06.6796901Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:11:06.6797730Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:06.6798920Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:06.6799748Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:11:06.6800140Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:11:06.6800542Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:11:06.6800919Z U at::globalContext() 2025-05-07T20:11:06.6801261Z U at::sequence_number::get_and_increment() 2025-05-07T20:11:06.6801598Z U c10::AnyType::get() 2025-05-07T20:11:06.6802017Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:06.6802545Z U c10::BoolType::get() 2025-05-07T20:11:06.6802901Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:06.6803343Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:11:06.6803734Z U c10::Dispatcher::realSingleton() 2025-05-07T20:11:06.6804454Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:11:06.6805669Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:11:06.6806734Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:06.6807319Z U c10::Error::what() 
const 2025-05-07T20:11:06.6807611Z U c10::FloatType::get() 2025-05-07T20:11:06.6807919Z U c10::GradMode::is_enabled() 2025-05-07T20:11:06.6808234Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:11:06.6808598Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:06.6809301Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:11:06.6809676Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:11:06.6810020Z U c10::IValue::isBoolList() const 2025-05-07T20:11:06.6810334Z U c10::IValue::isIntList() const 2025-05-07T20:11:06.6810672Z U c10::IValue::isSymIntList() const 2025-05-07T20:11:06.6811017Z U c10::IValue::isTensorList() const 2025-05-07T20:11:06.6811372Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:06.6811750Z U c10::IntType::get() 2025-05-07T20:11:06.6812135Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:11:06.6812568Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:11:06.6812945Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:11:06.6813336Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:11:06.6813802Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:06.6814445Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:11:06.6815123Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:06.6815469Z U c10::StringType::get() 2025-05-07T20:11:06.6815817Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:11:06.6816210Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:06.6816631Z U c10::SymBool::guard_size_oblivious(char const*, long) const 2025-05-07T20:11:06.6817057Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:11:06.6817451Z U c10::SymFloat::operator/(c10::SymFloat const&) const 2025-05-07T20:11:06.6818101Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:06.6818730Z U c10::SymInt::guard_int(char const*, long) const 
2025-05-07T20:11:06.6819089Z U c10::SymInt::operator c10::SymFloat() const 2025-05-07T20:11:06.6819544Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:11:06.6820075Z U c10::SymInt::promote_to_negative() 2025-05-07T20:11:06.6820451Z U c10::SymInt::sym_eq(c10::SymInt const&) const 2025-05-07T20:11:06.6820873Z U c10::SymInt::sym_gt(c10::SymInt const&) const 2025-05-07T20:11:06.6821252Z U c10::SymInt::sym_le(c10::SymInt const&) const 2025-05-07T20:11:06.6821614Z U c10::SymInt::toSymNode() const 2025-05-07T20:11:06.6821933Z U c10::SymIntType::get() 2025-05-07T20:11:06.6822320Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:11:06.6822708Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:11:06.6823112Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:06.6823499Z U c10::TensorType::get() 2025-05-07T20:11:06.6823833Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:06.6824820Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:11:06.6825823Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:11:06.6826315Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:11:06.6826648Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:11:06.6826970Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:11:06.6827308Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:11:06.6827816Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:11:06.6828297Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:11:06.6828767Z U c10::cuda::device_count() 2025-05-07T20:11:06.6829108Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:11:06.6829492Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:11:06.6829869Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:11:06.6830451Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:11:06.6830862Z U 
c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:11:06.6831307Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:11:06.6832008Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:06.6833109Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:06.6834070Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:06.6834974Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:06.6835943Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:06.6837045Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:06.6837900Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:11:06.6838234Z U c10::impl::GPUTrace::haveState 2025-05-07T20:11:06.6838797Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:11:06.6839432Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:11:06.6839894Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:11:06.6840343Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:11:06.6840743Z U c10::impl::device_guard_impl_registry 2025-05-07T20:11:06.6841114Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:11:06.6841498Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:11:06.6842164Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:06.6842808Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:11:06.6843193Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:11:06.6843605Z U c10::operator<<(std::ostream&, c10::DeviceType) 
2025-05-07T20:11:06.6844023Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:11:06.6844469Z U c10::operator<<(std::ostream&, c10::SymFloat const&) 2025-05-07T20:11:06.6844896Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:11:06.6845264Z U c10::throwNullDataPtrError() 2025-05-07T20:11:06.6845608Z U c10::warn(c10::Warning const&) 2025-05-07T20:11:06.6845929Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:11:06.6846372Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:06.6846946Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:11:06.6847426Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:11:06.6847788Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:11:06.6848130Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:11:06.6848487Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:11:06.6848817Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:11:06.6849150Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:11:06.6849470Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:11:06.6849809Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:11:06.6850157Z U cudaFuncSetAttribute@libcudart.so.12 2025-05-07T20:11:06.6850612Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:11:06.6851146Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:11:06.6851548Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:11:06.6851885Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:11:06.6852245Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:11:06.6852598Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:11:06.6852972Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:11:06.6855495Z U embedding_ops::split_embedding_backward_codegen_find_long_segments(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, 
int, bool) 2025-05-07T20:11:06.6858097Z U fbgemm_gpu::config::is_feature_enabled(fbgemm_gpu::config::FeatureGateName const&) 2025-05-07T20:11:06.6858609Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:11:06.6859037Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:06.6859486Z U free@GLIBC_2.2.5 2025-05-07T20:11:06.6860002Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:11:06.6860408Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:06.6860852Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:06.6861299Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:11:06.6861697Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:06.6862093Z U memcmp@GLIBC_2.2.5 2025-05-07T20:11:06.6862412Z U memcpy@GLIBC_2.14 2025-05-07T20:11:06.6862701Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:06.6863016Z U memset@GLIBC_2.2.5 2025-05-07T20:11:06.6863362Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:06.6863771Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:06.6864342Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:11:06.6865145Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:11:06.6865726Z U realloc@GLIBC_2.2.5 2025-05-07T20:11:06.6866138Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:11:06.6866855Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:06.6867723Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:06.6868612Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const &@GLIBCXX_3.4.29 2025-05-07T20:11:06.6869428Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:11:06.6870282Z U std::__cxx11::basic_stringstream, 
std::allocator >::~basic_stringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:06.6871188Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:11:06.6871859Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:06.6872242Z U std::__throw_bad_array_new_length() 2025-05-07T20:11:06.6872664Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:06.6873072Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:06.6873538Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:11:06.6873998Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:11:06.6874421Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:11:06.6874954Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:06.6875687Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:11:06.6876794Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:06.6878100Z U std::basic_ostream >& std::operator<< >(std::basic_ostream >&, char const*)@GLIBCXX_3.4 2025-05-07T20:11:06.6878890Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:11:06.6879277Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:06.6879668Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:06.6880037Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:11:06.6880411Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:06.6880774Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:11:06.6881138Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:06.6881560Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:06.6882306Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:06.6882825Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:06.6883251Z U std::out_of_range::out_of_range(char 
const*)@GLIBCXX_3.4.21 2025-05-07T20:11:06.6883694Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:11:06.6884389Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:11:06.6885117Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:11:06.6885504Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:06.6885829Z U strcmp@GLIBC_2.2.5 2025-05-07T20:11:06.6886145Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:06.6886465Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:06.6887335Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:06.6888558Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:06.6889418Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:06.6889938Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:11:06.6890507Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:11:06.6891122Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:11:06.6891646Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:11:06.6892237Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:11:06.6892922Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:11:06.6893611Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:11:06.6894084Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:11:06.6894599Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:11:06.6895055Z U torch::autograd::Node::assign_parent() 2025-05-07T20:11:06.6895438Z U torch::autograd::Node::metadata() 2025-05-07T20:11:06.6895825Z U 
torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:11:06.6896328Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:11:06.6897025Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:11:06.6897576Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:11:06.6898074Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:11:06.6898658Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:11:06.6901948Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:11:06.6905227Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:11:06.6905661Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:11:06.6906123Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:11:06.6907242Z U torch::autograd::profiler::record_function_enter_new(std::__cxx11::basic_string, std::allocator > const&, std::optional, std::allocator > > const&) 2025-05-07T20:11:06.6908358Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:11:06.6909081Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:11:06.6909993Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:06.6911072Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:11:06.6911879Z U typeinfo for c10::Error 2025-05-07T20:11:06.6912244Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:11:06.6912652Z U typeinfo for 
std::exception@GLIBCXX_3.4 2025-05-07T20:11:06.6913030Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:11:06.6913435Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:11:06.6913828Z U typeinfo for torch::autograd::Node 2025-05-07T20:11:06.6915586Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:11:06.6918811Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:11:06.6921765Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:11:06.6924686Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:11:06.6927570Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:11:06.6930687Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:11:06.6932391Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:06.6932837Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:06.6933315Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:11:06.6933758Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:06.6934166Z U vtable for c10::Error 
2025-05-07T20:11:06.6934770Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:06.6935589Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:06.6936423Z U vtable for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:06.6937035Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:11:06.6937527Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:11:06.6938110Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:06.6938581Z U vtable for torch::autograd::Node 2025-05-07T20:11:06.6939020Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:11:06.6939509Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:06.6939899Z w _ITM_registerTMCloneTable 2025-05-07T20:11:06.6940257Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:06.6940616Z w __gmon_start__ 2025-05-07T20:11:06.6940939Z w __pthread_key_create 2025-05-07T20:11:06.6941269Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:11:06.6941642Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:11:06.6942029Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:06.6944156Z + ldd ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:11:06.6944571Z 2025-05-07T20:11:06.6944720Z linux-vdso.so.1 (0x00007ffd7a99c000) 2025-05-07T20:11:06.6945030Z libc10.so => not found 2025-05-07T20:11:06.6945319Z libc10_cuda.so => not found 2025-05-07T20:11:06.6946097Z fbgemm_gpu_tbe_training_backward.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward.so (0x00007f4cdb200000) 2025-05-07T20:11:06.6946955Z libtorch.so => not found 2025-05-07T20:11:06.6947237Z libtorch_cpu.so => not found 2025-05-07T20:11:06.6947548Z libtorch_cuda.so => not found 2025-05-07T20:11:06.6947827Z libcudart.so.12 => not found 2025-05-07T20:11:06.6948203Z libstdc++.so.6 => /lib64/libstdc++.so.6 
(0x00007f4cdaf9c000) 2025-05-07T20:11:06.6948670Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f4d19090000) 2025-05-07T20:11:06.6949082Z libc.so.6 => /lib64/libc.so.6 (0x00007f4cdad94000) 2025-05-07T20:11:06.6949490Z /lib64/ld-linux-x86-64.so.2 (0x00007f4d190c4000) 2025-05-07T20:11:06.6961325Z libc10.so => not found 2025-05-07T20:11:06.6961656Z libc10_cuda.so => not found 2025-05-07T20:11:06.6962331Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_common.so (0x00007f4cdaa00000) 2025-05-07T20:11:06.6963468Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007f4cda850000) 2025-05-07T20:11:06.6964239Z libtorch.so => not found 2025-05-07T20:11:06.6964756Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm.so (0x00007f4cda200000) 2025-05-07T20:11:06.6965679Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007f4cd9000000) 2025-05-07T20:11:06.6966351Z libtorch_cpu.so => not found 2025-05-07T20:11:06.6966616Z libtorch_cuda.so => not found 2025-05-07T20:11:06.6966891Z libcudart.so.12 => not found 2025-05-07T20:11:06.6967185Z libm.so.6 => /lib64/libm.so.6 (0x00007f4cda775000) 2025-05-07T20:11:06.6967508Z libc10.so => not found 2025-05-07T20:11:06.6967741Z libc10_cuda.so => not found 2025-05-07T20:11:06.6968352Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_config.so (0x00007f4d1907e000) 2025-05-07T20:11:06.6968990Z libtorch.so => not found 2025-05-07T20:11:06.6969254Z libtorch_cpu.so => not found 2025-05-07T20:11:06.6969527Z libtorch_cuda.so => not found 2025-05-07T20:11:06.6969967Z libcudart.so.12 => not found 2025-05-07T20:11:06.6970239Z libc10.so => not found 2025-05-07T20:11:06.6970468Z libc10_cuda.so => not found 
2025-05-07T20:11:06.6970743Z libtorch.so => not found 2025-05-07T20:11:06.6970992Z libtorch_cpu.so => not found 2025-05-07T20:11:06.6971276Z libtorch_cuda.so => not found 2025-05-07T20:11:06.6971538Z libcudart.so.12 => not found 2025-05-07T20:11:06.6971814Z libc10.so => not found 2025-05-07T20:11:06.6972320Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/asmjit.so (0x00007f4d15385000) 2025-05-07T20:11:06.6973014Z libtorch.so => not found 2025-05-07T20:11:06.6973277Z libtorch_cpu.so => not found 2025-05-07T20:11:06.6973527Z libtorch_cuda.so => not found 2025-05-07T20:11:06.6973784Z libtorch.so => not found 2025-05-07T20:11:06.6974015Z libc10.so => not found 2025-05-07T20:11:06.6974254Z libc10_cuda.so => not found 2025-05-07T20:11:06.6974602Z libtorch_cpu.so => not found 2025-05-07T20:11:06.6974871Z libtorch_cuda.so => not found 2025-05-07T20:11:06.6975150Z libcudart.so.12 => not found 2025-05-07T20:11:06.6975419Z libc10.so => not found 2025-05-07T20:11:06.6975651Z libtorch_cpu.so => not found 2025-05-07T20:11:06.6975928Z libtorch_cuda.so => not found 2025-05-07T20:11:06.6976188Z libtorch.so => not found 2025-05-07T20:11:06.6976436Z libtorch_cpu.so => not found 2025-05-07T20:11:06.6976707Z libtorch_cuda.so => not found 2025-05-07T20:11:06.6976988Z libtorch.so => not found 2025-05-07T20:11:06.6977138Z 2025-05-07T20:11:06.6977266Z [CHECK] Displaying ELF information: 2025-05-07T20:11:06.6977759Z + readelf -d ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:11:06.6978169Z 2025-05-07T20:11:06.6978205Z 2025-05-07T20:11:06.6978369Z Dynamic section at offset 0x3a22e50 contains 39 entries: 2025-05-07T20:11:06.6978796Z Tag Type Name/Value 2025-05-07T20:11:06.6979201Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:06.6980021Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:06.6980607Z 0x0000000000000001 (NEEDED) Shared library: 
[fbgemm_gpu_tbe_training_backward.so] 2025-05-07T20:11:06.6981236Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:06.6981754Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:06.6982483Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:06.6983025Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:11:06.6983548Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:06.6984064Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:06.6984574Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:06.6985105Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:11:06.6985716Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward_dense.so] 2025-05-07T20:11:06.6986316Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:11:06.6986735Z 0x000000000000000c (INIT) 0x7a000 2025-05-07T20:11:06.6987068Z 0x000000000000000d (FINI) 0x26a70c 2025-05-07T20:11:06.6987421Z 0x0000000000000019 (INIT_ARRAY) 0x3a23350 2025-05-07T20:11:06.6987785Z 0x000000000000001b (INIT_ARRAYSZ) 184 (bytes) 2025-05-07T20:11:06.6988149Z 0x000000000000001a (FINI_ARRAY) 0x3a23408 2025-05-07T20:11:06.6988494Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:06.6988838Z 0x0000000000000004 (HASH) 0x238 2025-05-07T20:11:06.6989177Z 0x000000006ffffef5 (GNU_HASH) 0x2e00 2025-05-07T20:11:06.6989520Z 0x0000000000000005 (STRTAB) 0x101c8 2025-05-07T20:11:06.6989856Z 0x0000000000000006 (SYMTAB) 0x59c8 2025-05-07T20:11:06.6990201Z 0x000000000000000a (STRSZ) 353759 (bytes) 2025-05-07T20:11:06.6990567Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:06.6990916Z 0x0000000000000003 (PLTGOT) 0x3a24100 2025-05-07T20:11:06.6991285Z 0x0000000000000002 (PLTRELSZ) 13056 (bytes) 2025-05-07T20:11:06.6991627Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:06.6991963Z 
0x0000000000000017 (JMPREL) 0x75e68 2025-05-07T20:11:06.6992298Z 0x0000000000000007 (RELA) 0x67708 2025-05-07T20:11:06.6992631Z 0x0000000000000008 (RELASZ) 59232 (bytes) 2025-05-07T20:11:06.6993001Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:06.6993326Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:06.6993662Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:06.6994210Z 0x000000006ffffffe (VERNEED) 0x675a8 2025-05-07T20:11:06.6994555Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:11:06.6994916Z 0x000000006ffffff0 (VERSYM) 0x667a8 2025-05-07T20:11:06.6995254Z 0x000000006ffffff9 (RELACOUNT) 1167 2025-05-07T20:11:06.6995571Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:06.6995771Z 2025-05-07T20:11:06.6995879Z ################################################################################ 2025-05-07T20:11:06.6996112Z 2025-05-07T20:11:06.6996152Z 2025-05-07T20:11:06.6996260Z ################################################################################ 2025-05-07T20:11:06.6996793Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:11:06.6997333Z [CHECK] Listing out library size: 2025-05-07T20:11:06.6997842Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:11:06.6998398Z 2025-05-07T20:11:06.6998632Z 329 ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:11:06.6998978Z 2025-05-07T20:11:06.6999394Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:11:06.7000424Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:06.7001030Z 2025-05-07T20:11:06.7461669Z GLIBC_2.2.5 2025-05-07T20:11:06.7461935Z GLIBC_2.3 2025-05-07T20:11:06.7462116Z GLIBC_2.14 
2025-05-07T20:11:06.7462250Z 2025-05-07T20:11:06.7462254Z 2025-05-07T20:11:06.7462739Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:11:06.7463911Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:06.7464592Z 2025-05-07T20:11:06.8044934Z GLIBCXX_3.4 2025-05-07T20:11:06.8045287Z GLIBCXX_3.4.9 2025-05-07T20:11:06.8046521Z GLIBCXX_3.4.11 2025-05-07T20:11:06.8046845Z GLIBCXX_3.4.18 2025-05-07T20:11:06.8047096Z GLIBCXX_3.4.20 2025-05-07T20:11:06.8047317Z GLIBCXX_3.4.21 2025-05-07T20:11:06.8047555Z GLIBCXX_3.4.29 2025-05-07T20:11:06.8047691Z 2025-05-07T20:11:06.8047696Z 2025-05-07T20:11:06.8061914Z + nm -gDC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so > /tmp/tmp.4qHxuknt7V.symbols.txt 2025-05-07T20:11:06.8062463Z 2025-05-07T20:11:06.8609251Z 2025-05-07T20:11:06.8644460Z [CHECK] Total Number of symbols: 3670 2025-05-07T20:11:06.8677968Z [CHECK] Number of fbgemm symbols: 456 2025-05-07T20:11:06.8692926Z + nm -gDCu ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so > /tmp/tmp.WbaFZBFvhr.usymbols.txt 2025-05-07T20:11:06.8693547Z 2025-05-07T20:11:06.8721782Z 2025-05-07T20:11:06.8744879Z [CHECK] Listing out undefined symbols (185 total): 2025-05-07T20:11:06.8762596Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:06.8763818Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:06.8764529Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:06.8765054Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:11:06.8765525Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:11:06.8765922Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:11:06.8766294Z U 
__cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:11:06.8766685Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:11:06.8767045Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:11:06.8767426Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:11:06.8769112Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:06.8769549Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:06.8769864Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:06.8770187Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:06.8770519Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:11:06.8770844Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:06.8771345Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:11:06.8771636Z U adjust_info_B_num_bits(int, int) 2025-05-07T20:11:06.8771976Z U at::CUDAGeneratorImpl::device_type() 2025-05-07T20:11:06.8772350Z U at::CUDAGeneratorImpl::philox_cuda_state(unsigned long) 2025-05-07T20:11:06.8772762Z U at::Context::deterministicAlgorithms() const 2025-05-07T20:11:06.8773228Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:11:06.8773663Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:11:06.8774505Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:06.8775807Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:06.8776735Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:11:06.8777333Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:11:06.8778210Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:06.8779435Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:06.8780499Z U 
at::cuda::detail::getDefaultCUDAGenerator(signed char) 2025-05-07T20:11:06.8780935Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:11:06.8781286Z U at::globalContext() 2025-05-07T20:11:06.8781705Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:06.8782334Z U c10::BoolType::get() 2025-05-07T20:11:06.8782709Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:06.8783097Z U c10::FloatType::get() 2025-05-07T20:11:06.8783430Z U c10::GeneratorImpl::device() const 2025-05-07T20:11:06.8783848Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:06.8784285Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:06.8784646Z U c10::IntType::get() 2025-05-07T20:11:06.8785010Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:11:06.8785433Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:11:06.8785820Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:06.8786252Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:11:06.8786664Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:06.8787086Z U c10::SymBool::guard_size_oblivious(char const*, long) const 2025-05-07T20:11:06.8787538Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:11:06.8788516Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:06.8789193Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:06.8789567Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:11:06.8789904Z U c10::SymInt::promote_to_negative() 2025-05-07T20:11:06.8790248Z U c10::SymInt::sym_eq(c10::SymInt const&) const 2025-05-07T20:11:06.8790625Z U c10::SymInt::sym_gt(c10::SymInt const&) const 2025-05-07T20:11:06.8790991Z U c10::SymInt::sym_le(c10::SymInt const&) const 2025-05-07T20:11:06.8791330Z U c10::SymInt::toSymNode() const 2025-05-07T20:11:06.8791622Z U c10::SymIntType::get() 2025-05-07T20:11:06.8791971Z U 
c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:11:06.8792404Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:06.8792762Z U c10::TensorType::get() 2025-05-07T20:11:06.8793060Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:06.8794179Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:11:06.8795176Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:11:06.8795540Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:11:06.8795911Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:11:06.8796304Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:11:06.8796641Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:11:06.8796996Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:11:06.8797488Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:11:06.8797955Z U c10::cuda::device_count() 2025-05-07T20:11:06.8798310Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:11:06.8798681Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:11:06.8799077Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:11:06.8799459Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:11:06.8799877Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:11:06.8800272Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:11:06.8801097Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:06.8801947Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:06.8802790Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:06.8803694Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:06.8804929Z U 
c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:06.8805757Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:11:06.8806086Z U c10::impl::GPUTrace::haveState 2025-05-07T20:11:06.8806463Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:11:06.8806886Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:11:06.8807298Z U c10::impl::device_guard_impl_registry 2025-05-07T20:11:06.8807671Z U c10::operator+(c10::SymInt const&, int) 2025-05-07T20:11:06.8808064Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:11:06.8808455Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:11:06.8808842Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:11:06.8809238Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:11:06.8809646Z U c10::throwNullDataPtrError() 2025-05-07T20:11:06.8809769Z U c10::warn(c10::Warning const&) 2025-05-07T20:11:06.8809877Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:11:06.8810072Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:06.8810213Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:11:06.8810372Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:11:06.8810495Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:11:06.8810650Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:11:06.8810765Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:11:06.8810889Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:11:06.8811001Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:11:06.8811134Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:11:06.8811254Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:11:06.8811376Z U cudaFuncSetAttribute@libcudart.so.12 2025-05-07T20:11:06.8811528Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:11:06.8811650Z U cudaGetErrorString@libcudart.so.12 
2025-05-07T20:11:06.8811760Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:11:06.8811888Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:11:06.8812003Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:11:06.8812128Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:11:06.8812248Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:11:06.8814547Z U embedding_ops::split_embedding_backward_codegen_find_long_segments(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, int, bool) 2025-05-07T20:11:06.8814746Z U fbgemm_gpu::asynchronous_complete_cumsum_gpu(at::Tensor const&) 2025-05-07T20:11:06.8814881Z U float at::Tensor::item() const 2025-05-07T20:11:06.8815016Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:11:06.8815171Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:06.8815297Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:11:06.8815433Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:06.8815602Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:06.8815744Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:11:06.8815889Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:06.8815984Z U memcpy@GLIBC_2.14 2025-05-07T20:11:06.8816093Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:06.8816180Z U memset@GLIBC_2.2.5 2025-05-07T20:11:06.8816326Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:06.8816467Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:06.8816795Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:11:06.8817116Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:11:06.8817439Z U radix_sort_pairs(void*, 
unsigned long&, int const*, int*, long const*, long*, int, int, int, CUstream_st*) 2025-05-07T20:11:06.8817777Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:11:06.8818081Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:11:06.8818400Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, long const*, long*, int, int, int, CUstream_st*) 2025-05-07T20:11:06.8818755Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:06.8819135Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:06.8819601Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const &@GLIBCXX_3.4.29 2025-05-07T20:11:06.8820169Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:11:06.8820289Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:06.8820530Z U std::__throw_bad_array_new_length() 2025-05-07T20:11:06.8820673Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:06.8820820Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:06.8821027Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:11:06.8821170Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:11:06.8821419Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:06.8821814Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:11:06.8822424Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:06.8822951Z U std::basic_ostream >& std::operator<< >(std::basic_ostream >&, char const*)@GLIBCXX_3.4 2025-05-07T20:11:06.8823096Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:06.8823222Z U 
std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:06.8823347Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:11:06.8823493Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:06.8823609Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:11:06.8823722Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:06.8823936Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:06.8824187Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:06.8824316Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:06.8824444Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:06.8824546Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:06.8824671Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:06.8825331Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:06.8825846Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:06.8826116Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:06.8826526Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:06.8827105Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:11:06.8829211Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:11:06.8831387Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, 
at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:11:06.8833241Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:11:06.8835095Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:11:06.8836928Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:11:06.8838808Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:11:06.8840531Z U void embedding_ops::split_embedding_backward_count_unique_indices_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int) 2025-05-07T20:11:06.8840675Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:06.8840830Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:06.8841024Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:06.8841359Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:06.8841673Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:06.8842017Z U vtable for 
std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:06.8842214Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:11:06.8842451Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:06.8842563Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:06.8842661Z w _ITM_registerTMCloneTable 2025-05-07T20:11:06.8842757Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:06.8842877Z w __gmon_start__ 2025-05-07T20:11:06.8842968Z w __pthread_key_create 2025-05-07T20:11:06.8843075Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:11:06.8843209Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:11:06.8843351Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:06.8843591Z + ldd ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:11:06.8843599Z 2025-05-07T20:11:06.8843740Z linux-vdso.so.1 (0x00007ffedc953000) 2025-05-07T20:11:06.8843832Z libc10.so => not found 2025-05-07T20:11:06.8843921Z libc10_cuda.so => not found 2025-05-07T20:11:06.8844495Z fbgemm_gpu_tbe_training_backward.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward.so (0x00007fd772200000) 2025-05-07T20:11:06.8844589Z libtorch.so => not found 2025-05-07T20:11:06.8844859Z libtorch_cpu.so => not found 2025-05-07T20:11:06.8844956Z libtorch_cuda.so => not found 2025-05-07T20:11:06.8845075Z libcudart.so.12 => not found 2025-05-07T20:11:06.8845242Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007fd771f9c000) 2025-05-07T20:11:06.8845392Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fd7ac3d2000) 2025-05-07T20:11:06.8845536Z libc.so.6 => /lib64/libc.so.6 (0x00007fd771d94000) 2025-05-07T20:11:06.8845668Z /lib64/ld-linux-x86-64.so.2 (0x00007fd7c1406000) 2025-05-07T20:11:06.8845758Z libc10.so => not found 2025-05-07T20:11:06.8845865Z libc10_cuda.so => not found 2025-05-07T20:11:06.8846344Z fbgemm_gpu_tbe_common.so => 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_common.so (0x00007fd771a00000) 2025-05-07T20:11:06.8846902Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007fd771850000) 2025-05-07T20:11:06.8847011Z libtorch.so => not found 2025-05-07T20:11:06.8847364Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm.so (0x00007fd771200000) 2025-05-07T20:11:06.8847858Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007fd770000000) 2025-05-07T20:11:06.8848008Z libtorch_cpu.so => not found 2025-05-07T20:11:06.8848121Z libtorch_cuda.so => not found 2025-05-07T20:11:06.8848223Z libcudart.so.12 => not found 2025-05-07T20:11:06.8848351Z libm.so.6 => /lib64/libm.so.6 (0x00007fd771775000) 2025-05-07T20:11:06.8848446Z libc10.so => not found 2025-05-07T20:11:06.8848593Z libc10_cuda.so => not found 2025-05-07T20:11:06.8849037Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_config.so (0x00007fd7ac3c4000) 2025-05-07T20:11:06.8849144Z libtorch.so => not found 2025-05-07T20:11:06.8849239Z libtorch_cpu.so => not found 2025-05-07T20:11:06.8849341Z libtorch_cuda.so => not found 2025-05-07T20:11:06.8849465Z libcudart.so.12 => not found 2025-05-07T20:11:06.8849575Z libc10.so => not found 2025-05-07T20:11:06.8849670Z libc10_cuda.so => not found 2025-05-07T20:11:06.8849768Z libtorch.so => not found 2025-05-07T20:11:06.8849893Z libtorch_cpu.so => not found 2025-05-07T20:11:06.8849991Z libtorch_cuda.so => not found 2025-05-07T20:11:06.8850087Z libcudart.so.12 => not found 2025-05-07T20:11:06.8850179Z libc10.so => not found 2025-05-07T20:11:06.8850673Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/asmjit.so (0x00007fd7ac345000) 2025-05-07T20:11:06.8850758Z 
libtorch.so => not found 2025-05-07T20:11:06.8850849Z libtorch_cpu.so => not found 2025-05-07T20:11:06.8850949Z libtorch_cuda.so => not found 2025-05-07T20:11:06.8851030Z libtorch.so => not found 2025-05-07T20:11:06.8851106Z libc10.so => not found 2025-05-07T20:11:06.8851191Z libc10_cuda.so => not found 2025-05-07T20:11:06.8851287Z libtorch_cpu.so => not found 2025-05-07T20:11:06.8851372Z libtorch_cuda.so => not found 2025-05-07T20:11:06.8851458Z libcudart.so.12 => not found 2025-05-07T20:11:06.8851556Z libc10.so => not found 2025-05-07T20:11:06.8851640Z libtorch_cpu.so => not found 2025-05-07T20:11:06.8851725Z libtorch_cuda.so => not found 2025-05-07T20:11:06.8851812Z libtorch.so => not found 2025-05-07T20:11:06.8851911Z libtorch_cpu.so => not found 2025-05-07T20:11:06.8851994Z libtorch_cuda.so => not found 2025-05-07T20:11:06.8852076Z libtorch.so => not found 2025-05-07T20:11:06.8852082Z 2025-05-07T20:11:06.8852197Z [CHECK] Displaying ELF information: 2025-05-07T20:11:06.8852467Z + readelf -d ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:11:06.8852472Z 2025-05-07T20:11:06.8863458Z 2025-05-07T20:11:06.8864141Z Dynamic section at offset 0x148571f8 contains 39 entries: 2025-05-07T20:11:06.8864272Z Tag Type Name/Value 2025-05-07T20:11:06.8864470Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:06.8864702Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:06.8864979Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_training_backward.so] 2025-05-07T20:11:06.8865184Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:06.8865407Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:06.8865618Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:06.8865823Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:11:06.8866034Z 0x0000000000000001 (NEEDED) Shared 
library: [libstdc++.so.6] 2025-05-07T20:11:06.8866231Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:06.8866419Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:06.8866643Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:11:06.8866922Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward_vbe.so] 2025-05-07T20:11:06.8867256Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:11:06.8867432Z 0x000000000000000c (INIT) 0x1c3000 2025-05-07T20:11:06.8867555Z 0x000000000000000d (FINI) 0xf0879c 2025-05-07T20:11:06.8867679Z 0x0000000000000019 (INIT_ARRAY) 0x14856518 2025-05-07T20:11:06.8867804Z 0x000000000000001b (INIT_ARRAYSZ) 680 (bytes) 2025-05-07T20:11:06.8867938Z 0x000000000000001a (FINI_ARRAY) 0x148567c0 2025-05-07T20:11:06.8868098Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:06.8868207Z 0x0000000000000004 (HASH) 0x238 2025-05-07T20:11:06.8868336Z 0x000000006ffffef5 (GNU_HASH) 0x4b88 2025-05-07T20:11:06.8868469Z 0x0000000000000005 (STRTAB) 0x1fa30 2025-05-07T20:11:06.8868580Z 0x0000000000000006 (SYMTAB) 0xa208 2025-05-07T20:11:06.8868773Z 0x000000000000000a (STRSZ) 1419969 (bytes) 2025-05-07T20:11:06.8868894Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:06.8869010Z 0x0000000000000003 (PLTGOT) 0x148574a8 2025-05-07T20:11:06.8869149Z 0x0000000000000002 (PLTRELSZ) 18120 (bytes) 2025-05-07T20:11:06.8869262Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:06.8869375Z 0x0000000000000017 (JMPREL) 0x1bded8 2025-05-07T20:11:06.8869488Z 0x0000000000000007 (RELA) 0x17c2e0 2025-05-07T20:11:06.8869638Z 0x0000000000000008 (RELASZ) 269304 (bytes) 2025-05-07T20:11:06.8869755Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:06.8869856Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:06.8869989Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:06.8870118Z 0x000000006ffffffe (VERNEED) 0x17c1a0 
2025-05-07T20:11:06.8870229Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:11:06.8870352Z 0x000000006ffffff0 (VERSYM) 0x17a4f2 2025-05-07T20:11:06.8870483Z 0x000000006ffffff9 (RELACOUNT) 7406 2025-05-07T20:11:06.8870582Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:06.8870591Z 2025-05-07T20:11:06.8870706Z ################################################################################ 2025-05-07T20:11:06.8870711Z 2025-05-07T20:11:06.8870715Z 2025-05-07T20:11:06.8870943Z [CHECK] Verifying sample subset of symbols in the built libraries ... 2025-05-07T20:11:06.8982238Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.9/cmake-build/asmjit.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:11:06.9008483Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_config.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:11:06.9224520Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:11:06.9260699Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_common.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:11:06.9309680Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_cache.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:11:06.9341938Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_optimizers.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:11:06.9377449Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_utils.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:11:06.9408427Z [CHECK] Found symbol in ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_sparse_async_cumsum.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:11:06.9519733Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.9/cmake-build/asmjit.so: 
fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:06.9542890Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_config.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:06.9769502Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:06.9804881Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_common.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:06.9855039Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_cache.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:06.9887349Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_optimizers.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:06.9921542Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_utils.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:06.9949683Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_sparse_async_cumsum.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:07.0338968Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:07.0690435Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_inference.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:07.0873049Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:07.1783483Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_forward.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:07.1814230Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_embedding_inplace_ops.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:07.1892364Z [CHECK] Symbol NOT found in 
./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_index_select.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:07.2199091Z [CHECK] Found symbol in ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_py.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:07.2200977Z ################################################################################ 2025-05-07T20:11:07.2202518Z [BUILD] Wheel Audit: dist/fbgemm_gpu_nightly-2025.5.7-cp39-cp39-manylinux_2_28_x86_64.whl 2025-05-07T20:11:07.2203531Z 2025-05-07T20:11:07.2204107Z + conda run --no-capture-output -n build_binary auditwheel show dist/fbgemm_gpu_nightly-2025.5.7-cp39-cp39-manylinux_2_28_x86_64.whl 2025-05-07T20:11:07.2204656Z 2025-05-07T20:11:19.2541742Z 2025-05-07T20:11:19.2542623Z fbgemm_gpu_nightly-2025.5.7-cp39-cp39-manylinux_2_28_x86_64.whl is 2025-05-07T20:11:19.2543304Z consistent with the following platform tag: "linux_x86_64". 2025-05-07T20:11:19.2543616Z 2025-05-07T20:11:19.2543800Z The wheel references external versioned symbols in these 2025-05-07T20:11:19.2544314Z system-provided shared libraries: libgcc_s.so.1 with versions 2025-05-07T20:11:19.2544811Z {'GCC_3.4', 'GCC_3.0'}, libstdc++.so.6 with versions {'CXXABI_1.3.5', 2025-05-07T20:11:19.2545359Z 'GLIBCXX_3.4.15', 'GLIBCXX_3.4.18', 'CXXABI_1.3.7', 'CXXABI_1.3.11', 2025-05-07T20:11:19.2545870Z 'GLIBCXX_3.4.14', 'GLIBCXX_3.4.21', 'GLIBCXX_3.4.11', 'CXXABI_1.3.9', 2025-05-07T20:11:19.2546365Z 'GLIBCXX_3.4.20', 'CXXABI_1.3', 'CXXABI_1.3.8', 'GLIBCXX_3.4.19', 2025-05-07T20:11:19.2546866Z 'GLIBCXX_3.4.29', 'GLIBCXX_3.4.9', 'GLIBCXX_3.4', 'CXXABI_1.3.3'}, 2025-05-07T20:11:19.2547357Z libc.so.6 with versions {'GLIBC_2.14', 'GLIBC_2.2.5'}, libm.so.6 with 2025-05-07T20:11:19.2547860Z versions {'GLIBC_2.2.5'}, libcudart.so.12 with versions 2025-05-07T20:11:19.2548237Z {'libcudart.so.12'} 2025-05-07T20:11:19.2548421Z 2025-05-07T20:11:19.2548640Z This constrains the platform tag to "manylinux_2_34_x86_64". 
In order 2025-05-07T20:11:19.2549206Z to achieve a more compatible tag, you would need to recompile a new 2025-05-07T20:11:19.2549711Z wheel from source on a system with earlier versions of these 2025-05-07T20:11:19.2550160Z libraries, such as a recent manylinux image. 2025-05-07T20:11:19.3441277Z 2025-05-07T20:11:19.3441297Z 2025-05-07T20:11:19.3442299Z ################################################################################ 2025-05-07T20:11:19.3442866Z [BUILD] Enumerating the built wheels ... 2025-05-07T20:11:19.3443498Z + ls -lth dist/fbgemm_gpu_nightly-2025.5.7-cp39-cp39-manylinux_2_28_x86_64.whl 2025-05-07T20:11:19.3443853Z 2025-05-07T20:11:19.3461356Z -rw-r--r--. 1 root root 511M May 7 20:11 dist/fbgemm_gpu_nightly-2025.5.7-cp39-cp39-manylinux_2_28_x86_64.whl 2025-05-07T20:11:19.3461860Z 2025-05-07T20:11:19.3462164Z [BUILD] Enumerating the wheel SHAs ... 2025-05-07T20:11:19.3462683Z + sha1sum dist/fbgemm_gpu_nightly-2025.5.7-cp39-cp39-manylinux_2_28_x86_64.whl 2025-05-07T20:11:19.3463058Z 2025-05-07T20:11:20.3060363Z 62e56cdc76de05aeb6c63d633cf915020e4286fc dist/fbgemm_gpu_nightly-2025.5.7-cp39-cp39-manylinux_2_28_x86_64.whl 2025-05-07T20:11:20.3062018Z 2025-05-07T20:11:20.3062784Z + sha256sum dist/fbgemm_gpu_nightly-2025.5.7-cp39-cp39-manylinux_2_28_x86_64.whl 2025-05-07T20:11:20.3064246Z 2025-05-07T20:11:22.5342382Z 8328af3aadb2c91b3f68117939dd362dc61ffce9adeaba2897d4cf5858e520e9 dist/fbgemm_gpu_nightly-2025.5.7-cp39-cp39-manylinux_2_28_x86_64.whl 2025-05-07T20:11:22.5343080Z 2025-05-07T20:11:22.5343340Z + md5sum dist/fbgemm_gpu_nightly-2025.5.7-cp39-cp39-manylinux_2_28_x86_64.whl 2025-05-07T20:11:22.5343746Z 2025-05-07T20:11:23.3906892Z 087e6f774581b97c394c6d09a25f633e dist/fbgemm_gpu_nightly-2025.5.7-cp39-cp39-manylinux_2_28_x86_64.whl 2025-05-07T20:11:23.3908365Z 2025-05-07T20:11:23.3908786Z [BUILD] FBGEMM-GPU build + package completed 2025-05-07T20:11:23.4021511Z ##[group]Run actions/upload-artifact@v4 2025-05-07T20:11:23.4021876Z with: 
2025-05-07T20:11:23.4022170Z name: fbgemm_default_x86_gcc_py3.9_cu12.6.3.whl 2025-05-07T20:11:23.4022530Z path: fbgemm_gpu/dist/*.whl 2025-05-07T20:11:23.4022878Z if-no-files-found: error 2025-05-07T20:11:23.4023159Z compression-level: 6 2025-05-07T20:11:23.4023455Z overwrite: false 2025-05-07T20:11:23.4023741Z include-hidden-files: false 2025-05-07T20:11:23.4024023Z env: 2025-05-07T20:11:23.4024289Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T20:11:23.4024625Z BUILD_ENV: build_binary 2025-05-07T20:11:23.4024916Z BUILD_TARGET: default 2025-05-07T20:11:23.4025170Z BUILD_VARIANT: cuda 2025-05-07T20:11:23.4025452Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T20:11:23.4025724Z ##[endgroup] 2025-05-07T20:11:23.4029764Z ##[command]/usr/bin/docker exec 8b1652485f5b621f7f0b78973faa33c57195fcb1b1a66550db5cff7ec6c55546 sh -c "cat /etc/*release | grep ^ID" 2025-05-07T20:11:23.8711971Z With the provided path, there will be 1 file uploaded 2025-05-07T20:11:23.8714140Z Artifact name is valid! 2025-05-07T20:11:23.8715534Z Root directory input is valid! 
2025-05-07T20:11:23.9481642Z Beginning upload of artifact content to blob storage 2025-05-07T20:11:24.5991981Z Uploaded bytes 8388608 2025-05-07T20:11:24.8605750Z Uploaded bytes 16777216 2025-05-07T20:11:25.1077092Z Uploaded bytes 25165824 2025-05-07T20:11:25.4722019Z Uploaded bytes 33554432 2025-05-07T20:11:25.7733640Z Uploaded bytes 41943040 2025-05-07T20:11:26.0459172Z Uploaded bytes 50331648 2025-05-07T20:11:26.3909497Z Uploaded bytes 58720256 2025-05-07T20:11:26.6949549Z Uploaded bytes 67108864 2025-05-07T20:11:27.0062728Z Uploaded bytes 75497472 2025-05-07T20:11:27.3401542Z Uploaded bytes 83886080 2025-05-07T20:11:27.7020531Z Uploaded bytes 92274688 2025-05-07T20:11:27.9591189Z Uploaded bytes 100663296 2025-05-07T20:11:28.3102005Z Uploaded bytes 109051904 2025-05-07T20:11:28.7366032Z Uploaded bytes 117440512 2025-05-07T20:11:29.0765410Z Uploaded bytes 125829120 2025-05-07T20:11:29.2193519Z Uploaded bytes 134217728 2025-05-07T20:11:29.5302029Z Uploaded bytes 142606336 2025-05-07T20:11:29.8111984Z Uploaded bytes 150994944 2025-05-07T20:11:30.1896431Z Uploaded bytes 159383552 2025-05-07T20:11:30.4724271Z Uploaded bytes 167772160 2025-05-07T20:11:30.7823246Z Uploaded bytes 176160768 2025-05-07T20:11:31.1406991Z Uploaded bytes 184549376 2025-05-07T20:11:31.4953249Z Uploaded bytes 192937984 2025-05-07T20:11:31.7583496Z Uploaded bytes 201326592 2025-05-07T20:11:32.1133471Z Uploaded bytes 209715200 2025-05-07T20:11:32.3960591Z Uploaded bytes 218103808 2025-05-07T20:11:32.7352499Z Uploaded bytes 226492416 2025-05-07T20:11:33.0240348Z Uploaded bytes 234881024 2025-05-07T20:11:33.3130589Z Uploaded bytes 243269632 2025-05-07T20:11:33.6068200Z Uploaded bytes 251658240 2025-05-07T20:11:33.8620255Z Uploaded bytes 260046848 2025-05-07T20:11:34.1258758Z Uploaded bytes 268435456 2025-05-07T20:11:34.4624372Z Uploaded bytes 276824064 2025-05-07T20:11:34.7405136Z Uploaded bytes 285212672 2025-05-07T20:11:35.0157947Z Uploaded bytes 293601280 2025-05-07T20:11:35.4001575Z Uploaded 
bytes 301989888 2025-05-07T20:11:35.6779169Z Uploaded bytes 310378496 2025-05-07T20:11:36.0053255Z Uploaded bytes 318767104 2025-05-07T20:11:36.3247345Z Uploaded bytes 327155712 2025-05-07T20:11:36.5630834Z Uploaded bytes 335544320 2025-05-07T20:11:36.9319863Z Uploaded bytes 343932928 2025-05-07T20:11:37.2815006Z Uploaded bytes 352321536 2025-05-07T20:11:37.6128449Z Uploaded bytes 360710144 2025-05-07T20:11:37.8710895Z Uploaded bytes 369098752 2025-05-07T20:11:38.2180483Z Uploaded bytes 377487360 2025-05-07T20:11:38.5151353Z Uploaded bytes 385875968 2025-05-07T20:11:38.8878710Z Uploaded bytes 394264576 2025-05-07T20:11:39.2122447Z Uploaded bytes 402653184 2025-05-07T20:11:39.4560011Z Uploaded bytes 411041792 2025-05-07T20:11:39.8301576Z Uploaded bytes 419430400 2025-05-07T20:11:40.1207881Z Uploaded bytes 427819008 2025-05-07T20:11:40.3858071Z Uploaded bytes 436207616 2025-05-07T20:11:40.7136749Z Uploaded bytes 444596224 2025-05-07T20:11:40.9951155Z Uploaded bytes 452984832 2025-05-07T20:11:41.3138226Z Uploaded bytes 461373440 2025-05-07T20:11:41.6816138Z Uploaded bytes 469762048 2025-05-07T20:11:41.9160392Z Uploaded bytes 478150656 2025-05-07T20:11:42.1958016Z Uploaded bytes 486539264 2025-05-07T20:11:42.4614435Z Uploaded bytes 494927872 2025-05-07T20:11:42.7581544Z Uploaded bytes 503316480 2025-05-07T20:11:43.0785540Z Uploaded bytes 511705088 2025-05-07T20:11:43.3619923Z Uploaded bytes 520093696 2025-05-07T20:11:43.5193922Z Uploaded bytes 524562427 2025-05-07T20:11:43.5353119Z Finished uploading artifact content to blob storage! 2025-05-07T20:11:43.5354423Z SHA256 digest of uploaded artifact zip is a3d451be0b1d859d3b5e8a0a5710537ab22d26aaa01cf566c5672e04cf24cc2d 2025-05-07T20:11:43.5355647Z Finalizing artifact upload 2025-05-07T20:11:43.6399935Z Artifact fbgemm_default_x86_gcc_py3.9_cu12.6.3.whl.zip successfully finalized. Artifact ID 3081458740 2025-05-07T20:11:43.6402757Z Artifact fbgemm_default_x86_gcc_py3.9_cu12.6.3.whl has been successfully uploaded! 
Final size is 524562427 bytes. Artifact ID is 3081458740 2025-05-07T20:11:43.6406382Z Artifact download URL: https://github.com/pytorch/FBGEMM/actions/runs/14891846252/artifacts/3081458740 2025-05-07T20:11:43.6641024Z Post job cleanup. 2025-05-07T20:11:43.6645748Z ##[command]/usr/bin/docker exec 8b1652485f5b621f7f0b78973faa33c57195fcb1b1a66550db5cff7ec6c55546 sh -c "cat /etc/*release | grep ^ID" 2025-05-07T20:11:43.9531271Z [command]/usr/bin/git version 2025-05-07T20:11:43.9568417Z git version 2.47.1 2025-05-07T20:11:43.9598305Z Copying '/github/home/.gitconfig' to '/__w/_temp/978e0d29-7f5a-40e1-a744-f1bfbd08fbc1/.gitconfig' 2025-05-07T20:11:43.9606295Z Temporarily overriding HOME='/__w/_temp/978e0d29-7f5a-40e1-a744-f1bfbd08fbc1' before making global git config changes 2025-05-07T20:11:43.9607143Z Adding repository directory to the temporary git global config as a safe directory 2025-05-07T20:11:43.9611176Z [command]/usr/bin/git config --global --add safe.directory /__w/FBGEMM/FBGEMM 2025-05-07T20:11:43.9646186Z [command]/usr/bin/git config --local --name-only --get-regexp core\.sshCommand 2025-05-07T20:11:43.9672879Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'core\.sshCommand' && git config --local --unset-all 'core.sshCommand' || :" 2025-05-07T20:11:43.9947923Z Entering 'external/asmjit' 2025-05-07T20:11:44.0000376Z Entering 'external/composable_kernel' 2025-05-07T20:11:44.0058356Z Entering 'external/cpuinfo' 2025-05-07T20:11:44.0103996Z Entering 'external/cutlass' 2025-05-07T20:11:44.0166802Z Entering 'external/googletest' 2025-05-07T20:11:44.0216692Z Entering 'external/hipify_torch' 2025-05-07T20:11:44.0261934Z Entering 'external/json' 2025-05-07T20:11:44.0327037Z [command]/usr/bin/git config --local --name-only --get-regexp http\.https\:\/\/github\.com\/\.extraheader 2025-05-07T20:11:44.0343680Z http.https://github.com/.extraheader 2025-05-07T20:11:44.0351452Z [command]/usr/bin/git config --local 
--unset-all http.https://github.com/.extraheader 2025-05-07T20:11:44.0376953Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'http\.https\:\/\/github\.com\/\.extraheader' && git config --local --unset-all 'http.https://github.com/.extraheader' || :" 2025-05-07T20:11:44.0646012Z Entering 'external/asmjit' 2025-05-07T20:11:44.0678290Z http.https://github.com/.extraheader 2025-05-07T20:11:44.0703914Z Entering 'external/composable_kernel' 2025-05-07T20:11:44.0755186Z http.https://github.com/.extraheader 2025-05-07T20:11:44.0801057Z Entering 'external/cpuinfo' 2025-05-07T20:11:44.0829907Z http.https://github.com/.extraheader 2025-05-07T20:11:44.0870613Z Entering 'external/cutlass' 2025-05-07T20:11:44.0918234Z http.https://github.com/.extraheader 2025-05-07T20:11:44.0966591Z Entering 'external/googletest' 2025-05-07T20:11:44.0999725Z http.https://github.com/.extraheader 2025-05-07T20:11:44.1029261Z Entering 'external/hipify_torch' 2025-05-07T20:11:44.1073222Z http.https://github.com/.extraheader 2025-05-07T20:11:44.1118323Z Entering 'external/json' 2025-05-07T20:11:44.1147899Z http.https://github.com/.extraheader 2025-05-07T20:11:44.1316110Z Stop and remove container: 150a28556a5a407db4177bd68a8ce0e0_amazonlinux2023_748304 2025-05-07T20:11:44.1321506Z ##[command]/usr/bin/docker rm --force 8b1652485f5b621f7f0b78973faa33c57195fcb1b1a66550db5cff7ec6c55546 2025-05-07T20:11:44.8514561Z 8b1652485f5b621f7f0b78973faa33c57195fcb1b1a66550db5cff7ec6c55546 2025-05-07T20:11:44.8544547Z Remove container network: github_network_7a3f3acfbff84df5a5ba037f14b56b19 2025-05-07T20:11:44.8548987Z ##[command]/usr/bin/docker network rm github_network_7a3f3acfbff84df5a5ba037f14b56b19 2025-05-07T20:11:45.8231407Z github_network_7a3f3acfbff84df5a5ba037f14b56b19 2025-05-07T20:11:45.8266318Z A job completed hook has been configured by the self-hosted runner administrator 2025-05-07T20:11:45.8286332Z ##[group]Run 
'/home/ec2-user/runner-scripts/after_job.sh' 2025-05-07T20:11:45.8292555Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-05-07T20:11:45.8292979Z ##[endgroup] 2025-05-07T20:11:45.8402597Z [!ALERT!] Swap in detected! [!ALERT!] 2025-05-07T20:11:55.9000737Z [!ALERT!] Swap out detected [!ALERT!] 2025-05-07T20:12:11.8673788Z Cleaning up orphan processes